It's the Chinese Room argument all over again. People hear "predicting the next token" and all they can imagine is some sort of statistical database lookup. This ignores the fact that when you have a huge corpus of data with incredibly complex internal correlations, and all that data also happens to correlate with some unknown external thing, it's almost certain that a powerful learner will end up modeling that external thing if and when doing so yields a quantum leap in prediction performance! A model that includes the external-thing hypothesis will almost certainly be simpler, ceteris paribus, than a model that doesn't.
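
Here's a toy sketch of that last point (my own illustration, not anyone's actual system; the generator, its parameters, and both predictors are hypothetical). Tokens are emitted by a hidden two-state process. One predictor memorizes surface bigram statistics; the other explicitly tracks the hidden state with a Bayes filter. The filter "cheats" by being handed the true parameters, because the question here isn't learning, it's which hypothesis predicts better once you have it:

```python
import math
import random

random.seed(0)

# Hidden binary state that flips rarely and emits noisy tokens.
# (Parameters chosen arbitrarily for the demo.)
STAY, EMIT = 0.95, 0.9          # P(state persists), P(token == state)

def generate(n):
    state, tokens = 0, []
    for _ in range(n):
        if random.random() > STAY:
            state = 1 - state
        tokens.append(state if random.random() < EMIT else 1 - state)
    return tokens

train, test = generate(50_000), generate(10_000)

# Surface model: bigram counts over tokens, no notion of a hidden state.
counts = [[1, 1], [1, 1]]        # Laplace smoothing
for prev, nxt in zip(train, train[1:]):
    counts[prev][nxt] += 1

def bigram_logloss(tokens):
    loss = 0.0
    for prev, nxt in zip(tokens, tokens[1:]):
        row = counts[prev]
        loss -= math.log2(row[nxt] / (row[0] + row[1]))
    return loss / (len(tokens) - 1)

# Latent model: a two-state Bayes filter tracking P(hidden state == 1).
def filter_logloss(tokens):
    belief, loss = 0.5, 0.0
    for tok in tokens:
        # Predictive probability of the next token under the current belief.
        p1 = belief * EMIT + (1 - belief) * (1 - EMIT)
        loss -= math.log2(p1 if tok == 1 else 1 - p1)
        # Condition on the observed token (Bayes update)...
        like1 = EMIT if tok == 1 else 1 - EMIT
        belief = belief * like1 / (belief * like1 + (1 - belief) * (1 - like1))
        # ...then propagate the belief through the (rare) state flip.
        belief = belief * STAY + (1 - belief) * (1 - STAY)
    return loss / len(tokens)

print(f"bigram log-loss: {bigram_logloss(test):.4f} bits/token")
print(f"latent log-loss: {filter_logloss(test):.4f} bits/token")
```

The bigram table and the latent model have the same number of free parameters, yet the filter gets a lower log-loss on held-out tokens because it models the generator rather than its shadow on the token statistics. An n-gram model could close the gap by lengthening its context, but only at the cost of exponentially many parameters, which is exactly the "simpler model" half of the claim.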