LLMs aren’t world models

Delete Set public Set private Add tags Delete tags

10977 shaares
392 private links

10977 shaares · 392 private links

Filters

Links per page

20 50 100

LLMs aren’t world models

A friend who plays better chess than me — and knows more math & CS than me - said that he played some moves against a newly released LLM, and it must be at least as good as him. I said, no way, I’m going to cRRRush it, in my best Russian accent. I make a few moves – but unlike him, I don't make good moves1, which would be opening book moves it has seen a million times; I make weak moves, which it hasn't 2. The thing makes decent moves in response, with cheerful commentary about how we're attacking this and developing that — until about move 10, when it tries to move a knight which isn't there, and loses in a few more moves. This was a year or two ago; I’ve just tried this again, and it lost track of the board state by move 9.

we could say that the whole argument that LLMs learn about the world is that they have to understand the world as a side effect of modeling the distribution of text.

LLMs are limited by text inputs: color are numbers, etc...

Ideally, you would want to quantify "how much of the world LLMs model."

“a fundamentally incorrect approach to a problem can be taken very far in practice with sufficient engineering effort.”

Take:

LLMs are not by themselves sufficient as a path to general machine intelligence; in some sense they are a distraction because of how far you can take them despite the approach being fundamentally incorrect.
LLMs will never6 manage to deal with large code bases “autonomously”, because they would need to have a model of the program, and they don’t even learn to track chess pieces having read everything there is to read about chess.
LLMs will never reliably know what they don’t know, or stop making things up.
LLMs will always be able to teach a student complex (standard) curriculum, answer an expert’s question with a useful (known) insight, and yet fail at basic (novel) questions on the same subject, all at the same time.
LLM-style language processing is definitely a part of how human intelligence works — and how human stupidity works.

ai · thoughts · story

August 17, 2025 at 12:39:39 GMT+2 * · permalink

https://yosefk.com/blog/llms-arent-world-models.html

Filters

Links per page

20 50 100