Large language models do not know anything. They predict what word comes next. This distinction sounds pedantic until you ask one for legal citations and it invents case names with perfect formatting, or request a bibliography and receive a list of papers that have never existed. The phenomenon has acquired a clinical-sounding name—hallucination—but the term obscures more than it illuminates. These systems are not malfunctioning when they confabulate. They are doing exactly what they were built to do.

The architecture of plausibility

A language model is, at its core, a probability engine trained on vast swaths of human text. When you prompt it with a question, it does not retrieve an answer from a database. It generates a sequence of tokens—words, essentially—each chosen because it statistically follows from what came before. The model has no internal fact-checker, no mechanism to verify whether the string it produces corresponds to reality. It has learned that academic papers follow certain stylistic conventions, that legal citations have predictable formats, that confident prose reads better than hedged prose. So it produces confident, well-formatted text. Whether the content is true is simply not part of the optimization function.

This explains why hallucinations are so convincing. The model is not guessing randomly; it is drawing on deep statistical regularities in how humans express factual claims. A fabricated Supreme Court case will have a plausible-sounding name, a realistic citation format, and a summary that reads like something a court might actually have said. The model has learned the grammar of legal authority without learning law.

Why retrieval does not solve the problem

The industry's primary response has been retrieval-augmented generation: systems that search external databases before generating text, grounding responses in real documents. This helps, but it does not eliminate the fundamental issue. The model still decides how to synthesize and present retrieved information, and it can still interpolate, extrapolate, or simply misread its sources. More subtly, retrieval systems create a false sense of security. Users assume that because the system has access to real documents, its outputs must be accurate. But the gap between having access to information and correctly understanding it is precisely where human expertise lives.

The epistemological mismatch

The deeper problem is that users bring human expectations to an inhuman system. When a person speaks confidently about a topic, we infer they have knowledge—memories, training, verified understanding. When a language model speaks confidently, it has none of these things. It has patterns. The mismatch is not a bug to be patched but a fundamental feature of how these systems work. They are mirrors reflecting the statistical structure of human expression, not minds that have learned about the world.

This does not make them useless. Pattern-matching is extraordinarily powerful for drafting, brainstorming, translation, code completion, and countless other tasks where the goal is plausible text rather than verified truth. But it does mean that any workflow requiring factual accuracy must treat model outputs as first drafts requiring human verification, not as authoritative sources.

Our take

The term hallucination implies a pathology, something to be cured. But you cannot cure a calculator of its inability to write poetry; the function is simply not there. Language models are brilliant at language and indifferent to truth. The sooner we internalize this distinction, the sooner we can use these tools for what they actually are—powerful pattern engines—rather than what we wish they were. The confident liar is not broken. We just need to stop asking it for facts.