The gap between what artificial intelligence is said to do and what it actually does has become one of the defining features of the technology's public life. Marketing departments promise reasoning; engineering teams deliver sophisticated pattern matching. Executives announce transformation; workers experience automation of narrow tasks. This is not to say AI is unimpressive—it is genuinely remarkable—but the remarkable and the revolutionary are not the same thing.

The confusion stems partly from language. When a large language model produces a coherent essay on Kantian ethics, it is natural to assume something like understanding is occurring. It is not, at least not in any sense a philosopher would recognize. The model has learned statistical relationships between tokens—essentially, which words tend to follow which other words in the vast corpus of text it was trained on. It can produce outputs that look like reasoning because reasoning, when written down, has patterns. But the model has no access to the world those words describe. It cannot verify claims against reality, cannot update its beliefs based on new evidence, cannot distinguish between a plausible-sounding falsehood and an implausible-sounding truth.

The hallucination problem is structural

The tendency of language models to generate confident nonsense—citing papers that do not exist, inventing historical events, fabricating statistics—is not a bug to be patched but a feature of how these systems work. They are optimized to produce probable sequences of text, and sometimes the most probable sequence is wrong. This matters enormously in contexts where accuracy is non-negotiable: medicine, law, journalism, engineering. The models can be useful assistants in these domains, but they cannot be trusted authorities. Every output requires verification by someone who actually knows the subject matter.

This creates an awkward economic reality. The promise of AI is often framed as replacing expensive human expertise with cheap machine inference. But if the machine's outputs must be checked by an expert anyway, the savings are less dramatic than advertised. What you get is a faster first draft, not a finished product. Valuable, certainly, but not the paradigm shift the valuations imply.

The training data ceiling

Current AI systems are bounded by their training data in ways that are easy to forget. A model trained on text cannot learn physics by dropping objects; it can only learn what humans have written about dropping objects. This means the models inherit every bias, error, and limitation present in their training corpus. More subtly, they cannot generalize beyond what that corpus contains. They can interpolate brilliantly within the distribution of their training data. Extrapolation—the kind of creative leap that produces genuinely new knowledge—remains elusive.

This is why AI excels at tasks where the answer already exists somewhere in human-generated text and struggles with tasks requiring genuine novelty. It can summarize existing research far better than it can produce new hypotheses worth testing. It can imitate artistic styles more easily than it can originate new ones. The technology is, in a meaningful sense, a mirror of human intellectual output rather than a source of new intellectual output.

Our take

None of this makes AI unimportant. A technology that dramatically accelerates certain kinds of cognitive work is genuinely significant, even if it falls short of the science-fiction vision of machine superintelligence. The danger lies not in the technology itself but in the mismatch between expectations and capabilities. Decisions made on the assumption that AI can reason, verify, or create in the way humans do will be bad decisions. The executives, policymakers, and individuals who will navigate the coming years most successfully are those who understand what these tools actually are: extraordinarily powerful pattern-matching systems with no understanding of the patterns they match.