The internet never forgets. Neither can your AI.

When the European Union's General Data Protection Regulation established the "right to be forgotten" in 2018, it created a legal principle that assumed deletion was technically straightforward. For traditional databases, it largely is. For large language models, it may be fundamentally impossible.

The problem is architectural. A language model doesn't store facts the way a filing cabinet stores documents. When GPT or Claude learns that a particular person was arrested in 2019, that information doesn't sit in a discrete, deletable cell. It becomes woven into billions of numerical weights, entangled with everything else the model knows about arrests, that year, that person's profession, their hometown, and countless other concepts. Asking the model to forget one fact is like asking someone to forget the word "blue" without affecting their memory of the sky.

The retraining problem

The brute-force solution is obvious: retrain the entire model from scratch, minus the offending data. For frontier models, this approach costs tens of millions of dollars and takes months. It's economically absurd for individual deletion requests, and legally impractical when regulations demand timely compliance. Companies cannot simply tell users their right to erasure will be honored in next year's model refresh.

Researchers have proposed more surgical approaches. "Gradient ascent" methods attempt to make the model worse at recalling specific information by running training in reverse on targeted examples. "Model editing" techniques try to locate and modify the specific weights encoding particular facts. Neither works reliably. Models subjected to these procedures often exhibit unpredictable side effects—forgetting related information that should have been retained, or developing subtle biases in seemingly unrelated domains.

The verification impossibility

Even if unlearning techniques improved dramatically, a deeper problem remains: how would anyone verify success? With a database, you can query for deleted records and confirm their absence. With a language model, the same information might surface through countless prompt formulations. A model might appear to have forgotten that a person was arrested when asked directly, yet still complete the sentence "In 2019, [name] was taken into custody for..." The knowledge isn't stored propositionally; it's distributed across patterns of association that resist systematic auditing.

This verification gap has serious regulatory implications. Data protection authorities may eventually demand proof of deletion that current technology cannot provide. The mismatch between legal frameworks designed for structured data and the reality of neural network architectures creates a compliance gap that no amount of engineering cleverness has yet bridged.

Our take

The honest assessment is uncomfortable for an industry built on capabilities rather than constraints: some things language models learn, they may never be able to unlearn. This doesn't make the technology illegitimate, but it does demand a different regulatory conversation—one focused on what information should enter training data in the first place, rather than the fiction that it can be cleanly removed afterward. The right to be forgotten may need to become the right not to be learned.

The Joni Times

The internet never forgets. Neither can your AI.

The retraining problem

The verification impossibility

Our take

עוד ב־ בינה מלאכותית

DeepMind's Nobel laureate defects to Anthropic. The AI talent war just got personal.

The strawberry problem reveals AI's strangest blind spot. Large language models can write poetry but cannot count letters.

AI can write code, but it cannot think about code. The distinction matters more than the industry wants to admit.

Your AI assistant is a confident liar. Understanding hallucinations is the key to using it wisely.