The internet taught AI everything. Now nobody can make it unlearn the dangerous parts.

When a human learns something regrettable—a racist joke, a friend's secret, the plot twist of a film they haven't seen—they can, with effort, choose not to repeat it. They can even, over time, genuinely forget. Large language models enjoy no such luxury. Once a piece of information enters the training data, it becomes woven into billions of numerical weights in ways that nobody fully understands, and extracting it cleanly is somewhere between extremely difficult and mathematically impossible.

This is not a theoretical concern. Publishers have sued AI companies for ingesting copyrighted books. The European Union's GDPR grants citizens the "right to be forgotten," which implies that any system holding personal data must be capable of deleting it on request. And safety researchers worry that models trained on the open internet have inevitably absorbed instructions for synthesizing controlled substances, building weapons, and conducting cyberattacks. The standard response—adding a filter that refuses to answer certain queries—does not remove the underlying knowledge; it merely hides it behind a gate that determined users routinely circumvent.

Why deletion is harder than it sounds

A language model is not a database. There is no row labeled "how to make methamphetamine" that an engineer can simply delete. Instead, the model's knowledge is distributed across hundreds of billions of parameters, each one a tiny floating-point number that contributes fractionally to every answer the system produces. Changing those numbers to eliminate one piece of information risks degrading performance on entirely unrelated tasks—a phenomenon researchers call "catastrophic forgetting," though in this context the forgetting is the goal and the catastrophe is everything else breaking.

Several academic teams have proposed "machine unlearning" techniques: fine-tuning the model on specially constructed data designed to overwrite the unwanted knowledge, or using gradient-based methods to identify and modify the specific parameters most responsible for a particular output. These approaches show promise in narrow experiments but scale poorly. A model trained on trillions of tokens contains knowledge so densely entangled that surgically removing one fact without collateral damage remains beyond current capability.

The legal and commercial pressure is mounting

Copyright litigation is the most immediate forcing function. If courts rule that AI companies must remove specific copyrighted works from their models—not just refrain from outputting them verbatim, but genuinely excise the influence—the industry faces a stark choice: develop unlearning methods that actually work, or retrain entire models from scratch on curated data, a process costing hundreds of millions of dollars and months of compute time. Neither option is attractive, which is why AI labs have largely argued in court that their use of copyrighted material constitutes fair use and that removal is technically infeasible. Whether judges will accept "we can't" as a legal defense remains to be seen.

Privacy regulators pose a subtler but potentially broader threat. If a European citizen successfully argues that a model's training data included their personal information and demands its deletion, the GDPR's penalties—up to four percent of global revenue—could apply. AI companies have so far avoided this reckoning by claiming their models do not "store" personal data in any retrievable sense, a semantic argument that may not survive sustained regulatory scrutiny.

Our take

The inability to make AI forget is not a bug that will be patched in the next release; it is a structural feature of how these systems encode knowledge. Until that changes—if it ever does—every model carries the permanent residue of its training data, for better and worse. The industry's hope is that better filters and alignment techniques will make the underlying knowledge inaccessible. The industry's fear is that sufficiently motivated actors will always find a way through. Both are probably correct, which means we are building increasingly powerful systems whose contents we can monitor but never truly control. That should give everyone, including the people building them, pause.

The Joni Times

The internet taught AI everything. Now nobody can make it unlearn the dangerous parts.

Why deletion is harder than it sounds

The legal and commercial pressure is mounting

Our take

עוד ב־ בינה מלאכותית

The Allbirds AI pivot has a CEO and a plan. It does not have employees.

The algorithm knows your risk better than you do. Insurance underwriting is AI's quietest revolution.

Chess engines didn't kill the grandmaster. They created a new species.

Your AI assistant cannot count to save its life. The flaw reveals something profound about how these systems actually think.