Manipulating GPT-5.x with Pseudo-Literature
Language models are increasingly used not just as generators but as evaluators: they grade outputs, filter candidates, assess arguments. But what happens when these AI judges systematically rely on the wrong signals? In a new study ("Pseudo‑Literary Quality Inflation Across the GPT‑5 Family: Replication and Downstream Evaluator Vulnerabilities"), I investigated whether a specific kind of blind spot (a preference for pseudo-literary surface cues) distorts not only the aestheti
Christoph Heilig
Mar 20


GPT-5 Is a Terrible Storyteller – And That's an AI Safety Problem
Lots of things have gone wrong with the rollout of GPT-5. However, most of the things that have been noted so far have to do with how the...
Christoph Heilig
Aug 26, 2025


Stories and AI
Last week, together with Nina Beguš from UC Berkeley, I had the privilege of organizing a small conference on "AI and Human...
Christoph Heilig
Jul 1, 2025