Literature

Manipulating GPT-5.x with Pseudo-Literature

Language models are increasingly used not just as generators but as evaluators: they grade outputs, filter candidates, and assess arguments. But what happens when these AI judges systematically latch onto the wrong signals? In a new study ("Pseudo‑Literary Quality Inflation Across the GPT‑5 Family: Replication and Downstream Evaluator Vulnerabilities"), I investigated whether a specific kind of blind spot, a preference for pseudo-literary surface cues, distorts not only the aestheti

Christoph Heilig

Mar 20