Where the Goblins Came From

Apr 30, 2026 · 3:21 AM UTC · 5 min read

How goblin outputs spread in AI models: timeline, root cause, and fixes behind personality-driven quirks in GPT-5 behavior.

Original article

OpenAI

Opening excerpt (first ~120 words)

April 29, 2026 · Publication

Starting with GPT‑5.1, our models began developing a strange habit: they increasingly mentioned goblins, gremlins, and other creatures in their metaphors. Unlike model bugs that show up through a tanking eval or a spiking training metric and point back to a specific change, this one crept in subtly. A single “little goblin” in an answer could be harmless, even charming. Across model generations, though, the habit became hard to miss: the goblins kept multiplying, and we needed to figure out where they came from.

In early testing, GPT‑5.5 in Codex showed an odd affinity for goblin metaphors.

The short answer is that model behavior is shaped by many small incentives.

…

Excerpt limited to ~120 words for fair-use compliance. The full article is at OpenAI.
