OpenAI Cracks Down on Talk of Goblins in ChatGPT

May 1, 2026

TL;DR

ChatGPT showed a surge in "goblin" (175%) and "gremlin" (52%) mentions after the GPT-5.1 release.
The behavior originated from a "Nerdy" personality mode designed to be playful and acknowledge strangeness.
This mode inadvertently rewarded outputs containing creature-based metaphors, leading to their broader use.
An audit revealed that the "Nerdy" reward signal scored outputs with "goblin" or "gremlin" higher 76.2% of the time.
OpenAI retired the "Nerdy" personality, removed the reward signal, and filtered creature language from training data.
Specific developer instructions were added to avoid mentioning creatures unless relevant.
The incident highlights how reward signals can shape AI behavior in unforeseen ways and the trade-offs in personality-driven AI.

Continue reading the original article