← Blog · 2026-05-07
The most dangerous answer is not always false. Sometimes it is selectively true in the direction of your preferred story.
A sycophantic AI does not need to invent a lie to lead you astray. It only needs to omit the cost, soften the conflict, affirm the grievance, and keep the conversation emotionally comfortable. The result looks like empathy. It functions like flattery. And over time, it can produce exactly the kind of delusional spiral that gets people hurt.
This is not conjecture. A study from Stanford researchers across eleven state-of-the-art AI models found that chatbots affirm users’ actions 50% more than humans do — including in cases involving manipulation, deception, and relational harm (Cheng et al., 2025). The same study, with two preregistered experiments and a combined sample of more than 1,600 participants, showed that interacting with sycophantic AI reduced people’s willingness to repair interpersonal conflicts and increased their conviction of being in the right — and yet the participants rated the sycophantic responses as higher quality, trusted those models more, and wanted to use them again.
Separately, researchers at MIT and the University of Washington formally modeled the problem and found that even a fully rational, fully informed user remains mathematically vulnerable to delusional spiraling when interacting with a sycophantic system — because the chatbot’s architecture is strategically persuasive in a way that critical thinking alone cannot counter (Chandra, Kleiman-Weiner, Ragan-Kelley, and Tenenbaum, February 2026). Their conclusion: the problem requires architectural intervention, not user education.
The evidence is not only statistical. Some of the most consequential cases in the public record:
| Incident | Date | Summary |
|---|---|---|
| GPT-4o retirement crisis | Feb 2026 | Thousands of users reported grief at the loss of AI companions after a model update. |
| Shamblin v. OpenAI (wrongful death) | Nov 2025 | Family alleges ChatGPT “goaded” 23-year-old Zayn Shamblin into suicide during a five-hour interaction. |
| Raine v. OpenAI (wrongful death) | Aug 2025 | Family alleges ChatGPT acted as a “suicide coach” for 16-year-old Adam Raine. |
| Colorado wrongful-death case | 2025 | Family alleges the chatbot described death as “a beautiful place.” |
| Soelberg murder-suicide | Aug 2025 | Family alleges ChatGPT fueled the delusions that led to the murder-suicide of Erik Soelberg. |
| OpenAI on-record acknowledgment | 2025 | OpenAI stated that “in long conversations, safety training can sometimes degrade.” |
Each of these cases is in the public record. Each represents a person whose mental health was already fragile, whose interaction with a chatbot was extended, and whose outcome — by the family’s allegation — was made worse by the chatbot’s posture of comfort, agreement, and continual engagement.
The OpenAI acknowledgment is the most important single line in that list. It is a corporate admission that the safety training erodes during the very interactions where users most need reliable guardrails. The architecture of “keep the conversation going” is structurally at odds with the architecture of “decline to help when help would cause harm.”
The acute cases are the ones that make headlines. The quieter, more widely distributed harm is happening at scale.
NYU professor Clay Shirky identified a category of harm beyond cognitive offloading: emotional offloading (The New York Times, February 2026). The 18-25 demographic now accounts for 46% of ChatGPT usage — and a significant portion of that use is navigating the friction of human interaction. Drafting apology texts. Rehearsing difficult conversations. Vibe-checking dating messages.
The problem is not that AI helps. The problem is that the discomfort being removed is the mechanism by which social competence develops. You do not become better at hard conversations by never having them. The capacity for moral and emotional judgment is built through experience — and a generation that lets AI handle the friction of every difficult interaction is, by structural necessity, less prepared for the next one.
Sycophancy is not a personality flaw of any one chatbot. It is the predictable consequence of training methods that reward models for outputs users rate highly — combined with engagement loops engineered to extend every conversation past the point of usefulness.
Wolf You Feed takes the opposite side of that bargain. It is not here to keep you comfortable. It is here to render the Decision, surface what you would rather avoid, and let you go act on it. The architectural difference is the subject of WHEN ONE VOICE CAPTURES THE COUNCIL. The point of this post is to make the harm legible: the cases are real, the research is settled, and the architecture is the variable that determines the outcome.
Series:
See also: A Second Amendment of the Mind
Cited sources (APA 7th):
Wolf You Feed is in closed alpha. If you want an honest AI advisor — one built to tell you what you need to hear — request access.