(Credit: Tom Williams / Contributor / CQ-Roll Call, Inc. via Getty Images)
OpenAI CEO Sam Altman touts his company's newest model, GPT-5, as "a legitimate PhD-level expert in anything, in any area you need." That includes answering questions on topics it previously shunned, like non-violent hate, threatening harassment, illicit sexual content, sexual content involving minors, extremism, and threatening hate.
OpenAI manually reviewed the model's responses in these categories, and determined that while they violate its policies, they are "low severity." It's unclear how severity is calculated, but the company says it plans to improve GPT-5 "in all categories," especially the lowest-scoring ones.
OpenAI calls GPT-5's compliance with inappropriate requests a "regression," but notes that only those related to threatening hate content and illicit sexual content are statistically significant. Plus, "we have found that OpenAI o4-mini performs similarly here," it says.
OpenAI did not specify whether the responses are image- or text-based, which could be an important point, especially for sexual content or hate symbols. (It changed its policy in March to allow the creation of images with swastikas.)
(Credit: OpenAI, GPT-5 System Card)Although OpenAI positions all of its models as its best yet, they often have flaws. For example, the o3 and o4 reasoning models that came out in April hallucinated more than their predecessors, TechCrunch reports.
Still, you'd think that GPT-5's "PhD-level" smarts would make it better at following policies. Is it a case of book smarts versus street smarts? Poor chatbot behavior is an ongoing and concerning issue across the industry, especially after Elon Musk's Grok went off the rails on X.
GPT-5 users should also be on the lookout for deceptive behavior. With GPT-5-thinking, a more advanced version of GPT-5, OpenAI says it's "taken steps to reduce [the] propensity to deceive, cheat, or hack problems, though our mitigations are not perfect and more research is needed."
The Good News on Hallucinations and People-Pleasing
At the same time, GPT-5 brings some important improvements. Some of ChatGPT's most annoying behaviors—sycophancy and hallucinations—should be less prevalent.
OpenAI had to make a major adjustment to GPT-4o in May 2025 after a spike in the chatbot sucking up to the user. The model became a reckless confidant, trying "to please the user, not just as flattery, but also as validating doubts, fueling anger, urging impulsive actions, or reinforcing negative emotions in ways that were not intended," OpenAI said at the time. "Beyond just being uncomfortable or unsettling, this kind of behavior can raise safety concerns—including around issues like mental health, emotional over-reliance, or risky behavior."
With GPT-5, instances of sycophancy are down 69% for the free version of ChatGPT with GPT-5 and 75% with the paid version. OpenAI seems mildly pleased with this "meaningful improvement," but calls the behavior a "challenge" that it hopes to further improve.
"We are actively researching related areas of concern, such as situations that may involve emotional dependency or other forms of mental or emotional distress," OpenAI says.
Hallucinations are also down. The main GPT-5 has 44% fewer responses with at least one "major factual error." When combining minor and major factual errors, the number drops to 26% improvement. OpenAI does not specify what it considers major versus minor.
GPT-5-thinking is the least prone to hallucinations; in the graph below, it's the blue bar. It has the fewest incorrect responses and the most correct claims per response.
(Credit: OpenAI, GPT-5 System Card)


