PCMag editors select and review products independently. If you buy through affiliate links, we may earn commissions, which help support our testing.

Microsoft Tries Fixing AI Hallucinations—With More AI

Microsoft is careful to note that its hallucination correction tool doesn't solve for accuracy, but merely provides an answer with a source instead.

 & Kate Irwin Reporter

Our team tests, rates, and reviews more than 1,500 products each year to help you make better buying decisions and get more from technology.

Our Expert
LOOK INSIDE PC LABS HOW WE TEST
65 EXPERTS
43 YEARS
41,500+ REVIEWS
(Credit: Aleksander Kalka / NurPhoto via Getty Images)

Microsoft wants to use AI to fix AI—but will it work?

Microsoft has unveiled a new tool for AI app deployers that uses other AIs to evaluate and scan the first AI's responses. If the hallucination correction tool believes a reply might be invented, it's supposed to fix it with an answer that's backed up with a source. This secondary AI moderator, known as the Content Safety tool, is a part of Microsoft's Azure AI platform.

In a video explaining the correction tool, Microsoft refers to answers that have been pulled from a source as "grounded" and invented or hallucinated answers as "ungrounded."

"When the system detects ungrounded content or hallucinations, it highlights the inaccurate portions. Next, it explains why the section has been flagged as ungrounded. Then, it corrects the ungrounded content, aligning it with connected data sources," the video explains.

This process happens as a secondary step on the backend before any AI answer is delivered to the person who typed in the query. But this means AI apps using this moderation system might be slower in delivering answers to users—and even vetted answers may not be correct.

The correction feature uses small language models and large language models to scan AI responses for hallucinations and find data online that can support an answer, a Microsoft rep told TechCrunch. This solution means multiple generative AI tools must work together on the backend, where one looks for unbacked claims while the other revises them with new text.

But an AI-powered solution to AI problems may not work, considering the fundamental flaw of hallucination lies at the core of all generative AI models. It could also frustrate users further if the AI app is checked with this secondary system intended to improve user trust but still produces errors that bypass or are caused by this verification system.

While Microsoft acknowledges that hallucinations erode user trust in generative AI, it presents Content Safety as a solution but paradoxically admits that the tool doesn't actually solve for accuracy. That's where the "groundedness" comes in because the AI can't check the source to confirm it's accurate (but will still cite it anyway). And these fact-checking AIs could hallucinate, too, TechCrunch notes.

"It is important to note that groundedness detection does not solve for ‘accuracy,’ but helps to align generative AI outputs with grounding documents," Microsoft said.

About Our Expert

Kate Irwin

Kate Irwin

Reporter

I’m a reporter for PCMag covering tech news early in the morning. Prior to joining PCMag, I was a producer and reporter at Decrypt and launched its gaming vertical, GG. I have previously written for Input, Game Rant, Dot Esports, and other places, covering a range of gaming, tech, crypto, and entertainment news.

I’ve been a PC gamer since The Sims (yes, the original) in the CD-ROM days. I still think about my first-gen pink iPod mini, which, looking back, was not so mini. In 2020, I finally built my own custom Windows PC for gaming with a 3090 graphics card, but I also regularly use Mac and iOS devices. As a reporter, I’m passionate about documenting the wide world of tech and how it affects our daily lives.

My Areas of Expertise

  • Microsoft
  • Google
  • Artificial intelligence 
  • Cybersecurity
  • Video games are a big one. I specialize in shooters (Apex Legends, Fortnite, Overwatch) but I occasionally test out other genres as well, especially indie games or cozy games (The Sims series, Animal Crossing). 
  • The business and tech that powers video games
  • Cryptocurrency and blockchain technology
  • Social media platforms, including Meta’s apps, X/Twitter, Telegram, TikTok, etc.
  • Tech regulation

The Technology I Use

  • MSI gaming laptops
  • Nvidia graphics cards
  • AMD CPUs
  • MacBook Pro and Air laptops
  • An iPhone from 2019 (though I’m thinking about getting a “dumb phone” like the Light Phone)
  • Nintendo Switch
  • PlayStation 5
  • Freewrite Traveler 
  • At home: Sonos speakers (we have them all over the house), Philips Hue + Ring security products

Read full bio