PCMag editors select and review products independently. If you buy through affiliate links, we may earn commissions, which help support our testing.

OpenAI's GPT-4 Can Play Doom, But Only Like a Noob

The AI model gets stuck in corners and ignores enemies, according to a Microsoft scientist who warns that the blunders could allow for 'potential misuse' in more serious applications.

 & Michael Kan Principal Reporter

Our team tests, rates, and reviews more than 1,500 products each year to help you make better buying decisions and get more from technology.

Our Expert
LOOK INSIDE PC LABS HOW WE TEST
65 EXPERTS
43 YEARS
41,500+ REVIEWS
(Credit: Adrian de Wynter)

A Microsoft scientist has concocted a way for OpenAI’s GPT-4 model to play Doom, a game at which the AI apparently does not excel.

Adrian de Wynter published a paper and a blog post documenting his attempts to see if GPT-4 could play the classic PC game. To do so, he tapped GPT-4 with Vision API, which lets the AI model receive visual images and create a corresponding action. 

He then crafted a program to take screenshots from a computer running Doom. The screenshots were fed to GPT-4, with the goal of having it beat the game. The answers were then converted into outputs used to control the game.

(Credit: Adrian de Wynter)

For his experiment, de Wynter gave GPT-4 access to a system of prompt generators, which had received a walkthrough on how to win the game, and could account for past actions taken. The results found that OpenAI’s technology can play Doom to a “passable” degree, meaning the AI is smart enough to traverse the map and shoot at enemies. 

But when compared to the performance of a human, GPT-4 “kinda fails at the game and my grandma has played it wayyy better than this model,” de Wynter wrote in his blog post. For example, the AI model would “sometimes do really dumb things like getting stuck in corners and just punching the wall like an angsty teenager, or shooting point blank explosive barrels,” he said.

In addition, GPT-4 would often ignore enemies and act as if they didn’t exist as soon as they dropped out of its field of view. “Most of the time the model died because it got stuck in a corner and ended up being shot in the back,” de Wynter added. 

The results suggest  GPT-4 can struggle when it comes to long-term reasoning and “object permanence,” or the ability to recognize that an object still exists, even though it can’t be seen. The other problem is that OpenAI’s model sometimes made up, or hallucinated, why it needed to take certain actions while playing the game.  

To be fair, GPT-4 wasn’t designed to play first-person shooters. Still, de Wynter also concluded that OpenAI’s model is smart enough to “operate doors, follow simple objectives, and identify enemies and fire weapons at them” using only simple depictions of a video game world. 

“So, while this is a very interesting exploration around planning and reasoning, and could have applications in automated videogame testing, it is quite obvious that this model is not aware of what it is doing,” he added. “I strongly urge everyone to think about what deployment of these models implies for society and their potential misuse.”

About Our Expert

Michael Kan

Michael Kan

Principal Reporter

My Experience

I've been a journalist for over 15 years. I got my start as a schools and cities reporter in Kansas City and joined PCMag in 2017, where I cover satellite internet services, cybersecurity, PC hardware, and more. I'm currently based in San Francisco, but previously spent over five years in China, covering the country's technology sector.

Since 2020, I've covered the launch and explosive growth of SpaceX's Starlink satellite internet service, writing 600+ stories on availability and feature launches, but also the regulatory battles over the expansion of satellite constellations, fights with rival providers like AST SpaceMobile and Amazon, and the effort to expand into satellite-based mobile service. I've combed through FCC filings for the latest news and driven to remote corners of California to test Starlink's cellular service.

I also cover cyber threats, from ransomware gangs to the emergence of AI-based malware. In 2024 and 2025, the FTC forced Avast to pay consumers $16.5 million for secretly harvesting and selling their personal information to third-party clients, as revealed in my joint investigation with Motherboard.

I also cover the PC graphics card market. Pandemic-era shortages led me to camp out in front of a Best Buy to get an RTX 3000. I'm now following how the AI-driven memory shortage is impacting the entire consumer electronics market. I'm always eager to learn more, so please jump in the comments with feedback and send me tips.

The Best Tech I've Had:

  • My first video game console: a Nintendo Famicom
  • I loved my Sega Saturn despite PlayStation's popularity.
  • The iPod Video I received as a gift in college
  • Xbox 360 FTW
  • The Galaxy Nexus was the first smartphone I was proud to own.
  • The PC desktop I built in 2013, which still works to this day.

Read full bio