PCMag editors select and review products independently. If you buy through affiliate links, we may earn commissions, which help support our testing.

How Do You Beat The AlphaGo AI? Make A Smarter Version

DeepMind has created a more powerful version of AlphaGo that doesn't rely on human input to learn.

 & Michael Kan Principal Reporter

Our team tests, rates, and reviews more than 1,500 products each year to help you make better buying decisions and get more from technology.

Our Expert
LOOK INSIDE PC LABS HOW WE TEST
65 EXPERTS
43 YEARS
41,500+ REVIEWS

AlphaGo, the first computer program to beat the world champion of the game of Go, recently found itself losing dozens of matches.

Who was the program's new challenger? A more powerful version of itself.

This new and improved AlphaGo can actually learn to play the game on its own, without any feedback from humans, according to DeepMind, the Alphabet-owned company behind the computer program.

On Wednesday, DeepMind detailed this latest evolution of AlphaGo, which it calls Zero, in a new research paper published in Nature.

What sets Zero apart from the older versions of AlphaGo is how the program learns. Previous iterations did so by competing with human players, both amateur and professional.

Zero is different. This version learned by playing the game against itself, DeepMind wrote in a blog post.

To accomplish this, the company used a machine learning technique called "reinforcement learning" to push Zero to optimize its gameplay. The program's algorithms were then fine-tuned to predict future moves and the eventual winner of each match.

"This technique is more powerful than previous versions of AlphaGo because it is no longer constrained by the limits of human knowledge," the company said.

That change helped Zero become an even stronger Go player than the earlier iterations.

The skill levels of the different AlphaGo versions.

After only three days of self-training, the new version was pitted against an earlier AlphaGo program that defeated 18-time world champion Lee Sedol last year. Zero performed so well that it won all 100 matches played.

"The system progressively learned the game of Go from scratch, accumulating thousands of years of human knowledge during a period of just a few days," DeepMind said.

After 40 days of self-training, Zero was then pitted against the AlphaGo program that defeated the current world champion Ke Jie earlier this year. It went on to win 89 of the 100 games played.

How any of this research might apply to other fields outside of an ancient board game still isn't clear. But according to DeepMind, the new version of AlphaGo shows that A.I. programs don't always have to rely on human-created data to become smart.

About Our Expert

Michael Kan

Michael Kan

Principal Reporter

My Experience

I've been a journalist for over 15 years. I got my start as a schools and cities reporter in Kansas City and joined PCMag in 2017, where I cover satellite internet services, cybersecurity, PC hardware, and more. I'm currently based in San Francisco, but previously spent over five years in China, covering the country's technology sector.

Since 2020, I've covered the launch and explosive growth of SpaceX's Starlink satellite internet service, writing 600+ stories on availability and feature launches, but also the regulatory battles over the expansion of satellite constellations, fights with rival providers like AST SpaceMobile and Amazon, and the effort to expand into satellite-based mobile service. I've combed through FCC filings for the latest news and driven to remote corners of California to test Starlink's cellular service.

I also cover cyber threats, from ransomware gangs to the emergence of AI-based malware. In 2024 and 2025, the FTC forced Avast to pay consumers $16.5 million for secretly harvesting and selling their personal information to third-party clients, as revealed in my joint investigation with Motherboard.

I also cover the PC graphics card market. Pandemic-era shortages led me to camp out in front of a Best Buy to get an RTX 3000. I'm now following how the AI-driven memory shortage is impacting the entire consumer electronics market. I'm always eager to learn more, so please jump in the comments with feedback and send me tips.

The Best Tech I've Had:

  • My first video game console: a Nintendo Famicom
  • I loved my Sega Saturn despite PlayStation's popularity.
  • The iPod Video I received as a gift in college
  • Xbox 360 FTW
  • The Galaxy Nexus was the first smartphone I was proud to own.
  • The PC desktop I built in 2013, which still works to this day.

Read full bio