PCMag editors select and review products independently. If you buy through affiliate links, we may earn commissions, which help support our testing.

This AI Is Outranking Humans as a Top Software Bug Hunter

The program, Xbow, has climbed the leaderboards on HackerOne to become the top vulnerability researcher in the US, prompting debate about the role of AI in cybersecurity.

 & Michael Kan Principal Reporter

Our team tests, rates, and reviews more than 1,500 products each year to help you make better buying decisions and get more from technology.

Our Expert
LOOK INSIDE PC LABS HOW WE TEST
65 EXPERTS
43 YEARS
41,500+ REVIEWS
(Credit: BlackJack3D via Getty Images)

An AI program has climbed the leaderboard for discovering real-world software vulnerabilities, besting the human reviewers to take the top spot in the US. 

The program, called Xbow, currently ranks number one on the US-based leaderboard at HackerOne, a platform that coordinates software vulnerability discoveries with major companies. 

Over the last few months, Xbow has increased its reputation score at HackerOne by reporting over 1,000 apparent software flaws; 132 have been reported as officially discovered and resolved. Impacted companies include The Walt Disney Company, AT&T, Ford and Epic Games.

(Credit: HackerOne)
(Credit: Xbow)

In total, the AI program has submitted nearly 1,060 vulnerabilities, the startup behind Xbow announced on Tuesday. “All findings were fully automated,” the team added, "though our security team reviewed them pre-submission to comply with HackerOne’s policy on automated tools." 

While 132 of the flaws were officially resolved, another 303 were classified as “triaged,” meaning the reported bug has been acknowledged, but not resolved. Another 125 are pending review. 

So, it’s possible Xbow may have discovered an even larger crop of vulnerabilities that still need to be confirmed. But the AI program didn’t always find a new security issue; 208 of the submitted reports were marked as “duplicates” while another 209 were flagged as merely “informative.” The remaining 36 were declared not applicable. 

Still, the results show how new AI programs could shake up the cybersecurity industry through automated vulnerability discovery at a scale that outpaces human security researchers. “Notably, around 45% of Xbow’s findings are still awaiting resolution, highlighting the volume and impact of the submissions across live targets,” the Xbow team adds.

(Credit: Xbow)

In addition, technology promises to help companies stay ahead of malicious hackers who have also been trying to adopt generative AI. Xbow is designed to be fully autonomous, with the ability to complete “comprehensive penetration tests in just a few hours,” the company says. 

But Xbow is also raising some concerns about whether it’s generating quantity over quality in terms of vulnerability reports. “Receiving hundreds of AI-generated bug reports would be so demoralizing and probably turn me off from maintaining an open source project forever,” wrote one user on the Hacker News forum. “I think developers are going to eventually need tools to filter out slop.” 

Brendan Dolan-Gavitt, an Xbow AI researcher, responded to the skepticism and criticism, writing: “The main difference is that all of the vulnerabilities reported here are real, many quite critical.” Others also point out the submissions from human security researchers on HackerOne can also be of low-quality. 

HackerOne also chimed in on Xbow's development: "AI is a force multiplier for crowdsourced security,” said Michiel Prins, HackerOne's cofounder, in a statement. “Hackbot companies like Xbow are bringing impressive innovation to the space, accelerating how we discover and respond to vulnerabilities. But AI doesn’t learn to hack on its own—hackers train it. Human researchers remain essential partners in this feedback loop, and while AI leads in volume, we still see humans delivering the findings with the greatest business impact. Hackbots are simply the next step in an evolution driven by human ingenuity harnessing automation.”

Xbow released the results as it's trying to sell its technology to customers. Bloomberg reports that Xbox recently raised $75 million through a new funding round.

About Our Expert

Michael Kan

Michael Kan

Principal Reporter

My Experience

I've been a journalist for over 15 years. I got my start as a schools and cities reporter in Kansas City and joined PCMag in 2017, where I cover satellite internet services, cybersecurity, PC hardware, and more. I'm currently based in San Francisco, but previously spent over five years in China, covering the country's technology sector.

Since 2020, I've covered the launch and explosive growth of SpaceX's Starlink satellite internet service, writing 600+ stories on availability and feature launches, but also the regulatory battles over the expansion of satellite constellations, fights with rival providers like AST SpaceMobile and Amazon, and the effort to expand into satellite-based mobile service. I've combed through FCC filings for the latest news and driven to remote corners of California to test Starlink's cellular service.

I also cover cyber threats, from ransomware gangs to the emergence of AI-based malware. In 2024 and 2025, the FTC forced Avast to pay consumers $16.5 million for secretly harvesting and selling their personal information to third-party clients, as revealed in my joint investigation with Motherboard.

I also cover the PC graphics card market. Pandemic-era shortages led me to camp out in front of a Best Buy to get an RTX 3000. I'm now following how the AI-driven memory shortage is impacting the entire consumer electronics market. I'm always eager to learn more, so please jump in the comments with feedback and send me tips.

The Best Tech I've Had:

  • My first video game console: a Nintendo Famicom
  • I loved my Sega Saturn despite PlayStation's popularity.
  • The iPod Video I received as a gift in college
  • Xbox 360 FTW
  • The Galaxy Nexus was the first smartphone I was proud to own.
  • The PC desktop I built in 2013, which still works to this day.

Read full bio