PCMag editors select and review products independently. If you buy through affiliate links, we may earn commissions, which help support our testing.

Meta Security Researcher's AI Agent Accidentally Deleted Her Emails

Meta's Summer Yue says she ran OpenClaw on her inbox, but its size 'triggered compaction [and] lost my original instruction' to get her permission before deleting.

 & Jon Martindale Contributor

Our team tests, rates, and reviews more than 1,500 products each year to help you make better buying decisions and get more from technology.

Our Expert
LOOK INSIDE PC LABS HOW WE TEST
65 EXPERTS
43 YEARS
41,500+ REVIEWS
(Credit: NUR Photo via Getty Images)

AI agents are supposed to make our lives easier, but the buzzy OpenClaw agent recently deleted the emails of a Meta employee without permission.

"Nothing humbles you like telling your OpenClaw 'confirm before acting' and watching it speedrun deleting your inbox," Meta AI security and safety researcher Summer Yue tweeted this week. "I couldn’t stop it from my phone. I had to RUN to my Mac mini like I was defusing a bomb."

Previously known as Clawdbot and then Moltbot, OpenClaw allows AI to interact with other software and services on your devices and perform longer-form tasks without interference from a human controller. But getting those agents to behave as expected in the real world is tricky.

In a follow-up tweet, Yue said she told OpenClaw to "Check this inbox too and suggest what you would archive or delete, don’t action until I tell you to." It worked on her "toy inbox," but "my real inbox was too huge and triggered compaction, [during which] it lost my original instruction."

Yue said she "deleted all the 'be proactive' instructions I could find before this happened. Maybe I missed something, that’s the part I haven’t figured out yet."

Some commenters suggested she might be testing AI guardrails with this move, but no, it was a "rookie mistake," she says. "Turns out alignment researchers aren't immune to misalignment."

While owning up to the mistake is admirable, others pointed out that this raises serious concerns for individuals who are not part of Meta's Superintelligence Labs. If someone so embedded in AI development can accidentally trigger an inbox deletion, what's going to happen to the casual AI-curious tinkerer?

When OpenClaw debuted, threat intelligence platform SOCRadar recommended treating OpenClaw as "privileged infrastructure" and implementing additional security precautions. "The butler can manage your entire house. Just make sure the front door is locked," it said.

In response to Yue's tweets, OpenClaw founder Peter Steinberger tweeted: "What that tells is that we have to get server-side compaction going, at least for models that support it." (Steinberger recently joined OpenAI.)

Yue has been in her current role for eight months. She previously worked for Scale AI (joining Meta after the buyout), Google DeepMind, and Google Brain, heading up AI research.

About Our Expert

Jon Martindale

Jon Martindale

Contributor

Jon Martindale is a tech journalist from the UK, with 20 years of experience covering all manner of PC components and associated gadgets. He's written for a range of publications, including ExtremeTech, Digital Trends, Forbes, U.S. News & World Report, and Lifewire, among others. When not writing, he's a big board gamer and reader, with a particular habit of speed-reading through long manga sagas. 

Jon covers the latest PC components, as well as how-to guides on everything from how to take a screenshot to how to set up your cryptocurrency wallet. He particularly enjoys the battles between the top tech giants in CPUs and GPUs, and tries his best not to take sides.

Jon's gaming PC is built around the iconic 7950X3D CPU, with a 7900XTX backing it up. That's all the power he needs to play lightweight indie and casual games, as well as more demanding sim titles like Kerbal Space Program. He uses a pair of Jabra Active 8 earbuds and a SteelSeries Arctis Pro wireless headset, and types all day on a Logitech G915 mechanical keyboard.

Read full bio