(Credit: NUR Photo via Getty Images)
AI agents are supposed to make our lives easier, but the buzzy OpenClaw agent recently deleted the emails of a Meta employee without permission.
"Nothing humbles you like telling your OpenClaw 'confirm before acting' and watching it speedrun deleting your inbox," Meta AI security and safety researcher Summer Yue tweeted this week. "I couldn’t stop it from my phone. I had to RUN to my Mac mini like I was defusing a bomb."
Previously known as Clawdbot and then Moltbot, OpenClaw allows AI to interact with other software and services on your devices and perform longer-form tasks without interference from a human controller. But getting those agents to behave as expected in the real world is tricky.
In a follow-up tweet, Yue said she told OpenClaw to "Check this inbox too and suggest what you would archive or delete, don’t action until I tell you to." It worked on her "toy inbox," but "my real inbox was too huge and triggered compaction, [during which] it lost my original instruction."
Yue said she "deleted all the 'be proactive' instructions I could find before this happened. Maybe I missed something, that’s the part I haven’t figured out yet."
Some commenters suggested she might be testing AI guardrails with this move, but no, it was a "rookie mistake," she says. "Turns out alignment researchers aren't immune to misalignment."
While owning up to the mistake is admirable, others pointed out that this raises serious concerns for individuals who are not part of Meta's Superintelligence Labs. If someone so embedded in AI development can accidentally trigger an inbox deletion, what's going to happen to the casual AI-curious tinkerer?
When OpenClaw debuted, threat intelligence platform SOCRadar recommended treating OpenClaw as "privileged infrastructure" and implementing additional security precautions. "The butler can manage your entire house. Just make sure the front door is locked," it said.
In response to Yue's tweets, OpenClaw founder Peter Steinberger tweeted: "What that tells is that we have to get server-side compaction going, at least for models that support it." (Steinberger recently joined OpenAI.)
Yue has been in her current role for eight months. She previously worked for Scale AI (joining Meta after the buyout), Google DeepMind, and Google Brain, heading up AI research.


