Close Menu
GeekBlog

    Subscribe to Updates

    Get the latest creative news from FooBar about art, design and business.

    What's Hot

    DuckDuckGo Installs Surge as Users Flee Google’s AI Search Overhaul

    June 17, 2026

    Android 17 Is Here: Everything New in Google’s June 2026 Pixel Drop

    June 17, 2026

    Google’s Gemini Spark Is Making Standalone Apps Obsolete

    June 16, 2026
    Facebook X (Twitter) Instagram Threads
    GeekBlog
    • Home
    • Mobile
    • Tech News
    • Blog
    • How-To Guides
    • AI & Software
    Facebook
    GeekBlog
    Home»Tech News»A Meta AI security researcher said an OpenClaw agent ran amok on her inbox 
    Tech News

    A Meta AI security researcher said an OpenClaw agent ran amok on her inbox 

    Michael ComaousBy Michael ComaousFebruary 24, 20264 Mins Read
    Share Facebook Twitter Pinterest LinkedIn Tumblr Email Copy Link
    Y Combinator crew dressed like crabs
    Share
    Facebook Twitter LinkedIn Pinterest Email Copy Link

    The now-viral X post from Meta AI security researcher Summer Yue reads, at first, like satire. She told her OpenClaw AI agent to check her overstuffed email inbox and suggest what to delete or archive.  

    The agent proceeded to run amok. It started deleting all her email in a “speed run” while ignoring her commands from her phone telling it to stop. 

    “I had to RUN to my Mac mini like I was defusing a bomb,” she wrote, posting images of the ignored stop prompts as receipts.  

    The Mac Mini, an affordable Apple computer that sits flat on a desk and fits in the palm of your hand, has become the favored device these days for running OpenClaw. (The Mini is selling “like hotcakes,” one “confused” Apple employee apparently told famed AI researcher Andrej Karpathy when he bought one to run an OpenClaw alternative called NanoClaw.) 

    OpenClaw is, of course, the open source AI agent that achieved fame through Moltbook, an AI-only social network. OpenClaw agents were at the center of that now largely debunked episode on Moltbook in which it looked like the AIs were plotting against humans.  

    But OpenClaw’s mission, according to its GitHub page, is not focused on social networks. It aims to be a personal AI assistant that runs on your own devices.  

    The Silicon Valley in-crowd has fallen so in love with OpenClaw that “claw” and “claws” have become the buzzwords of choice for agents that run on personal hardware. Other such agents include ZeroClaw, IronClaw, and PicoClaw. Y Combinator’s podcast team even appeared on their most recent episode dressed in lobster costumes. 

    Techcrunch event

    Boston, MA
    |
    June 9, 2026

    But Yue’s post serves as a warning. As others on X noted, if an AI security researcher could run into this problem, what hope do mere mortals have? 

    “Were you intentionally testing its guardrails or did you make a rookie mistake?” a software developer asked her on X.  

    “Rookie mistake tbh,” she replied. She had been testing her agent with a smaller “toy” inbox, as she called it, and it had been running well on less important email. It had earned her trust, so she thought she’d let it loose on the real thing. 

    Yue believes that the large amount of data in her real inbox “triggered compaction,” she wrote. Compaction happens when the context window — the running record of everything the AI has been told and has done in a session — grows too large, causing the agent to begin summarizing, compressing, and managing the conversation.  

    At that point, the AI may skip over instructions that the human considers quite important.  

    In this case, it may have skipped her last prompt — where she told it not to act — and reverted back to its instructions from the “toy” inbox. 

    As several others on X pointed out, prompts can’t be trusted to act as security guardrails. Models may misconstrue or ignore them. 

    Various people offered suggestions that ranged from the exact syntax Yue should have used to stop the agent, to various methods to ensure better adherence to guardrails, like writing instructions to dedicated files or using other open source tools. 

    In the interest of full transparency, TechCrunch could not independently verify what happened to Yue’s inbox. (She didn’t respond to our request for comment, though she did respond to many questions and comments sent her way on X.) 

    But it doesn’t really matter. 

    The point of the tale is that agents aimed at knowledge workers, at their current stage of development, are risky. People who say they are using them successfully are cobbling together methods to protect themselves.

    One day, perhaps soon (by 2027? 2028?), they may be ready for widespread use. Goodness knows many of us would love help with email, grocery orders, and scheduling dentist appointments. But that day has not yet come. 

    Source: techcrunch.com

    Meta Security
    Share. Facebook Twitter Pinterest LinkedIn Tumblr Telegram Email Copy Link
    Previous ArticleBillions of dollars later and still nobody knows what an Xbox is
    Next Article Data center builders thought farmers would willingly sell land, learn otherwise
    Michael Comaous
    • Website

    Michael Comaous is a dedicated professional with a passion for technology, innovation, and creative problem-solving. Over the years, he has built experience across multiple industries, combining strategic thinking with hands-on expertise to deliver meaningful results. Michael is known for his curiosity, attention to detail, and ability to explain complex topics in a clear and approachable way. Whether he’s working on new projects, writing, or collaborating with others, he brings energy and a forward-thinking mindset to everything he does.

    Related Posts

    6 Mins Read

    DuckDuckGo Installs Surge as Users Flee Google’s AI Search Overhaul

    7 Mins Read

    Google’s Gemini Spark Is Making Standalone Apps Obsolete

    6 Mins Read

    How Modular Data Centers Are Solving AI’s Biggest Infrastructure Problem

    6 Mins Read

    Google I/O 2026: Inside Google’s Most Aggressive AI Push Yet

    5 Mins Read

    AI Glasses Just Had Their Biggest Year Ever – And 2026 Could Top It

    3 Mins Read

    Stop falling for scams when Norton’s antivirus software is 70% off right now

    Top Posts

    The Mesh Router Placement Strategy That Finally Gave Me Full Home Coverage

    August 4, 20251,109 Views

    Discord will require a face scan or ID for full access next month

    February 9, 2026769 Views

    Best Stores for Buying MP3 and Digital Music You Can Keep Forever

    August 2, 2025558 Views
    Stay In Touch
    • Facebook

    Subscribe to Updates

    Get the latest tech news from FooBar about tech, design and biz.

    Most Popular

    The Mesh Router Placement Strategy That Finally Gave Me Full Home Coverage

    August 4, 20251,109 Views

    Discord will require a face scan or ID for full access next month

    February 9, 2026769 Views

    Best Stores for Buying MP3 and Digital Music You Can Keep Forever

    August 2, 2025558 Views
    Our Picks

    DuckDuckGo Installs Surge as Users Flee Google’s AI Search Overhaul

    June 17, 2026

    Android 17 Is Here: Everything New in Google’s June 2026 Pixel Drop

    June 17, 2026

    Google’s Gemini Spark Is Making Standalone Apps Obsolete

    June 16, 2026

    Subscribe to Updates

    Get the latest creative news from FooBar about art, design and business.

    Facebook
    • About Us
    • Contact us
    • Privacy Policy
    • Disclaimer
    • Terms and Conditions
    © 2026 GeekBlog

    Type above and press Enter to search. Press Esc to cancel.

    Ad Blocker Enabled!
    Ad Blocker Enabled!
    Our website is made possible by displaying online advertisements to our visitors. Please support us by disabling your Ad Blocker.