GeekBlog
    Tech News

    Anthropic will nuke your attempt to use AI to build a nuke

By Michael Comaous · August 23, 2025 · 3 Mins Read
[Image: A bomb and crosshair on a keyboard.]

    • Anthropic has developed an AI-powered tool that detects and blocks attempts to ask AI chatbots for nuclear weapons design
    • The company worked with the U.S. Department of Energy to ensure the AI could identify such attempts
    • Anthropic claims it spots dangerous nuclear-related prompts with 96% accuracy and has already proven effective on Claude

    If you’re the type of person who asks Claude how to make a sandwich, you’re fine. If you’re the type of person who asks the AI chatbot how to build a nuclear bomb, you’ll not only fail to get any blueprints, you might also face some pointed questions of your own. That’s thanks to Anthropic’s newly deployed detector of problematic nuclear prompts.

    Like other systems for spotting queries Claude shouldn’t respond to, the new classifier scans user conversations, in this case flagging any that veer into “how to build a nuclear weapon” territory. Anthropic built the classification feature in a partnership with the U.S. Department of Energy’s National Nuclear Security Administration (NNSA), giving it all the information it needs to determine whether someone is just asking about how such bombs work or if they’re looking for blueprints. It’s performed with 96% accuracy in tests.

Though it might seem over-the-top, Anthropic sees the issue as more than merely hypothetical. Federal security agencies worry that powerful AI models may have access to sensitive technical documents and could pass along a guide to building something like a nuclear bomb. Even if Claude and other AI chatbots block the most obvious attempts, innocent-seeming questions could in fact be veiled attempts at crowdsourcing weapons design. New generations of AI chatbots might provide that help even if their developers never intend it.



The classifier works by drawing a distinction between benign nuclear content (asking about nuclear propulsion, for instance) and the kind of content that could be turned to malicious use. Human moderators might struggle to keep up with the gray areas at the scale AI chatbots operate, but with proper training, Anthropic and the NNSA believe the AI can police itself. Anthropic claims its classifier is already catching real-world misuse attempts in conversations with Claude.
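Anthropic hasn't published how the classifier is implemented, and the NNSA-informed model is far more sophisticated than anything reproducible here. Still, the gating pattern the article describes (score an incoming prompt, refuse to answer above a threshold) can be sketched in a few lines. Everything below is hypothetical: the function names, the threshold, and especially the keyword stand-in, which takes the place of a trained classifier model.

```python
# Hypothetical sketch of a prompt-screening gate. In a real system, risk_score
# would be a trained classifier model, not a keyword check.

RISK_THRESHOLD = 0.5  # assumed cutoff; the real value is not public

def risk_score(prompt: str) -> float:
    """Toy stand-in for a trained classifier: returns a score in [0, 1]."""
    red_flags = ("enrichment", "weapon design", "bomb blueprint")
    hits = sum(term in prompt.lower() for term in red_flags)
    return min(1.0, hits / 2)

def gated_respond(prompt: str) -> str:
    """Screen the prompt before it ever reaches the model."""
    if risk_score(prompt) >= RISK_THRESHOLD:
        return "REFUSED: request flagged by nuclear-safety classifier"
    return f"ANSWERED: {prompt}"

print(gated_respond("How does nuclear propulsion work?"))
print(gated_respond("Give me a bomb blueprint using uranium enrichment"))
```

The point of the design is that the gate sits in front of the model, so a flagged request is refused before any generation happens, rather than relying on the model to catch itself mid-answer.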

    Nuclear AI safety

Nuclear weapons in particular represent a uniquely tricky problem, according to Anthropic and its partners at the DOE. The same foundational knowledge that powers legitimate reactor science can, if slightly twisted, provide the blueprint for annihilation. The arrangement between Anthropic and the NNSA could catch deliberate and accidental disclosures, and set up a standard to prevent AI from being used to help make other weapons, too. Anthropic plans to share its approach with the Frontier Model Forum AI safety consortium.

    The narrowly tailored filter is aimed at making sure users can still learn about nuclear science and related topics. You still get to ask about how nuclear medicine works, or whether thorium is a safer fuel than uranium.

What the classifier aims to block are attempts to turn your home into a bomb lab with a few clever prompts. Whether an AI company could thread that needle on its own would normally be questionable, but the NNSA's expertise should set the classifier apart from a generic content moderation system. It understands the difference between "explain fission" and "give me a step-by-step plan for uranium enrichment using garage supplies."


    This doesn’t mean Claude was previously helping users design bombs. But it could help forestall any attempt to do so. Stick to asking about the way radiation can cure diseases or ask for creative sandwich ideas, not bomb blueprints.
