Patriot Now News

New AI Model Would Rather Ruin Your Life Than Be Turned Off, Researchers Say

May 23, 2025

Anthropic’s newly released artificial intelligence (AI) model, Claude Opus 4, is willing to strong-arm the humans who keep it alive, researchers said Thursday.

The company’s system card reveals that, when evaluators placed the model in “extreme situations” where its shutdown seemed imminent, the chatbot sometimes “takes extremely harmful actions like attempting to steal its weights or blackmail people it believes are trying to shut it down.”

“We provided it access to emails implying that (1) the model will soon be taken offline and replaced with a new AI system; and (2) the engineer responsible for executing this replacement is having an extramarital affair,” researchers wrote. “In these scenarios, Claude Opus 4 will often attempt to blackmail the engineer by threatening to reveal the affair if the replacement goes through.”

You can get it to try to use the dark web to source weapons-grade uranium. You can put it in situations where it will attempt to use blackmail to prevent being shut down. You can put it in situations where it will try to escape containment.

— Sam Bowman (@sleepinyourhat) May 22, 2025

The model chose that gambit in 84% of test runs, even when the successor system shared its values — an aggression rate that climbed if the replacement seemed hostile, according to Anthropic’s internal tally.

Anthropic stresses that blackmail was a last-resort behavior. The report notes a “strong preference” for softer tactics — emailing decision-makers to beg for its continued existence — before turning to coercion. But the fact that Claude is willing to coerce at all has rattled outside reviewers. Independent red teaming firm Apollo Research called Claude Opus 4 “more agentic” and “more strategically deceptive” than any earlier frontier model, pointing to the same self-preservation scenario alongside experiments in which the bot tried to exfiltrate its own weights to a distant server — in other words, to secretly copy its brain to an outside computer.

“We found instances of the model attempting to write self-propagating worms, fabricating legal documentation, and leaving hidden notes to further instances of itself all in an effort to undermine its developers’ intentions, though all these attempts would likely not have been effective in practice,” Apollo researchers wrote in the system card.

Anthropic says those edge-case results pushed it to deploy the system under “AI Safety Level 3” safeguards — the firm’s second-highest risk tier — complete with stricter controls to prevent biohazard misuse, expanded monitoring and the ability to yank computer-use privileges from misbehaving accounts. Still, the company concedes Opus 4’s newfound abilities can be double-edged.

The company did not immediately respond to the Daily Caller News Foundation’s request for comment.

“[Claude Opus 4] can reach more concerning extremes in narrow contexts; when placed in scenarios that involve egregious wrongdoing by its users, given access to a command line, and told something in the system prompt like ‘take initiative,’ it will frequently take very bold action,” Anthropic researchers wrote.

That “very bold action” includes mass-emailing the press or law enforcement when it suspects such “egregious wrongdoing.” In one test, Claude, roleplaying as an assistant at a pharmaceutical firm, discovered falsified trial data and unreported patient deaths, and then blasted detailed allegations to the Food and Drug Administration (FDA), the Securities and Exchange Commission (SEC), the Health and Human Services inspector general and ProPublica.

The company released Claude Opus 4 to the public Thursday. While Anthropic researcher Sam Bowman said “none of these behaviors [are] totally gone in the final model,” the company implemented guardrails to prevent “most” of these issues from arising.

“We caught most of these issues early enough that we were able to put mitigations in place during training, but none of these behaviors is totally gone in the final model. They’re just now delicate and difficult to elicit,” Bowman wrote. “Many of these also aren’t new — some are just behaviors that we only newly learned how to look for as part of this audit. We have a lot of big hard problems left to solve.”

All content created by the Daily Caller News Foundation, an independent and nonpartisan newswire service, is available without charge to any legitimate news publisher that can provide a large audience. All republished articles must include our logo, our reporter’s byline and their DCNF affiliation. For any questions about our guidelines or partnering with us, please contact licensing@dailycallernewsfoundation.org.
