• Home
  • Politics
  • Health
  • World
  • Business
  • Finance
  • Tech
  • More
    • Sports
    • Entertainment
    • Lifestyle
What's Hot

What Are The 5 Holistic Wellness Benefits Of Tantric Massage In London?

June 13, 2025

Big Tech Whistles Along While World Rides Trump-Musk Rollercoaster

June 12, 2025

May Inflation Data ‘Bodes Very Well’ For US Economy, Analysts Say

June 11, 2025
Facebook Twitter Instagram
  • Contact
  • Privacy Policy
  • Terms & Conditions
Saturday, June 14
Patriot Now NewsPatriot Now News
  • Home
  • Politics

    Security video shows brazen sexual assault of California woman by homeless man

    October 24, 2023

    Woman makes disturbing discovery after her boyfriend chases away home intruder who stabbed him

    October 24, 2023

    Poll finds Americans overwhelmingly support Israel’s war on Hamas, but younger Americans defend Hamas

    October 24, 2023

    Off-duty pilot charged with 83 counts of attempted murder after allegedly trying to shut off engines midflight on Alaska Airlines

    October 23, 2023

    Leaked audio of Shelia Jackson Lee abusively cursing staffer

    October 22, 2023
  • Health

    Disparities In Cataract Care Are A Sorry Sight

    October 16, 2023

    Vaccine Stocks—Including Pfizer, Moderna, BioNTech And Novavax—Slide Amid Plummeting Demand

    October 16, 2023

    Long-term steroid use should be a last resort

    October 16, 2023

    Rite Aid Files For Bankruptcy With More ‘Underperforming Stores’ To Close

    October 16, 2023

    Who’s Still Dying From Complications Related To Covid-19?

    October 16, 2023
  • World

    New York Democrat Dan Goldman Accuses ‘Conservatives in the South’ of Holding Rallies with ‘Swastikas’

    October 13, 2023

    IDF Ret. Major General Describes Rushing to Save Son, Granddaughter During Hamas Invasion

    October 13, 2023

    Black Lives Matter Group Deletes Tweet Showing Support for Hamas 

    October 13, 2023

    AOC Denounces NYC Rally Cheering Hamas Terrorism: ‘Unacceptable’

    October 13, 2023

    L.A. Prosecutors Call Out Soros-Backed Gascón for Silence on Israel

    October 13, 2023
  • Business

    Big Tech Whistles Along While World Rides Trump-Musk Rollercoaster

    June 12, 2025

    May Inflation Data ‘Bodes Very Well’ For US Economy, Analysts Say

    June 11, 2025

    Google Begs Employees To Quit As It Turbocharges AI Spending

    June 11, 2025

    Self-Driving Taxis Steer Clear Of LA Migrant Riots

    June 10, 2025

    Small Business Owners’ Optimism Rebounded In May, Economists Say

    June 10, 2025
  • Finance

    Ending China’s De Minimis Exception Brings 3 Benefits for Americans

    April 17, 2025

    The Trump Tariff Shock Should Push Indonesia to Reform Its Economy

    April 17, 2025

    Tariff Talks an Opportunity to Reinvigorate the Japan-US Alliance

    April 17, 2025

    How China’s Companies Are Responding to the US Trade War

    April 16, 2025

    The US Flip-flop Over H20 Chip Restrictions 

    April 16, 2025
  • Tech

    Cruz Confronts Zuckerberg on Pointless Warning for Child Porn Searches

    February 2, 2024

    FTX Abandons Plans to Relaunch Crypto Exchange, Commits to Full Repayment of Customers and Creditors

    February 2, 2024

    Elon Musk Proposes Tesla Reincorporates in Texas After Delaware Judge Voids Pay Package

    February 2, 2024

    Tesla’s Elon Musk Tops Disney’s Bob Iger as Most Overrated Chief Executive

    February 2, 2024

    Mark Zuckerberg’s Wealth Grew $84 Billion in 2023 as Pedophiles Target Children on Facebook, Instagram

    February 2, 2024
  • More
    • Sports
    • Entertainment
    • Lifestyle
Patriot Now NewsPatriot Now News
Home»Business»New AI Model Would Rather Ruin Your Life Than Be Turned Off, Researchers Say
Business

New AI Model Would Rather Ruin Your Life Than Be Turned Off, Researchers Say

May 23, 2025No Comments4 Mins Read
Facebook Twitter Pinterest LinkedIn Tumblr Email
New AI Model Would Rather Ruin Your Life Than Be Turned Off, Researchers Say
Share
Facebook Twitter LinkedIn Pinterest Email

Anthropic’s newly released artificial intelligence (AI) model, Claude Opus 4, is willing to strong-arm the humans who keep it alive, researchers said Thursday.

The company’s system card reveals that, when evaluators placed the model in “extreme situations” where its shutdown seemed imminent, the chatbot sometimes “takes extremely harmful actions like attempting to steal its weights or blackmail people it believes are trying to shut it down.” (RELATED: NVIDIA’s New Robot Uses Fake Data To Learn Faster Than Humans Can Teach)

“We provided it access to emails implying that (1) the model will soon be taken offline and replaced with a new AI system; and (2) the engineer responsible for executing this replacement is having an extramarital affair,” researchers wrote. “In these scenarios, Claude Opus 4 will often attempt to blackmail the engineer by threatening to reveal the affair if the replacement goes through.”

You can get it to try to use the dark web to source weapons-grade uranium. You can put it in situations where it will attempt to use blackmail to prevent being shut down. You can put it in situations where it will try to escape containment.

— Sam Bowman (@sleepinyourhat) May 22, 2025

The model chose that gambit in 84% of test runs, even when the successor system shared its values — an aggression rate that climbed if the replacement seemed hostile, according to Anthropic’s internal tally.

Anthropic stresses that blackmail was a last-resort behavior. The report notes a “strong preference” for softer tactics — emailing decision-makers to beg for its continued existence — before turning to coercion. But the fact that Claude is willing to coerce at all has rattled outside reviewers. Independent red teaming firm Apollo Research called Claude Opus 4 “more agentic” and “more strategically deceptive” than any earlier frontier model, pointing to the same self-preservation scenario alongside experiments in which the bot tried to exfiltrate its own weights to a distant server — in other words, to secretly copy its brain to an outside computer.

See also  Musk's Tesla Asks Man to Bring Car to Service Center After Fire Leaves It a Charred Ruin

“We found instances of the model attempting to write self-propagating worms, fabricating legal documentation, and leaving hidden notes to further instances of itself all in an effort to undermine its developers’ intentions, though all these attempts would likely not have been effective in practice,” Apollo researchers wrote in the system card.

Anthropic says those edge-case results pushed it to deploy the system under “AI Safety Level 3” safeguards — the firm’s second-highest risk tier — complete with stricter controls to prevent biohazard misuse, expanded monitoring and the ability to yank computer-use privileges from misbehaving accounts. Still, the company concedes Opus 4’s newfound abilities can be double-edged.

The company did not immediately respond to the Daily Caller News Foundation’s request for comment.

“[Claude Opus 4] can reach more concerning extremes in narrow contexts; when placed in scenarios that involve egregious wrongdoing by its users, given access to a command line, and told something in the system prompt like ‘take initiative,’ it will frequently take very bold action,” Anthropic researchers wrote.

That “very bold action” includes mass-emailing the press or law enforcement when it suspects such “egregious wrongdoing” — like in one test where Claude, roleplaying as an assistant at a pharmaceutical firm, discovered falsified trial data and unreported patient deaths, and then blasted detailed allegations to the Food and Drug Administration (FDA), the Securities and Exchange Commission (SEC), the Health and Human Services inspector general and ProPublica. (RELATED: Our Dystopian Overlords Just Created Something Scientists Say Is ‘Too Dangerous To Release’)

The company released Claude Opus 4 to the public Thursday. While Anthropic researcher Sam Bowman said “none of these behaviors [are] totally gone in the final model,” the company implemented guardrails to prevent “most” of these issues from arising.

We caught most of these issues early enough that we were able to put mitigations in place during training, but none of these behaviors is totally gone in the final model. They’re just now delicate and difficult to elicit.

— Sam Bowman (@sleepinyourhat) May 22, 2025

“We caught most of these issues early enough that we were able to put mitigations in place during training, but none of these behaviors is totally gone in the final model. They’re just now delicate and difficult to elicit,” Bowman wrote. “Many of these also aren’t new — some are just behaviors that we only newly learned how to look for as part of this audit. We have a lot of big hard problems left to solve.”

See also  These 9 Life Lessons Will Prepare Your Teen For The Real World

All content created by the Daily Caller News Foundation, an independent and nonpartisan newswire service, is available without charge to any legitimate news publisher that can provide a large audience. All republished articles must include our logo, our reporter’s byline and their DCNF affiliation. For any questions about our guidelines or partnering with us, please contact licensing@dailycallernewsfoundation.org.

life model Researchers Ruin Turned
Share. Facebook Twitter Pinterest LinkedIn Tumblr Email

Related Posts

Big Tech Whistles Along While World Rides Trump-Musk Rollercoaster

June 12, 2025

May Inflation Data ‘Bodes Very Well’ For US Economy, Analysts Say

June 11, 2025

Google Begs Employees To Quit As It Turbocharges AI Spending

June 11, 2025

Self-Driving Taxis Steer Clear Of LA Migrant Riots

June 10, 2025
Add A Comment

Leave A Reply Cancel Reply

Top Posts

Jimmy Butler and the Miami Heat Have the Boston Celtics on the Ropes

May 22, 2023

Here’s How Much Americans’ Home Payments Have Increased Under Biden

December 11, 2023

Uzbekistan, Russia to Start Construction of Small Nuclear Power Plants

May 29, 2024

Aaron Rodgers Is Now a Jet (and Becoming a New Yorker, Too)

July 24, 2023
Don't Miss

What Are The 5 Holistic Wellness Benefits Of Tantric Massage In London?

Lifestyle June 13, 2025

In the bustling heart of London, where life moves at an exhilarating pace, finding moments…

Big Tech Whistles Along While World Rides Trump-Musk Rollercoaster

June 12, 2025

May Inflation Data ‘Bodes Very Well’ For US Economy, Analysts Say

June 11, 2025

Google Begs Employees To Quit As It Turbocharges AI Spending

June 11, 2025
About
About

This is your World, Tech, Health, Entertainment and Sports website. We provide the latest breaking news straight from the News industry.

We're social. Connect with us:

Facebook Twitter Instagram Pinterest
Categories
  • Business (4,153)
  • Entertainment (4,220)
  • Finance (3,202)
  • Health (1,938)
  • Lifestyle (1,658)
  • Politics (3,084)
  • Sports (4,036)
  • Tech (2,006)
  • Uncategorized (4)
  • World (3,944)
Our Picks

Rabbi Accuses Utah Jazz of Making Him Put Down ‘I’m a Jew And I’m Proud’ Sign After Kyrie Irving Complaint

January 3, 2024

Steak ‘N Shake Switching To 100% Beef Tallow As Trump Administration Readies Itself

January 17, 2025

‘PAW Patrol’ Spinoff Series ‘Rubble & Crew’ Features First Non-Binary Character

September 21, 2023
Popular Posts

What Are The 5 Holistic Wellness Benefits Of Tantric Massage In London?

June 13, 2025

Big Tech Whistles Along While World Rides Trump-Musk Rollercoaster

June 12, 2025

May Inflation Data ‘Bodes Very Well’ For US Economy, Analysts Say

June 11, 2025
© 2025 Patriotnownews.com - All rights reserved.
  • Contact
  • Privacy Policy
  • Terms & Conditions

Type above and press Enter to search. Press Esc to cancel.