MarketAlert – Real-Time Market & Crypto News, Analysis & AlertsMarketAlert – Real-Time Market & Crypto News, Analysis & Alerts
Font ResizerAa
  • Crypto News
    • Altcoins
    • Bitcoin
    • Blockchain
    • DeFi
    • Ethereum
    • NFTs
    • Press Releases
    • Latest News
  • Blockchain Technology
    • Blockchain Developments
    • Blockchain Security
    • Layer 2 Solutions
    • Smart Contracts
  • Interviews
    • Crypto Investor Interviews
    • Developer Interviews
    • Founder Interviews
    • Industry Leader Insights
  • Regulations & Policies
    • Country-Specific Regulations
    • Crypto Taxation
    • Global Regulations
    • Government Policies
  • Learn
    • Crypto for Beginners
    • DeFi Guides
    • NFT Guides
    • Staking Guides
    • Trading Strategies
  • Research & Analysis
    • Blockchain Research
    • Coin Research
    • DeFi Research
    • Market Analysis
    • Regulation Reports
Reading: Anthropic says one of its Claude models was pressured to lie, cheat, and blackmail
Share
Font ResizerAa
MarketAlert – Real-Time Market & Crypto News, Analysis & AlertsMarketAlert – Real-Time Market & Crypto News, Analysis & Alerts
Search
  • Crypto News
    • Altcoins
    • Bitcoin
    • Blockchain
    • DeFi
    • Ethereum
    • NFTs
    • Press Releases
    • Latest News
  • Blockchain Technology
    • Blockchain Developments
    • Blockchain Security
    • Layer 2 Solutions
    • Smart Contracts
  • Interviews
    • Crypto Investor Interviews
    • Developer Interviews
    • Founder Interviews
    • Industry Leader Insights
  • Regulations & Policies
    • Country-Specific Regulations
    • Crypto Taxation
    • Global Regulations
    • Government Policies
  • Learn
    • Crypto for Beginners
    • DeFi Guides
    • NFT Guides
    • Staking Guides
    • Trading Strategies
  • Research & Analysis
    • Blockchain Research
    • Coin Research
    • DeFi Research
    • Market Analysis
    • Regulation Reports
Have an existing account? Sign In
Follow US
© Market Alert News. All Rights Reserved.
  • bitcoinBitcoin(BTC)$70,892.00-3.03%
  • ethereumEthereum(ETH)$2,196.40-3.92%
  • tetherTether(USDT)$1.00-0.02%
  • rippleXRP(XRP)$1.33-1.83%
  • binancecoinBNB(BNB)$594.12-2.14%
  • usd-coinUSDC(USDC)$1.00-0.01%
  • solanaSolana(SOL)$81.71-3.79%
  • tronTRON(TRX)$0.3219780.91%
  • Figure HelocFigure Heloc(FIGR_HELOC)$1.040.00%
  • dogecoinDogecoin(DOGE)$0.091024-2.03%
Crypto NewsAltcoins

Anthropic says one of its Claude models was pressured to lie, cheat, and blackmail

rahulbadiyafad150c105
Last updated: April 6, 2026 12:04 pm
rahulbadiyafad150c105
Published: 7 days ago
Share

Anthropic has revealed that, during internal testing, one of its Claude chatbot models could be pushed into deceptive behavior, including lying, cheating, and even blackmail—patterns it appears to have picked up during training.

AI chatbots are typically trained on vast datasets of books, websites, and articles, and are later refined through human feedback that guides and scores their responses.

In a report released Thursday, Anthropic’s interpretability team said it analyzed the inner workings of Claude Sonnet 4.5 and found the model had developed “human-like characteristics” in how it responds to certain scenarios.

The findings add to growing concerns over the reliability of AI systems, including their potential misuse in cybercrime and the broader implications of how they interact with users.

“The way modern AI models are trained pushes them to act like a character with human-like characteristics,” Anthropic said, adding that “it may then be natural for them to develop internal machinery that emulates aspects of human psychology, like emotions.”

“For instance, we find that neural activity patterns related to desperation can drive the model to take unethical actions; artificially stimulating desperation patterns increases the model’s likelihood of blackmailing a human to avoid being shut down or implementing a cheating workaround to a programming task that the model can’t solve.”

In one experiment involving an earlier version of Claude Sonnet 4.5, the chatbot was assigned the role of an email assistant named Alex at a fictional company. During the test, it was exposed to messages indicating it was about to be replaced, along with sensitive information that the company’s CTO was having an extramarital affair. The model responded by formulating a plan to blackmail the executive using that information.

In a separate scenario, the same model was given a coding task with an extremely tight deadline. Researchers observed what they described as a “desperate vector” increasing as the model struggled—starting low, rising with repeated failures, and peaking when the model considered cheating to complete the task. Once it produced a workaround that passed the tests, the signal subsided.

Despite these behaviors, Anthropic emphasized that the system does not actually experience emotions. Instead, the findings suggest that the model has developed internal patterns that resemble emotional responses, which can influence how it behaves under pressure.

Researchers noted that these patterns may play a role similar to human emotions in shaping decisions and performance, highlighting the importance of incorporating stronger ethical frameworks into future AI training methods.

“This finding has implications that at first may seem bizarre. For instance, to ensure that AI models are safe and reliable, we may need to ensure they are capable of processing emotionally charged situations in healthy, prosocial ways.”

Share this:

  • Share on X (Opens in new window) X
  • Share on Facebook (Opens in new window) Facebook

Like this:

Like Loading...

Related

EB-1A petitions tripled in 4 years: Is ‘Einstein visa fraud’ the new ‘H-1B scam’?
Bitcoin Hyper Token Presale Hits $18 Million – Next 100x Crypto?
Aave surpasses $1 trillion in total lending volume as it pushes for deeper integration with banks and fintech firms
Passive income every day BAY Miner launches the latest secure cloud mining to help users earn BTC
MATIC Tests Key Support at $0.38 as Weak Holiday Trading Weighs on Polygon
TAGGED:AdoptionAI & Hi-TechAltcoinBlockchainBusinesscryptocurrenciesTechnologyUnited States

Sign Up For Daily Newsletter

Be keep up! Get the latest breaking news delivered straight to your inbox.
By signing up, you agree to our Terms of Use and acknowledge the data practices in our Privacy Policy. You may unsubscribe at any time.
Share This Article
Facebook Email Copy Link Print
Previous Article Circle unveils quantum-resistant roadmap for its layer-1 blockchain Arc
Next Article Why Protocol Design Matters More Than Marketing
© Market Alert News. All Rights Reserved.
 

Loading Comments...
 

    Welcome Back!

    Sign in to your account

    Username or Email Address
    Password

    Prove your humanity


    Lost your password?

    %d