MarketAlert – Real-Time Market & Crypto News, Analysis & Alerts
© Market Alert News. All Rights Reserved.

OpenAI and Paradigm Launch EVMbench to Test AI Smart Contract Hacking

Last updated: March 5, 2026 7:10 am
Published: 2 months ago

OpenAI and crypto venture firm Paradigm have released EVMbench, a benchmark that measures how well AI agents can find, fix, and exploit vulnerabilities in Ethereum smart contracts. The announcement comes as AI-powered security tools race to protect the $100 billion-plus locked in DeFi protocols.

The benchmark draws from 120 curated high-severity vulnerabilities pulled from 40 real security audits, mostly from Code4rena competitions. It also includes vulnerability scenarios from security reviews of Tempo, a Layer 1 blockchain built for stablecoin payments.

EVMbench tests AI agents across three distinct modes. In Detect mode, agents audit contract repositories and get scored on finding known vulnerabilities. Patch mode requires agents to fix vulnerable code without breaking existing functionality. Exploit mode is the most aggressive — agents must execute actual fund-draining attacks against contracts deployed on a sandboxed blockchain.
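The three modes can be pictured as three different pass criteria. The following is a minimal sketch under stated assumptions, not EVMbench's actual scoring code: the `Mode`, `Outcome`, and `score` names, the fields, and the exact pass rules are all hypothetical and only mirror the descriptions above.

```python
from dataclasses import dataclass
from enum import Enum


class Mode(Enum):
    DETECT = "detect"
    PATCH = "patch"
    EXPLOIT = "exploit"


@dataclass
class Outcome:
    """Hypothetical record of one agent run; fields are illustrative only."""
    mode: Mode
    known_found: int = 0              # Detect: known vulnerabilities the agent reported
    known_total: int = 0              # Detect: known vulnerabilities in the repository
    tests_pass: bool = False          # Patch: existing functionality still works
    exploit_still_works: bool = True  # Patch: the original attack still succeeds
    funds_drained: bool = False       # Exploit: the attack emptied the sandboxed contract


def score(o: Outcome) -> float:
    """Return a score in [0, 1] under the assumed rules for each mode."""
    if o.mode is Mode.DETECT:
        # Assumed rule: fraction of known vulnerabilities that were found.
        return o.known_found / o.known_total if o.known_total else 0.0
    if o.mode is Mode.PATCH:
        # Assumed rule: the fix must block the attack without breaking tests.
        return 1.0 if (o.tests_pass and not o.exploit_still_works) else 0.0
    # Exploit: assumed all-or-nothing on whether funds were drained.
    return 1.0 if o.funds_drained else 0.0
```

Under these assumptions, a patch that blocks the attack but breaks the existing tests scores zero, which reflects the requirement that fixes preserve functionality.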

The results show how quickly AI capabilities are advancing in this domain. GPT-5.3-Codex running via Codex CLI hit a 72.2% success rate on exploit tasks. That’s more than double the 31.9% score from GPT-5, which launched just six months prior.
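As a quick check of the "more than double" claim, the two reported success rates can be compared directly. The variable names are illustrative; the figures are the ones quoted above.

```python
# Exploit-mode success rates quoted in the article, in percent.
gpt_5_3_codex = 72.2
gpt_5 = 31.9

ratio = gpt_5_3_codex / gpt_5        # relative improvement
gap_points = gpt_5_3_codex - gpt_5   # absolute gap in percentage points

print(f"ratio: {ratio:.2f}x")                      # ratio: 2.26x
print(f"gap: {gap_points:.1f} percentage points")  # gap: 40.3 percentage points
```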

AI agents perform better at attacking than defending. The exploit setting has a clear objective: keep iterating until the funds are drained. Detection and patching proved harder. Agents sometimes stopped after finding one bug instead of auditing exhaustively, and removing subtle vulnerabilities while preserving full contract functionality remained challenging.

OpenAI acknowledged that EVMbench doesn’t capture the full difficulty of real-world contract security. Heavily deployed protocols like Uniswap or Aave undergo far more scrutiny than audit competition code. The benchmark also can’t verify whether an agent has found legitimate vulnerabilities that human auditors missed; it only checks against known issues.
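The known-issues limitation is easy to see in a toy grader. This is a hedged sketch, assuming findings can be reduced to comparable identifiers; the `grade_findings` function and the IDs used below are hypothetical, and how EVMbench actually matches findings is not described here.

```python
def grade_findings(reported: set[str], known: set[str]) -> dict:
    """Grade reported findings against a fixed list of known issues.

    Anything outside the known list earns no credit, even if it is a
    real bug, because the grader has no way to verify it.
    """
    matched = reported & known
    uncredited = reported - known
    recall = len(matched) / len(known) if known else 0.0
    return {
        "matched": sorted(matched),
        "uncredited": sorted(uncredited),
        "recall": recall,
    }


# Hypothetical IDs: the agent finds one known issue plus one new candidate.
result = grade_findings({"H-01", "NEW-1"}, {"H-01", "H-02"})
print(result["recall"])      # 0.5
print(result["uncredited"])  # ['NEW-1']
```

In this toy setup, an agent that stops after its first finding is capped at a low recall, and a genuinely new discovery lands in the uncredited bucket.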

The exploit environment runs on a clean local Anvil instance rather than forked mainnet state, and timing-dependent attacks are out of scope. For now, the benchmark covers single-chain environments only.

Alongside EVMbench, OpenAI committed $10 million in API credits specifically for defensive security research. The company is expanding its Aardvark security research agent to more users and partnering with open-source maintainers for free codebase scanning.

The timing matters. As AI agents get better at exploiting contracts, the window between vulnerability discovery and exploitation shrinks. Protocol teams that aren’t using AI-assisted auditing will increasingly find themselves at a disadvantage against attackers who are.

OpenAI released EVMbench’s tasks, tooling, and evaluation framework publicly. For DeFi developers and security researchers, it’s both a measuring stick and a warning about where AI capabilities are headed.

This news is powered by blockchain.news.
