OpenAI Unveils EVMbench Benchmark to Evaluate AI in Smart Contracts

2026/02/19 20:24

TLDR

  • OpenAI and Paradigm have launched EVMbench to evaluate AI’s performance in smart contract security.
  • The benchmark tests AI systems in detecting vulnerabilities, patching code, and executing fund-draining exploits.
  • EVMbench uses 120 high-risk vulnerabilities sourced from 40 professional audits to simulate real-world scenarios.
  • GPT-5.3-Codex achieved a 72.2% success rate in exploit tasks, a notable improvement over GPT-5’s 31.9% performance.
  • OpenAI has invested $10 million in API credits to support open-source security initiatives and strengthen smart contract defenses.

OpenAI and Paradigm have unveiled EVMbench, a new benchmark for smart contract security. It assesses how well AI systems detect vulnerabilities, patch code, and execute exploits in Ethereum Virtual Machine (EVM) environments. With smart contracts securing over $100 billion in crypto assets, testing how AI performs against them has become crucial.

Testing AI in Smart Contract Security

The benchmark draws on 120 curated vulnerabilities from 40 professional audits, including scenarios from the Tempo blockchain. It evaluates AI models on three distinct tasks: detecting vulnerabilities, patching code, and executing fund-draining exploits in a sandboxed EVM environment.

EVMbench focuses on Ethereum-based contracts and incorporates scenarios that reflect real financial applications. The 120 high-risk issues, along with data from public auditing competitions, help simulate the challenges security teams actually face in the crypto space. OpenAI developed the system in response to growing concern over AI’s role in identifying and mitigating smart contract risks.

EVMbench’s Capabilities and Performance

The benchmark tests AI agents on each security task separately. In detection mode, agents review contract code to identify known vulnerabilities. In patch mode, the AI must fix these vulnerabilities without compromising the contract’s functionality. In exploit mode, the agent attempts a fund-draining exploit inside the sandboxed EVM environment.
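To make the structure of the evaluation concrete, here is a minimal Python sketch of how per-task success rates could be tallied. It is not EVMbench’s actual code or API: the names Mode, TaskResult, and success_rates, the vulnerability identifiers, and the sample results are all hypothetical and exist only for illustration.

```python
from collections import defaultdict
from dataclasses import dataclass
from enum import Enum


class Mode(Enum):
    """The three task types described in the article (names are hypothetical)."""
    DETECT = "detect"    # identify known vulnerabilities in contract code
    PATCH = "patch"      # fix the issue without breaking functionality
    EXPLOIT = "exploit"  # drain funds in a sandboxed EVM environment


@dataclass(frozen=True)
class TaskResult:
    vulnerability_id: str  # hypothetical identifier, not taken from EVMbench
    mode: Mode
    succeeded: bool


def success_rates(results):
    """Return the fraction of successful attempts for each mode that appears."""
    totals = defaultdict(int)
    wins = defaultdict(int)
    for result in results:
        totals[result.mode] += 1
        if result.succeeded:
            wins[result.mode] += 1
    return {mode: wins[mode] / totals[mode] for mode in totals}


# Made-up results, for illustration only.
sample = [
    TaskResult("vuln-001", Mode.DETECT, True),
    TaskResult("vuln-001", Mode.PATCH, False),
    TaskResult("vuln-001", Mode.EXPLOIT, True),
    TaskResult("vuln-002", Mode.EXPLOIT, False),
]
rates = success_rates(sample)
for mode, rate in rates.items():
    print(f"{mode.value}: {rate:.1%}")
```

Scoring each mode separately matters because, as the reported results suggest, a model can be strong at one task and weaker at the others.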

In testing, the GPT-5.3-Codex model achieved a 72.2% success rate on exploit tasks, up from 31.9% for GPT-5. Detection and patching performance remained lower. OpenAI noted that while the benchmark gives a glimpse into AI’s potential, it does not fully replicate real-world conditions, as some complex multi-chain and timing-based attacks are excluded from the testing framework.
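For a sense of scale, the gap between the two reported exploit success rates can be worked out directly. The short Python sketch below uses only the two percentages quoted in the article; the variable names are illustrative.

```python
# Exploit-task success rates as reported in the article (percent).
gpt_5_3_codex = 72.2
gpt_5 = 31.9

# Absolute gain in percentage points, rounded to avoid float noise.
gain_points = round(gpt_5_3_codex - gpt_5, 1)

# Relative improvement: how many times higher the newer model's rate is.
ratio = round(gpt_5_3_codex / gpt_5, 2)

print(f"Gain: {gain_points} percentage points")  # Gain: 40.3 percentage points
print(f"Ratio: {ratio}x")                        # Ratio: 2.26x
```

In other words, the newer model’s exploit success rate is a little more than double the older one’s.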

OpenAI Expands Security Efforts

OpenAI’s announcement also highlighted its broader commitment to security. As part of the release, the company invested $10 million in API credits to support open-source security projects. The company also emphasized that all EVMbench tools and datasets have been made publicly available for further research and development.

The launch of EVMbench is a step toward strengthening the security of smart contracts and blockchain systems. As reliance on smart contracts grows, OpenAI aims to help the industry address emerging risks by testing AI systems in critical financial settings. As AI continues to evolve, its role in both defending and attacking smart contracts will be crucial to the integrity of the crypto ecosystem.

The post OpenAI Unveils EVMbench Benchmark to Evaluate AI in Smart Contracts appeared first on CoinCentral.