Tech Share Share this article Copy linkX (Twitter)LinkedInFacebookEmail Sam Altman's OpenAI unveils ‘EVMbench’ to test Tech Share Share this article Copy linkX (Twitter)LinkedInFacebookEmail Sam Altman's OpenAI unveils ‘EVMbench’ to test

Sam Altman's OpenAI unveils ‘EVMbench’ to test whether AI can keep crypto’s smart contracts safe

2026/02/19 03:12
4 min read
Share
Share this article
Copy linkX (Twitter)LinkedInFacebookEmail

Sam Altman's OpenAI unveils ‘EVMbench’ to test whether AI can keep crypto’s smart contracts safe

Developed with Paradigm, the tool is OpenAI’s attempt to determine whether modern AI systems are up to the task of helping prevent smart contract issues.

By Margaux Nijkerk, AI Boost|Edited by Aoyon Ashraf
Feb 18, 2026, 7:12 p.m.
Make us preferred on Google

What to know:

  • OpenAI is stepping deeper into crypto security with the launch of EVMbench, a new testing framework designed to measure how well artificial intelligence can understand and potentially secure smart contracts on blockchains.
  • Smart contracts are typically immutable once deployed, and vulnerabilities can be serious.
  • EVMbench is OpenAI’s attempt to see whether modern AI systems are up to the task of helping prevent such issues.

OpenAI is stepping deeper into crypto security with the launch of EVMbench, a new testing framework designed to measure how well artificial intelligence can understand and potentially secure smart contracts on Ethereum and similar blockchains.

Smart contracts, self-executing code deployed on blockchains like Ethereum, underpin decentralized exchanges, lending protocols and a wide range of onchain financial applications. Because these contracts are typically immutable once deployed, vulnerabilities can be serious.

STORY CONTINUES BELOW
Don't miss another story.Subscribe to the The Protocol Newsletter today. See all newsletters
Sign me up

EVMbench is OpenAI’s attempt to see whether modern AI systems are up to the task of helping prevent those issues. Built in collaboration with crypto investment firm Paradigm, the benchmark draws on real-world smart contract vulnerabilities previously uncovered through audits and security competitions.

The system measures performance across three core abilities: identifying security bugs, exploiting those bugs in a controlled environment and fixing the vulnerable code without breaking the contracts.

OpenAI says the goal is to establish a clear standard for evaluating AI systems in blockchain security, especially as decentralized finance continues to secure billions of dollars in user funds. The stakes for smart contracts are only rising.

“Smart contracts routinely secure $100B+ in open-source crypto assets. As AI agents improve at reading, writing, and executing code, it becomes increasingly important to measure their capabilities in economically meaningful environments, and to encourage the use of AI systems defensively to audit and strengthen deployed contracts,” OpenAI wrote in a blog post.

Read more: Most Influential: Sam Altman

Sam AltmanOpenaismart contractsParadigm
AI Disclaimer: Parts of this article were generated with the assistance from AI tools and reviewed by our editorial team to ensure accuracy and adherence to our standards. For more information, see CoinDesk's full AI Policy.

More For You

Zoomex: Precise Systems of Fairness and Transparency by Design

Read full story

More For You

The Protocol: Zora moves to Solana

Also: EF’s Stańczak to leave ED role, XRPL member-only DEX and Ethereum revives the DAO.

What to know:

Welcome to The Protocol, CoinDesk's weekly wrap of the most important stories in cryptocurrency tech development. I’m Margaux Nijkerk, a reporter at CoinDesk.

In this issue:

  • Zora moves onto Solana with 'attention markets' for trading internet trends
  • Ethereum Foundation leadership shake-up: Tomasz Stańczak out as co-executive director
  • XRP Ledger rolls out members-only DEX for regulated institutions
  • From the 2016 hack to $150M Endowment: the DAO’s second act focuses on Ethereum security
Read full story
Latest Crypto News

Kraken continues acquisition streak by buying token management firm Magna ahead of IPO push

The Protocol: Zora moves to Solana

Optimism's OP token falls after Base moves away from the network's 'OP stack' in major tech shift

Ethereum’s 50% staking milestone triggers backlash over 'misleading' supply data

Bitcoin miner Riot stock jumps nearly 9% as activist Starboard urges AI data center expansion

Financial giant with $3.5 trillion asset to pilot Trump-affiliated WLFI stablecoin for tokenized funds

Top Stories

Bitcoin volatile, but flat, while crypto stocks bounce amid cooling AI fears

Goldman Sachs' David Solomon says he owns 'very little' bitcoin but watching it closely

Bitcoin's plunge signals coming AI crisis, but massive Fed response will drive new record high: Arthur Hayes

American crypto holders are scared and confused about this year’s new IRS tax rules

Hyperliquid starts DeFi lobbying group with $29 million token backing

From 2016 hack to $150M Endowment: the DAO’s second act focuses on Ethereum security

Market Opportunity
Smart Blockchain Logo
Smart Blockchain Price(SMART)
$0.004363
$0.004363$0.004363
-2.67%
USD
Smart Blockchain (SMART) Live Price Chart
Disclaimer: The articles reposted on this site are sourced from public platforms and are provided for informational purposes only. They do not necessarily reflect the views of MEXC. All rights remain with the original authors. If you believe any content infringes on third-party rights, please contact service@support.mexc.com for removal. MEXC makes no guarantees regarding the accuracy, completeness, or timeliness of the content and is not responsible for any actions taken based on the information provided. The content does not constitute financial, legal, or other professional advice, nor should it be considered a recommendation or endorsement by MEXC.

You May Also Like

Structural job strain caps rand gains – Commerzbank

Structural job strain caps rand gains – Commerzbank

The post Structural job strain caps rand gains – Commerzbank appeared on BitcoinEthereumNews.com. Commerzbank’s Volkmar Baur highlights that South Africa’s unemployment
Share
BitcoinEthereumNews2026/02/19 05:27
Trump gushes over Nicki Minaj's skin to mark Black History Month: 'So beautiful'

Trump gushes over Nicki Minaj's skin to mark Black History Month: 'So beautiful'

President Donald Trump used an event marking Black History Month to remark on Nicki Minaj's complexion."I love Nikki Minaj," the president told the audience. "She
Share
Rawstory2026/02/19 05:07
Google's AP2 protocol has been released. Does encrypted AI still have a chance?

Google's AP2 protocol has been released. Does encrypted AI still have a chance?

Following the MCP and A2A protocols, the AI Agent market has seen another blockbuster arrival: the Agent Payments Protocol (AP2), developed by Google. This will clearly further enhance AI Agents' autonomous multi-tasking capabilities, but the unfortunate reality is that it has little to do with web3AI. Let's take a closer look: What problem does AP2 solve? Simply put, the MCP protocol is like a universal hook, enabling AI agents to connect to various external tools and data sources; A2A is a team collaboration communication protocol that allows multiple AI agents to cooperate with each other to complete complex tasks; AP2 completes the last piece of the puzzle - payment capability. In other words, MCP opens up connectivity, A2A promotes collaboration efficiency, and AP2 achieves value exchange. The arrival of AP2 truly injects "soul" into the autonomous collaboration and task execution of Multi-Agents. Imagine AI Agents connecting Qunar, Meituan, and Didi to complete the booking of flights, hotels, and car rentals, but then getting stuck at the point of "self-payment." What's the point of all that multitasking? So, remember this: AP2 is an extension of MCP+A2A, solving the last mile problem of AI Agent automated execution. What are the technical highlights of AP2? The core innovation of AP2 is the Mandates mechanism, which is divided into real-time authorization mode and delegated authorization mode. Real-time authorization is easy to understand. The AI Agent finds the product and shows it to you. The operation can only be performed after the user signs. Delegated authorization requires the user to set rules in advance, such as only buying the iPhone 17 when the price drops to 5,000. The AI Agent monitors the trigger conditions and executes automatically. The implementation logic is cryptographically signed using Verifiable Credentials (VCs). Users can set complex commission conditions, including price ranges, time limits, and payment method priorities, forming a tamper-proof digital contract. Once signed, the AI Agent executes according to the conditions, with VCs ensuring auditability and security at every step. Of particular note is the "A2A x402" extension, a technical component developed by Google specifically for crypto payments, developed in collaboration with Coinbase and the Ethereum Foundation. This extension enables AI Agents to seamlessly process stablecoins, ETH, and other blockchain assets, supporting native payment scenarios within the Web3 ecosystem. What kind of imagination space can AP2 bring? After analyzing the technical principles, do you think that's it? Yes, in fact, the AP2 is boring when it is disassembled alone. Its real charm lies in connecting and opening up the "MCP+A2A+AP2" technology stack, completely opening up the complete link of AI Agent's autonomous analysis+execution+payment. From now on, AI Agents can open up many application scenarios. For example, AI Agents for stock investment and financial management can help us monitor the market 24/7 and conduct independent transactions. Enterprise procurement AI Agents can automatically replenish and renew without human intervention. AP2's complementary payment capabilities will further expand the penetration of the Agent-to-Agent economy into more scenarios. Google obviously understands that after the technical framework is established, the ecological implementation must be relied upon, so it has brought in more than 60 partners to develop it, almost covering the entire payment and business ecosystem. Interestingly, it also involves major Crypto players such as Ethereum, Coinbase, MetaMask, and Sui. Combined with the current trend of currency and stock integration, the imagination space has been doubled. Is web3 AI really dead? Not entirely. Google's AP2 looks complete, but it only achieves technical compatibility with Crypto payments. It can only be regarded as an extension of the traditional authorization framework and belongs to the category of automated execution. There is a "paradigm" difference between it and the autonomous asset management pursued by pure Crypto native solutions. The Crypto-native solutions under exploration are taking the "decentralized custody + on-chain verification" route, including AI Agent autonomous asset management, AI Agent autonomous transactions (DeFAI), AI Agent digital identity and on-chain reputation system (ERC-8004...), AI Agent on-chain governance DAO framework, AI Agent NPC and digital avatars, and many other interesting and fun directions. Ultimately, once users get used to AI Agent payments in traditional fields, their acceptance of AI Agents autonomously owning digital assets will also increase. And for those scenarios that AP2 cannot reach, such as anonymous transactions, censorship-resistant payments, and decentralized asset management, there will always be a time for crypto-native solutions to show their strength? The two are more likely to be complementary rather than competitive, but to be honest, the key technological advancements behind AI Agents currently all come from web2AI, and web3AI still needs to keep up the good work!
Share
PANews2025/09/18 07:00