OpenAI and Paradigm have created EVMbench, a framework testing how effectively AI can identify, exploit and repair Ethereum smart contract flaws. The post OpenAIOpenAI and Paradigm have created EVMbench, a framework testing how effectively AI can identify, exploit and repair Ethereum smart contract flaws. The post OpenAI

OpenAI Launches EVMbench to Test AI’s Ability to Secure Ethereum Smart Contracts

2026/02/19 14:43
2분 읽기
  • OpenAI and Paradigm introduced EVMbench to assess AI systems’ ability to handle Ethereum smart contract vulnerabilities.
  • The benchmark uses 120 real-world audit issues and evaluates detection, repair and exploit capabilities in controlled environments.
  • Early results show significant performance differences between GPT-5.3-Codex and GPT-5, highlighting rapid model advancement.

OpenAI has unveiled EVMbench, a smart contract security benchmark developed alongside crypto investment firm Paradigm to test artificial intelligence agents on Ethereum vulnerabilities. The framework is intended to determine whether AI systems can detect, exploit and fix serious flaws in Ethereum smart contracts.

Because smart contracts are generally immutable once deployed, errors can have enduring financial consequences. OpenAI said such contracts routinely protect more than US$100 billion (AU$141 billion) in open-source crypto assets, increasing the importance of rigorous security evaluation as AI coding capabilities advance.

Related: Stripe-Owned Bridge Wins Conditional OCC Approval to Become National Crypto Bank

Measuring AI Performance

The dataset underpinning EVMbench consists of 120 curated vulnerabilities drawn from 40 professional audits, with most sourced from open audit competitions including Code4rena. Additional scenarios stem from security auditing work for Tempo, a purpose-built Layer-1 blockchain designed to support high-throughput, low-cost stablecoin payments.

AI agents are assessed across three categories: detecting known vulnerabilities, patching contracts without compromising intended functionality, and executing exploit attempts within a controlled blockchain environment. Exploit tasks are graded using deterministic transaction replay and on-chain checks.

In benchmark results, GPT-5.3-Codex achieved 72.2% in exploit mode, while GPT-5 recorded 31.9%, despite being released just over six months earlier. OpenAI said the objective is to create a clear standard for evaluating AI systems in blockchain security as decentralised finance continues to grow.

Related: Ledger Integrates OKX DEX to Enable In-App Multichain Token Swaps

The post OpenAI Launches EVMbench to Test AI’s Ability to Secure Ethereum Smart Contracts appeared first on Crypto News Australia.

시장 기회
Smart Blockchain 로고
Smart Blockchain 가격(SMART)
$0,004403
$0,004403$0,004403
-1,78%
USD
Smart Blockchain (SMART) 실시간 가격 차트
면책 조항: 본 사이트에 재게시된 글들은 공개 플랫폼에서 가져온 것으로 정보 제공 목적으로만 제공됩니다. 이는 반드시 MEXC의 견해를 반영하는 것은 아닙니다. 모든 권리는 원저자에게 있습니다. 제3자의 권리를 침해하는 콘텐츠가 있다고 판단될 경우, service@support.mexc.com으로 연락하여 삭제 요청을 해주시기 바랍니다. MEXC는 콘텐츠의 정확성, 완전성 또는 시의적절성에 대해 어떠한 보증도 하지 않으며, 제공된 정보에 기반하여 취해진 어떠한 조치에 대해서도 책임을 지지 않습니다. 본 콘텐츠는 금융, 법률 또는 기타 전문적인 조언을 구성하지 않으며, MEXC의 추천이나 보증으로 간주되어서는 안 됩니다.