- A new tool from OpenAI evaluates AI agents’ ability to identify, patch, or exploit smart contract vulnerabilities.
- Researchers found that agents are better at exploiting vulnerabilities than finding or patching them.
- The release comes just days after a bug in AI-generated code cost Moonwell users nearly $2.7 million in crypto.

OpenAI and crypto venture capital firm Paradigm on Wednesday released a tool that evaluates AI agents’ ability to identify, patch, or exploit smart contract vulnerabilities.
The tool, EVMbench, draws on 120 vulnerabilities identified across 40 prior smart contract audits, as well as “vulnerability scenarios” taken from audits of Paradigm’s forthcoming Tempo blockchain.







