Anthropic's Claude Opus 4.7 Thinking model currently leads the Code Arena WebDev leaderboard at 1567 Elo and the Image-to-WebDev leaderboard at 1581. The scores come from user-voted blind evaluations on arena.ai and reflect its strength in agentic coding on real-world tasks such as building interactive websites and apps. Released in mid-April 2026, the model gained 37 points over its predecessor and widened its lead to 35 or more points over rivals such as Z.ai's GLM-5.1 (1532) and OpenAI's GPT-5.5 variants. Competitive pressure from Google Gemini 3.1 Pro, xAI Grok 4.20, and Alibaba Qwen3.6 Plus is driving rapid iteration, and new sub-leaderboards launched May 8 highlight domain-specific strengths. Traders are watching for June model releases, potentially GPT-5.6 or Claude 4.8, that could push scores past key thresholds by June 30, amid benchmark volatility from ongoing battles.
Experimental AI summary based on Polymarket data. This is not trading advice and does not affect how this market is resolved.
1550: 54%
1560: 59%
1570: 32%
$7,720 volume
Results from the "Score" column under the "Text Arena | Coding" Leaderboard tab at https://arena.ai/leaderboard/text/coding-no-style-control with style control off will be used to resolve this market.
The resolution source for this market is the Chatbot Arena LLM Leaderboard found at arena.ai/leaderboard/text. If this resolution source is unavailable at check time, this market will remain open until the leaderboard comes back online and will resolve based on the first check after it becomes available. If permanently unavailable, this market will resolve to "No".
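The fallback rules above can be sketched as a small decision function. This is only an illustrative sketch: the names `SourceStatus` and `resolve_market` are hypothetical, and the "score at or above threshold resolves Yes" comparison is an assumption, since the rules quoted here do not state the exact comparison or which leaderboard entry is checked.

```python
from enum import Enum
from typing import Optional


class SourceStatus(Enum):
    """Hypothetical states of the leaderboard at check time."""
    AVAILABLE = "available"
    TEMPORARILY_UNAVAILABLE = "temporarily_unavailable"
    PERMANENTLY_UNAVAILABLE = "permanently_unavailable"


def resolve_market(status: SourceStatus,
                   score: Optional[float],
                   threshold: float) -> Optional[str]:
    """Return "Yes", "No", or None when the market stays open."""
    # Per the rules: a permanently unavailable source resolves to "No".
    if status is SourceStatus.PERMANENTLY_UNAVAILABLE:
        return "No"
    # Per the rules: while the source is down, the market remains open
    # and resolves on the first check after it comes back online.
    if status is SourceStatus.TEMPORARILY_UNAVAILABLE or score is None:
        return None
    # ASSUMPTION (not stated in the rules): the market resolves "Yes"
    # when the "Score" column value is at or above the threshold.
    return "Yes" if score >= threshold else "No"


if __name__ == "__main__":
    # 1560 is one of the listed thresholds; 1565.0 is a made-up score.
    print(resolve_market(SourceStatus.AVAILABLE, 1565.0, 1560))
    print(resolve_market(SourceStatus.TEMPORARILY_UNAVAILABLE, None, 1560))
    print(resolve_market(SourceStatus.PERMANENTLY_UNAVAILABLE, None, 1560))
```

Returning `None` for the "remains open" case keeps it distinct from a "No" resolution, which the rules reserve for a score that falls short or a permanently unavailable source.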
Market opened: Apr 2, 2026, 6:09 PM ET
Resolver
0x65070BE91...
Frequently asked questions