Anthropic's Claude Opus 4.7 Thinking model recently claimed the top spot on Arena's Code Arena WebDev leaderboard with a record 1566 Elo score as of May 12, propelled by exceptional agentic coding performance on real-world tasks like building live websites, outpacing prior Claude 4.6 versions by 20+ points and non-Anthropic rivals like GLM-5.1 by over 30. This surge underscores accelerating AI coding capabilities amid cutthroat competition from OpenAI's GPT-5.x series, Google's Gemini 3.1 Pro, and open-source contenders entering the top 10. With just six weeks until June 30 resolution, trader consensus hinges on whether imminent releases—such as potential GPT-5.5 or Grok updates—can breach higher thresholds like 1580 amid benchmark saturation risks and rapid leaderboard volatility.
Tóm tắt AI thử nghiệm tham chiếu dữ liệu Polymarket. Đây không phải tư vấn giao dịch và không ảnh hưởng đến cách thị trường này được giải quyết. · Cập nhật1550
54%
1560
59%
1570
33%
$7,719 KL.
1550
54%
1560
59%
1570
33%
Results from the "Score" column under the "Text Arena | Coding" Leaderboard tab at https://arena.ai/leaderboard/text/coding-no-style-control with style control off will be used to resolve this market.
The resolution source for this market is the Chatbot Arena LLM Leaderboard found at arena.ai/leaderboard/text. If this resolution source is unavailable at check time, this market will remain open until the leaderboard comes back online and will resolve based on the first check after it becomes available. If permanently unavailable, this market will resolve to "No".
Thị trường mở: Apr 2, 2026, 6:09 PM ET
Resolver
0x65070BE91...Results from the "Score" column under the "Text Arena | Coding" Leaderboard tab at https://arena.ai/leaderboard/text/coding-no-style-control with style control off will be used to resolve this market.
The resolution source for this market is the Chatbot Arena LLM Leaderboard found at arena.ai/leaderboard/text. If this resolution source is unavailable at check time, this market will remain open until the leaderboard comes back online and will resolve based on the first check after it becomes available. If permanently unavailable, this market will resolve to "No".
Resolver
0x65070BE91...Anthropic's Claude Opus 4.7 Thinking model recently claimed the top spot on Arena's Code Arena WebDev leaderboard with a record 1566 Elo score as of May 12, propelled by exceptional agentic coding performance on real-world tasks like building live websites, outpacing prior Claude 4.6 versions by 20+ points and non-Anthropic rivals like GLM-5.1 by over 30. This surge underscores accelerating AI coding capabilities amid cutthroat competition from OpenAI's GPT-5.x series, Google's Gemini 3.1 Pro, and open-source contenders entering the top 10. With just six weeks until June 30 resolution, trader consensus hinges on whether imminent releases—such as potential GPT-5.5 or Grok updates—can breach higher thresholds like 1580 amid benchmark saturation risks and rapid leaderboard volatility.
Tóm tắt AI thử nghiệm tham chiếu dữ liệu Polymarket. Đây không phải tư vấn giao dịch và không ảnh hưởng đến cách thị trường này được giải quyết. · Cập nhật
Cẩn thận với liên kết bên ngoài.
Cẩn thận với liên kết bên ngoài.
Câu hỏi thường gặp