Recent releases from Anthropic, including Claude Opus 4.7 variants, have driven top Coding Arena scores on Arena.AI to the 1530–1535 Elo range through superior performance in blind human evaluations of code generation, refactoring, and debugging tasks. This positions frontier models like Claude ahead of OpenAI’s GPT-5.4 Codex series and Google’s Gemini 3.1 previews, which trail by 20–50 points amid ongoing scaling improvements in large language model training. With roughly 45 days until the June 30 resolution, traders are watching for potential new checkpoints or upgrades from major labs that could add 20–50 Elo points, though historical patterns show incremental rather than sudden jumps in arena leaderboards.
Experimental AI-generated summary referencing Polymarket data. This is not trading advice and plays no role in how this market is resolved.
1550: 54%
1560: 59%
1570: 41%
$7,730 Vol.
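As a rough illustration of the arithmetic behind these levels, the sketch below computes how many Elo points the top score would still need to gain from the 1530–1535 range quoted in the summary to reach each listed level. It assumes, without confirmation from the market text, that an outcome is satisfied when the top score is at or above the level; `points_needed` is a hypothetical helper name.

```python
# Current top-score range quoted in the summary (Elo points).
CURRENT_LOW, CURRENT_HIGH = 1530, 1535

# Score levels listed for this market.
THRESHOLDS = [1550, 1560, 1570]


def points_needed(threshold, low=CURRENT_LOW, high=CURRENT_HIGH):
    """Return (min_gap, max_gap): points still needed to reach `threshold`.

    ASSUMPTION: "reaching" a level means score >= threshold.
    The gap is clamped at 0 if the level has already been passed.
    """
    return (max(threshold - high, 0), max(threshold - low, 0))


gaps = {t: points_needed(t) for t in THRESHOLDS}

for level, (min_gap, max_gap) in gaps.items():
    print(f"{level}: needs {min_gap}-{max_gap} more points")
```

Under that assumption, the 1550 level sits 15–20 points above the quoted range, which falls at or just below the bottom of the 20–50 point gain the summary says a new checkpoint could add.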
Results from the "Score" column under the "Text Arena | Coding" Leaderboard tab at https://arena.ai/leaderboard/text/coding-no-style-control with style control off will be used to resolve this market.
The resolution source for this market is the Chatbot Arena LLM Leaderboard found at arena.ai/leaderboard/text. If this resolution source is unavailable at check time, this market will remain open until the leaderboard comes back online and will resolve based on the first check after it becomes available. If permanently unavailable, this market will resolve to "No".
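The availability rules above could be sketched roughly as follows. This is a minimal illustration, not Polymarket's actual resolution mechanism: the `SourceStatus` enum and the `resolve` function are hypothetical names, and the comparison `top_score >= threshold` is an assumption, since the exact "Yes" condition is not stated in the rules text here.

```python
from enum import Enum
from typing import Optional


class SourceStatus(Enum):
    """Hypothetical states of the leaderboard at check time."""
    AVAILABLE = "available"
    TEMPORARILY_UNAVAILABLE = "temporarily_unavailable"
    PERMANENTLY_UNAVAILABLE = "permanently_unavailable"


def resolve(status: SourceStatus,
            top_score: Optional[float],
            threshold: float) -> Optional[str]:
    """Return "Yes", "No", or None (market remains open)."""
    # Rule from the text: a permanently unavailable source resolves to "No".
    if status is SourceStatus.PERMANENTLY_UNAVAILABLE:
        return "No"
    # Rule from the text: if the source is down at check time, the market
    # stays open until the first check after it comes back online.
    if status is SourceStatus.TEMPORARILY_UNAVAILABLE or top_score is None:
        return None
    # ASSUMPTION: "Yes" when the "Score" column value meets the level.
    return "Yes" if top_score >= threshold else "No"
```

For example, `resolve(SourceStatus.TEMPORARILY_UNAVAILABLE, None, 1550)` returns `None`, meaning the check is simply repeated once the leaderboard is back.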
Market Opened: Apr 2, 2026, 6:09 PM ET
Resolver
0x65070BE91...
Frequently Asked Questions