Recent breakthroughs from leading AI labs have pushed the frontier of large language model performance on the LMSYS Chatbot Arena leaderboard, with Anthropic’s Claude Opus 4.7 Thinking model recently posting a record 1503–1506 Elo overall score through strong results in coding, math, and conversational tasks. This has intensified competition, as OpenAI’s GPT-5.5 series and Google’s Gemini 3.1 Pro now sit within a few points, while open-weight challengers like Z.ai’s GLM-5.1 and DeepSeek V4 continue to close the gap on efficiency benchmarks. With roughly six weeks remaining until June 30, traders are watching for possible mid-cycle updates, new reasoning-focused variants, or expanded evaluation data that could lift any model above the target threshold before resolution.
Resumen experimental generado por IA con datos de Polymarket. Esto no es asesoramiento de trading y no influye en cómo se resuelve este mercado. · Actualizado$16,797 Vol.
1510
23%
1520
20%
1530
13%
$16,797 Vol.
1510
23%
1520
20%
1530
13%
Results from the "Score" column under the "Text Arena | Overall" Leaderboard tab at https://lmarena.ai/leaderboard/text with style control off will be used to resolve this market.
The resolution source for this market is the Chatbot Arena LLM Leaderboard found at arena.ai/leaderboard/text. If this resolution source is unavailable at check time, this market will remain open until the leaderboard comes back online and will resolve based on the first check after it becomes available. If permanently unavailable, this market will resolve to "No".
Mercado abierto: Apr 2, 2026, 6:02 PM ET
Resolver
0x65070BE91...Results from the "Score" column under the "Text Arena | Overall" Leaderboard tab at https://lmarena.ai/leaderboard/text with style control off will be used to resolve this market.
The resolution source for this market is the Chatbot Arena LLM Leaderboard found at arena.ai/leaderboard/text. If this resolution source is unavailable at check time, this market will remain open until the leaderboard comes back online and will resolve based on the first check after it becomes available. If permanently unavailable, this market will resolve to "No".
Resolver
0x65070BE91...Recent breakthroughs from leading AI labs have pushed the frontier of large language model performance on the LMSYS Chatbot Arena leaderboard, with Anthropic’s Claude Opus 4.7 Thinking model recently posting a record 1503–1506 Elo overall score through strong results in coding, math, and conversational tasks. This has intensified competition, as OpenAI’s GPT-5.5 series and Google’s Gemini 3.1 Pro now sit within a few points, while open-weight challengers like Z.ai’s GLM-5.1 and DeepSeek V4 continue to close the gap on efficiency benchmarks. With roughly six weeks remaining until June 30, traders are watching for possible mid-cycle updates, new reasoning-focused variants, or expanded evaluation data that could lift any model above the target threshold before resolution.
Resumen experimental generado por IA con datos de Polymarket. Esto no es asesoramiento de trading y no influye en cómo se resuelve este mercado. · Actualizado
Cuidado con los enlaces externos.
Cuidado con los enlaces externos.
Preguntas frecuentes