Recent releases from leading labs have driven Chatbot Arena scores to new highs, with Anthropic’s Claude Opus 4.7 Thinking variant reaching an Overall Arena Score of 1505 through iterative “thinking” optimizations that enhance reasoning across text, code, and creative tasks. OpenAI’s GPT-5.5 series and Google’s Gemini 3.1 Pro Preview sit just behind at 1506 and 1505 respectively, reflecting tight competition fueled by rapid scaling, post-training refinements, and larger context windows. Traders view sustained frontier progress—marked by monthly model updates and benchmark gains—as the key driver for breaching higher thresholds by year-end, though diminishing returns from evaluation saturation and potential release delays remain key uncertainties ahead of Q3 developer conferences.
基於Polymarket數據的AI實驗性摘要。這不是交易建議,也不影響該市場的結算方式。 · 更新於$90,106 交易量
↑ 1550
29%
↑ 1600
19%
↑ 1650
11%
↑ 1700
10%
$90,106 交易量
↑ 1550
29%
↑ 1600
19%
↑ 1650
11%
↑ 1700
10%
Results from the 'Score' section on the 'Text Arena' Leaderboard tab (https://lmarena.ai/leaderboard/text), with the style control unchecked, will be used to resolve this market.
The resolution source is the Chatbot Arena LLM Leaderboard (https://lmarena.ai/). If this source is temporarily unavailable, the market remains open until it is accessible again; if permanently unavailable, this market will resolve to "No".
市場開放時間: Jan 2, 2026, 1:29 PM ET
Resolver
0x65070BE91...Results from the 'Score' section on the 'Text Arena' Leaderboard tab (https://lmarena.ai/leaderboard/text), with the style control unchecked, will be used to resolve this market.
The resolution source is the Chatbot Arena LLM Leaderboard (https://lmarena.ai/). If this source is temporarily unavailable, the market remains open until it is accessible again; if permanently unavailable, this market will resolve to "No".
Resolver
0x65070BE91...Recent releases from leading labs have driven Chatbot Arena scores to new highs, with Anthropic’s Claude Opus 4.7 Thinking variant reaching an Overall Arena Score of 1505 through iterative “thinking” optimizations that enhance reasoning across text, code, and creative tasks. OpenAI’s GPT-5.5 series and Google’s Gemini 3.1 Pro Preview sit just behind at 1506 and 1505 respectively, reflecting tight competition fueled by rapid scaling, post-training refinements, and larger context windows. Traders view sustained frontier progress—marked by monthly model updates and benchmark gains—as the key driver for breaching higher thresholds by year-end, though diminishing returns from evaluation saturation and potential release delays remain key uncertainties ahead of Q3 developer conferences.
基於Polymarket數據的AI實驗性摘要。這不是交易建議,也不影響該市場的結算方式。 · 更新於
警惕外部連結哦。
警惕外部連結哦。
Frequently Asked Questions