Anthropic’s dominant 70% market-implied odds reflect its Claude Opus 4.6 and Sonnet 4.6 models’ consistent top rankings on math-specific benchmarks such as AIME and MATH, where they deliver near-perfect scores through advanced chain-of-thought reasoning and long-context handling. Recent releases and third-party evaluations have highlighted these models’ edge in multi-step problem solving over OpenAI’s GPT-5.4 and Google’s Gemini 3.1 Pro variants, which trail despite strong showings in related reasoning tasks. With the May 31 resolution date approaching, traders are watching for any final model updates or benchmark refreshes that could shift the current consensus on which large language model leads in pure mathematical capability.
基於Polymarket數據的AI實驗性摘要。這不是交易建議,也不影響該市場的結算方式。 · 更新於Anthropic 70%
OpenAI 17%
Google 14%
xAI <1%
$121,626 交易量
$121,626 交易量

Anthropic
70%

OpenAI
17%

14%

xAI
1%

ByteDance
1%

Z.ai
1%

DeepSeek
1%

Meta
1%

Baidu
<1%

Alibaba
<1%

Moonshot
<1%

Amazon
<1%

Mistral
<1%

Meituan
<1%

Microsoft
<1%
Anthropic 70%
OpenAI 17%
Google 14%
xAI <1%
$121,626 交易量
$121,626 交易量

Anthropic
70%

OpenAI
17%

14%

xAI
1%

ByteDance
1%

Z.ai
1%

DeepSeek
1%

Meta
1%

Baidu
<1%

Alibaba
<1%

Moonshot
<1%

Amazon
<1%

Mistral
<1%

Meituan
<1%

Microsoft
<1%
Results from the "Rank" column under the "Text Arena | Math" Leaderboard tab at https://arena.ai/leaderboard/text/math-no-style-control with style control off will be used to resolve this market.
Models will be ordered primarily by their leaderboard rank at the market’s check time. If two or more models are tied on rank, they will be ordered by their Arena score, including any underlying, unrounded, granular values reflected in the data below the leaderboard. If a tie still remains, alphabetical order of company names as listed in this market group will be used as a final tiebreaker (e.g., if the two models are tied by exact arena score, “Google” would be ranked ahead of “xAI”). This market will resolve based on the company that occupies first place under this ranking.
The resolution source for this market is the Chatbot Arena LLM Leaderboard found at https://lmarena.ai/. If this resolution source is unavailable at check time, this market will remain open until the leaderboard comes back online and will resolve based on the first check after it becomes available. If it becomes permanently unavailable, this market will resolve based on another resolution source.
市場開放時間: Apr 27, 2026, 5:49 PM ET
Resolver
0x69c47De9D...Results from the "Rank" column under the "Text Arena | Math" Leaderboard tab at https://arena.ai/leaderboard/text/math-no-style-control with style control off will be used to resolve this market.
Models will be ordered primarily by their leaderboard rank at the market’s check time. If two or more models are tied on rank, they will be ordered by their Arena score, including any underlying, unrounded, granular values reflected in the data below the leaderboard. If a tie still remains, alphabetical order of company names as listed in this market group will be used as a final tiebreaker (e.g., if the two models are tied by exact arena score, “Google” would be ranked ahead of “xAI”). This market will resolve based on the company that occupies first place under this ranking.
The resolution source for this market is the Chatbot Arena LLM Leaderboard found at https://lmarena.ai/. If this resolution source is unavailable at check time, this market will remain open until the leaderboard comes back online and will resolve based on the first check after it becomes available. If it becomes permanently unavailable, this market will resolve based on another resolution source.
Resolver
0x69c47De9D...Anthropic’s dominant 70% market-implied odds reflect its Claude Opus 4.6 and Sonnet 4.6 models’ consistent top rankings on math-specific benchmarks such as AIME and MATH, where they deliver near-perfect scores through advanced chain-of-thought reasoning and long-context handling. Recent releases and third-party evaluations have highlighted these models’ edge in multi-step problem solving over OpenAI’s GPT-5.4 and Google’s Gemini 3.1 Pro variants, which trail despite strong showings in related reasoning tasks. With the May 31 resolution date approaching, traders are watching for any final model updates or benchmark refreshes that could shift the current consensus on which large language model leads in pure mathematical capability.
基於Polymarket數據的AI實驗性摘要。這不是交易建議,也不影響該市場的結算方式。 · 更新於
警惕外部連結哦。
警惕外部連結哦。
Frequently Asked Questions