OpenAI’s GPT-5.5 Pro currently leads FrontierMath, a benchmark of unpublished research-level mathematics problems from Epoch AI, with scores near 52 percent after recent model iterations that leverage greater test-time compute and refined chain-of-thought reasoning. This positions the company ahead of close rivals including Anthropic’s Claude Opus series and Google’s Gemini models, which trail by only a few points amid rapid frontier-lab progress. Traders are watching for any new GPT variant or internal scaling update before June 30 that could push scores higher, while noting that benchmark saturation and potential scaffold changes introduce uncertainty even at current levels.
基於Polymarket數據的AI實驗性摘要。這不是交易建議,也不影響該市場的結算方式。 · 更新於$35,531 交易量
60%+
47%
70%+
24%
$35,531 交易量
60%+
47%
70%+
24%
This market will resolve according to the Epoch AI’s Frontier Math benchmarking leaderboard (https://epoch.ai/frontiermath) for Tier 1-3. Studies which are not included in the leaderboard (e.g. https://x.com/EpochAIResearch/status/1945905796904005720) will not be considered.
The primary resolution source will be information from EpochAI; however, a consensus of credible reporting may also be used.
市場開放時間: Jan 29, 2026, 12:47 PM ET
Resolver
0x65070BE91...This market will resolve according to the Epoch AI’s Frontier Math benchmarking leaderboard (https://epoch.ai/frontiermath) for Tier 1-3. Studies which are not included in the leaderboard (e.g. https://x.com/EpochAIResearch/status/1945905796904005720) will not be considered.
The primary resolution source will be information from EpochAI; however, a consensus of credible reporting may also be used.
Resolver
0x65070BE91...OpenAI’s GPT-5.5 Pro currently leads FrontierMath, a benchmark of unpublished research-level mathematics problems from Epoch AI, with scores near 52 percent after recent model iterations that leverage greater test-time compute and refined chain-of-thought reasoning. This positions the company ahead of close rivals including Anthropic’s Claude Opus series and Google’s Gemini models, which trail by only a few points amid rapid frontier-lab progress. Traders are watching for any new GPT variant or internal scaling update before June 30 that could push scores higher, while noting that benchmark saturation and potential scaffold changes introduce uncertainty even at current levels.
基於Polymarket數據的AI實驗性摘要。這不是交易建議,也不影響該市場的結算方式。 · 更新於
警惕外部連結哦。
警惕外部連結哦。
Frequently Asked Questions