OpenAI's GPT-5.5 Pro currently leads the FrontierMath leaderboard at 52.4%, with the broader GPT-5 series clustered between 47% and 52% on this Epoch AI benchmark of unpublished, research-level mathematics problems that require hours or days for expert human solvers. Recent iterative releases have driven rapid gains, lifting top scores from roughly 40% in late 2025 to the current plateau amid tight competition from Anthropic's Claude Opus 4.x and Google's Gemini models. OpenAI's exclusive access to portions of the dataset and continued scaling of reasoning techniques remain the dominant factors behind trader consensus on further incremental progress before June 30, though benchmark saturation and potential delays in new model training could limit upside in the narrow window.
Experimental AI summary based on Polymarket data. This is not trading advice and does not affect how this market resolves.
$35,531 volume
60%+: 57%
70%+: 24%
This market will resolve according to Epoch AI's FrontierMath benchmarking leaderboard (https://epoch.ai/frontiermath) for Tiers 1-3. Studies that are not included in the leaderboard (e.g. https://x.com/EpochAIResearch/status/1945905796904005720) will not be considered.
The primary resolution source will be information from Epoch AI; however, a consensus of credible reporting may also be used.
Market opened: Jan 29, 2026, 12:47 PM ET
Resolver
0x65070BE91...
Frequently Asked Questions