Skip to main content
icon for Anthropic Claude在6月30日前在FrontierMath Benchmark上得分?

Anthropic Claude在6月30日前在FrontierMath Benchmark上得分?

icon for Anthropic Claude在6月30日前在FrontierMath Benchmark上得分?

Anthropic Claude在6月30日前在FrontierMath Benchmark上得分?

$61,944 交易量

2026-02-28
Polymarket

$61,944 交易量

Polymarket

50%+

$14,910 交易量

52%

This market will resolve to "Yes" if any Anthropic Claude model achieves the listed score or greater on the FrontierMath Exam by June 30, 2026, 11:59 PM ET. Otherwise, the market will resolve to "No". This market will resolve according to the Epoch AI’s Frontier Math benchmarking leaderboard (https://epoch.ai/frontiermath) for Tier 1-3. Studies which are not included in the leaderboard (e.g. https://x.com/EpochAIResearch/status/1945905796904005720) will not be considered. The primary resolution source will be information from EpochAI; however, a consensus of credible reporting may also be used.Anthropic’s latest Claude Opus 4.7 variant currently trails the FrontierMath leaderboard, posting 43.8 percent against OpenAI’s GPT-5.5 Pro at 52.4 percent on this Epoch AI benchmark of original research-level math problems. Claude models continue to show relative strength on software-engineering tasks while underperforming on pure mathematical reasoning compared with their general capabilities index. Epoch researchers are still correcting roughly one-third of FrontierMath items, which could shift reported scores once the cleaned dataset is released. No major Claude math-specific update has been announced in the past month, leaving traders focused on whether an incremental training run or adaptive fine-tuning before June 30 can close the roughly nine-point gap to the current leader.

This market will resolve to "Yes" if any Anthropic Claude model achieves the listed score or greater on the FrontierMath Exam by June 30, 2026, 11:59 PM ET. Otherwise, the market will resolve to "No".

This market will resolve according to the Epoch AI’s Frontier Math benchmarking leaderboard (https://epoch.ai/frontiermath) for Tier 1-3. Studies which are not included in the leaderboard (e.g. https://x.com/EpochAIResearch/status/1945905796904005720) will not be considered.

The primary resolution source will be information from EpochAI; however, a consensus of credible reporting may also be used.
交易量
$61,944
结束日期
2026-06-30
市场开放时间
Jan 30, 2026, 12:00 AM ET
This market will resolve to "Yes" if any Anthropic Claude model achieves the listed score or greater on the FrontierMath Exam by June 30, 2026, 11:59 PM ET. Otherwise, the market will resolve to "No". This market will resolve according to the Epoch AI’s Frontier Math benchmarking leaderboard (https://epoch.ai/frontiermath) for Tier 1-3. Studies which are not included in the leaderboard (e.g. https://x.com/EpochAIResearch/status/1945905796904005720) will not be considered. The primary resolution source will be information from EpochAI; however, a consensus of credible reporting may also be used.
This market will resolve to "Yes" if any Anthropic Claude model achieves the listed score or greater on the FrontierMath Exam by June 30, 2026, 11:59 PM ET. Otherwise, the market will resolve to "No". This market will resolve according to the Epoch AI’s Frontier Math benchmarking leaderboard (https://epoch.ai/frontiermath) for Tier 1-3. Studies which are not included in the leaderboard (e.g. https://x.com/EpochAIResearch/status/1945905796904005720) will not be considered. The primary resolution source will be information from EpochAI; however, a consensus of credible reporting may also be used.Anthropic’s latest Claude Opus 4.7 variant currently trails the FrontierMath leaderboard, posting 43.8 percent against OpenAI’s GPT-5.5 Pro at 52.4 percent on this Epoch AI benchmark of original research-level math problems. Claude models continue to show relative strength on software-engineering tasks while underperforming on pure mathematical reasoning compared with their general capabilities index. Epoch researchers are still correcting roughly one-third of FrontierMath items, which could shift reported scores once the cleaned dataset is released. No major Claude math-specific update has been announced in the past month, leaving traders focused on whether an incremental training run or adaptive fine-tuning before June 30 can close the roughly nine-point gap to the current leader.

This market will resolve to "Yes" if any Anthropic Claude model achieves the listed score or greater on the FrontierMath Exam by June 30, 2026, 11:59 PM ET. Otherwise, the market will resolve to "No".

This market will resolve according to the Epoch AI’s Frontier Math benchmarking leaderboard (https://epoch.ai/frontiermath) for Tier 1-3. Studies which are not included in the leaderboard (e.g. https://x.com/EpochAIResearch/status/1945905796904005720) will not be considered.

The primary resolution source will be information from EpochAI; however, a consensus of credible reporting may also be used.
交易量
$61,944
结束日期
2026-06-30
市场开放时间
Jan 30, 2026, 12:00 AM ET
This market will resolve to "Yes" if any Anthropic Claude model achieves the listed score or greater on the FrontierMath Exam by June 30, 2026, 11:59 PM ET. Otherwise, the market will resolve to "No". This market will resolve according to the Epoch AI’s Frontier Math benchmarking leaderboard (https://epoch.ai/frontiermath) for Tier 1-3. Studies which are not included in the leaderboard (e.g. https://x.com/EpochAIResearch/status/1945905796904005720) will not be considered. The primary resolution source will be information from EpochAI; however, a consensus of credible reporting may also be used.

警惕外部链接哦。

常见问题

"Anthropic Claude在6月30日前在FrontierMath Benchmark上得分?"是 Polymarket 上一个拥有 4 个可能结果的预测市场,交易者根据自己的判断买卖份额。当前领先结果为"25%+",概率为 100%,其次是"30%以上",概率为 100%。价格反映社区的实时概率。例如,价格为 100¢ 的份额意味着市场集体认为该结果的概率为 100%。这些赔率会随着交易者的反应而不断变化。正确结果的份额在市场结算时可兑换为每份 $1。

截至目前,"Anthropic Claude在6月30日前在FrontierMath Benchmark上得分?"已产生 $61.9K 的总交易量(自Jan 30, 2026市场上线以来)。这一活跃度反映了 Polymarket 社区的高度参与,并确保当前赔率由广泛的市场参与者共同形成。你可以直接在本页追踪实时价格变动并交易任何结果。

要在"Anthropic Claude在6月30日前在FrontierMath Benchmark上得分?"上交易,浏览本页上列出的 4 个可用结果。每个结果显示一个代表市场隐含概率的当前价格。要建仓,选择你认为最可能的结果,选择"是"支持或"否"反对,输入金额并点击"交易"。如果你选择的结果在市场结算时正确,你的"是"份额每份支付 $1。如果不正确,支付 $0。你也可以在结算前随时卖出份额。

"Anthropic Claude在6月30日前在FrontierMath Benchmark上得分?"的当前领先者是"25%+",概率为 100%,意味着市场对该结果的概率评估为 100%。紧随其后的结果是"30%以上",概率为 100%。这些赔率随着交易者买卖份额而实时更新。请经常回来查看或将本页加入书签。

"Anthropic Claude在6月30日前在FrontierMath Benchmark上得分?"的结算规则明确定义了每个结果被宣布为获胜者所需满足的条件——包括用于确定结果的官方数据来源。你可以在本页评论上方的"规则"部分查看完整的结算标准。我们建议在交易前仔细阅读规则,因为它们规定了精确的条件、特殊情况和数据来源。