Skip to main content
icon for xAI Grok score on FrontierMath Benchmark by June 30?

xAI Grok score on FrontierMath Benchmark by June 30?

icon for xAI Grok score on FrontierMath Benchmark by June 30?

xAI Grok score on FrontierMath Benchmark by June 30?

$24,212 交易量

2026-02-28
Polymarket

$24,212 交易量

Polymarket

40%+

$3,642 交易量

94%

50%+

$423 交易量

<1%

This market will resolve to "Yes" if any xAI Grok model achieves the listed score or greater on the FrontierMath Exam by June 30, 2026, 11:59 PM ET. Otherwise, the market will resolve to "No". This market will resolve according to the Epoch AI’s Frontier Math benchmarking leaderboard (https://epoch.ai/frontiermath) for Tier 1-3. Studies which are not included in the leaderboard (e.g. https://x.com/EpochAIResearch/status/1945905796904005720) will not be considered. The primary resolution source will be information from EpochAI; however, a consensus of credible reporting may also be used.xAI's Grok models currently sit at 12-14% accuracy on Epoch AI's FrontierMath Tiers 1-3, a set of 300 unpublished, research-level math problems designed to resist data contamination and require hours or days of expert effort per question. This places them well behind leaders like OpenAI's o-series variants and GPT-5 iterations, which have posted scores in the mid-20s to low-50s in recent independent evaluations. With only days remaining until the June 30, 2026 resolution deadline and no confirmed Grok updates or capability jumps announced in the past month, trader sentiment reflects the narrow window for any rapid improvement. Competitive dynamics in advanced reasoning benchmarks continue to favor labs with stronger demonstrated tool use and scaling on math-specific tasks, though xAI's focus on unique problem-solving strengths has occasionally yielded novel solves on FrontierMath.

This market will resolve to "Yes" if any xAI Grok model achieves the listed score or greater on the FrontierMath Exam by June 30, 2026, 11:59 PM ET. Otherwise, the market will resolve to "No".

This market will resolve according to the Epoch AI’s Frontier Math benchmarking leaderboard (https://epoch.ai/frontiermath) for Tier 1-3. Studies which are not included in the leaderboard (e.g. https://x.com/EpochAIResearch/status/1945905796904005720) will not be considered.

The primary resolution source will be information from EpochAI; however, a consensus of credible reporting may also be used.
交易量
$24,212
结束日期
2026-06-30
市场开放时间
Jan 30, 2026, 12:01 AM ET
This market will resolve to "Yes" if any xAI Grok model achieves the listed score or greater on the FrontierMath Exam by June 30, 2026, 11:59 PM ET. Otherwise, the market will resolve to "No". This market will resolve according to the Epoch AI’s Frontier Math benchmarking leaderboard (https://epoch.ai/frontiermath) for Tier 1-3. Studies which are not included in the leaderboard (e.g. https://x.com/EpochAIResearch/status/1945905796904005720) will not be considered. The primary resolution source will be information from EpochAI; however, a consensus of credible reporting may also be used.
This market will resolve to "Yes" if any xAI Grok model achieves the listed score or greater on the FrontierMath Exam by June 30, 2026, 11:59 PM ET. Otherwise, the market will resolve to "No". This market will resolve according to the Epoch AI’s Frontier Math benchmarking leaderboard (https://epoch.ai/frontiermath) for Tier 1-3. Studies which are not included in the leaderboard (e.g. https://x.com/EpochAIResearch/status/1945905796904005720) will not be considered. The primary resolution source will be information from EpochAI; however, a consensus of credible reporting may also be used.xAI's Grok models currently sit at 12-14% accuracy on Epoch AI's FrontierMath Tiers 1-3, a set of 300 unpublished, research-level math problems designed to resist data contamination and require hours or days of expert effort per question. This places them well behind leaders like OpenAI's o-series variants and GPT-5 iterations, which have posted scores in the mid-20s to low-50s in recent independent evaluations. With only days remaining until the June 30, 2026 resolution deadline and no confirmed Grok updates or capability jumps announced in the past month, trader sentiment reflects the narrow window for any rapid improvement. Competitive dynamics in advanced reasoning benchmarks continue to favor labs with stronger demonstrated tool use and scaling on math-specific tasks, though xAI's focus on unique problem-solving strengths has occasionally yielded novel solves on FrontierMath.

This market will resolve to "Yes" if any xAI Grok model achieves the listed score or greater on the FrontierMath Exam by June 30, 2026, 11:59 PM ET. Otherwise, the market will resolve to "No".

This market will resolve according to the Epoch AI’s Frontier Math benchmarking leaderboard (https://epoch.ai/frontiermath) for Tier 1-3. Studies which are not included in the leaderboard (e.g. https://x.com/EpochAIResearch/status/1945905796904005720) will not be considered.

The primary resolution source will be information from EpochAI; however, a consensus of credible reporting may also be used.
交易量
$24,212
结束日期
2026-06-30
市场开放时间
Jan 30, 2026, 12:01 AM ET
This market will resolve to "Yes" if any xAI Grok model achieves the listed score or greater on the FrontierMath Exam by June 30, 2026, 11:59 PM ET. Otherwise, the market will resolve to "No". This market will resolve according to the Epoch AI’s Frontier Math benchmarking leaderboard (https://epoch.ai/frontiermath) for Tier 1-3. Studies which are not included in the leaderboard (e.g. https://x.com/EpochAIResearch/status/1945905796904005720) will not be considered. The primary resolution source will be information from EpochAI; however, a consensus of credible reporting may also be used.

警惕外部链接哦。

常见问题

"xAI Grok score on FrontierMath Benchmark by June 30?"是 Polymarket 上一个拥有 4 个可能结果的预测市场,交易者根据自己的判断买卖份额。当前领先结果为"25%+",概率为 100%,其次是"30%+",概率为 100%。价格反映社区的实时概率。例如,价格为 100¢ 的份额意味着市场集体认为该结果的概率为 100%。这些赔率会随着交易者的反应而不断变化。正确结果的份额在市场结算时可兑换为每份 $1。

截至目前,"xAI Grok score on FrontierMath Benchmark by June 30?"已产生 $24.2K 的总交易量(自Jan 30, 2026市场上线以来)。这一活跃度反映了 Polymarket 社区的高度参与,并确保当前赔率由广泛的市场参与者共同形成。你可以直接在本页追踪实时价格变动并交易任何结果。

要在"xAI Grok score on FrontierMath Benchmark by June 30?"上交易,浏览本页上列出的 4 个可用结果。每个结果显示一个代表市场隐含概率的当前价格。要建仓,选择你认为最可能的结果,选择"是"支持或"否"反对,输入金额并点击"交易"。如果你选择的结果在市场结算时正确,你的"是"份额每份支付 $1。如果不正确,支付 $0。你也可以在结算前随时卖出份额。

"xAI Grok score on FrontierMath Benchmark by June 30?"的当前领先者是"25%+",概率为 100%,意味着市场对该结果的概率评估为 100%。紧随其后的结果是"30%+",概率为 100%。这些赔率随着交易者买卖份额而实时更新。请经常回来查看或将本页加入书签。

"xAI Grok score on FrontierMath Benchmark by June 30?"的结算规则明确定义了每个结果被宣布为获胜者所需满足的条件——包括用于确定结果的官方数据来源。你可以在本页评论上方的"规则"部分查看完整的结算标准。我们建议在交易前仔细阅读规则,因为它们规定了精确的条件、特殊情况和数据来源。