Gemini 3.1 Pro Preview leads the Humanity's Last Exam leaderboard with a 46.4% score, ahead of OpenAI's GPT-5.4 Pro at 44.3%. Polymarket odds put trader consensus at a 59% implied probability that Gemini surpasses 50% by June 30. The benchmark, 2,500 expert-level questions across mathematics, sciences, and humanities, highlights Google's advances in reasoning and hallucination reduction through agentic enhancements such as Deep Research, as shown in February 2026 evaluations. April's Gemini Enterprise Agent Platform rollout signals ongoing iteration, but leaderboard updates lag new releases. Traders are watching Google I/O on May 19-20 for potential Gemini 4 previews or capability demos that could push scores higher amid fierce competition between AI labs, though evaluation delays remain a risk.
Experimental AI-generated summary based on Polymarket data. This is not trading advice and does not affect how this market resolves.
$312,073 volume
50% or higher: 59%
55% or higher: 28%
60% or higher: 6%
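The three outcome prices are cumulative thresholds, so the probability the market assigns to each score band can be read off by differencing adjacent thresholds. The following is a minimal sketch of that arithmetic, assuming each price equals the implied probability and each outcome means "the top score is at least this value by the deadline". The function name `bucket_probabilities` and the dictionary layout are illustrative, not part of any Polymarket API.

```python
# Implied probabilities quoted on the page for each cumulative threshold.
# Assumption: each entry means "score is at least <threshold>%" by the deadline.
cumulative = {50: 0.59, 55: 0.28, 60: 0.06}


def bucket_probabilities(cumulative):
    """Convert nested 'at least X%' probabilities into disjoint score bands."""
    thresholds = sorted(cumulative)
    buckets = {}
    # Everything not covered by the lowest threshold.
    buckets[f"below {thresholds[0]}%"] = 1.0 - cumulative[thresholds[0]]
    # Each middle band is the difference between adjacent thresholds.
    for lo, hi in zip(thresholds, thresholds[1:]):
        buckets[f"{lo}% to below {hi}%"] = cumulative[lo] - cumulative[hi]
    # The top band is the highest threshold itself.
    buckets[f"{thresholds[-1]}% or higher"] = cumulative[thresholds[-1]]
    return buckets


buckets = bucket_probabilities(cumulative)
for label, p in buckets.items():
    print(f"{label}: {p:.0%}")
```

Under these assumptions the market implies roughly 41% for a score below 50%, 31% for 50% to below 55%, 22% for 55% to below 60%, and 6% for 60% or higher. Real order books carry spreads and fees, so quoted prices need not be perfectly consistent across thresholds.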
The resolution source will be the official Humanity’s Last Exam leaderboard https://scale.com/leaderboard/humanitys_last_exam.
Market opened: Jan 29, 2026, 12:50 PM ET
Resolver
0x65070BE91...
Frequently Asked Questions