Google's Gemini 3.1 Pro Preview currently sits at 44.7% on Humanity's Last Exam (HLE), a 2,500-question benchmark testing expert-level reasoning across math, sciences, and humanities, trailing Anthropic's leading Claude variants at 53.3%. Recent gains from the 37.5% posted by Gemini 3 Pro in late 2025 reflect iterative improvements in reasoning capabilities and test-time scaling, yet Anthropic's edge stems from stronger adaptive techniques on this saturated frontier benchmark. With the June 30 deadline approaching, any unannounced Gemini preview or official update could shift scores, while competitive pressure from OpenAI's GPT-5.4 series adds uncertainty around further leaderboard movement before resolution. Traders monitor Google DeepMind announcements for signs of rapid iteration that might close the gap.
Polymarketデータを参照したAI生成の実験的な要約。これは取引アドバイスではなく、このマーケットの解決方法には一切関係ありません。 · 更新日$320,107 Vol.
50%+
2%
55%+
4%
60%+
1%
$320,107 Vol.
50%+
2%
55%+
4%
60%+
1%
The resolution source will be the official Humanity’s Last Exam leaderboard https://scale.com/leaderboard/humanitys_last_exam.
マーケット開始日: Jan 29, 2026, 12:50 PM ET
Resolver
0x65070BE91...The resolution source will be the official Humanity’s Last Exam leaderboard https://scale.com/leaderboard/humanitys_last_exam.
Resolver
0x65070BE91...Google's Gemini 3.1 Pro Preview currently sits at 44.7% on Humanity's Last Exam (HLE), a 2,500-question benchmark testing expert-level reasoning across math, sciences, and humanities, trailing Anthropic's leading Claude variants at 53.3%. Recent gains from the 37.5% posted by Gemini 3 Pro in late 2025 reflect iterative improvements in reasoning capabilities and test-time scaling, yet Anthropic's edge stems from stronger adaptive techniques on this saturated frontier benchmark. With the June 30 deadline approaching, any unannounced Gemini preview or official update could shift scores, while competitive pressure from OpenAI's GPT-5.4 series adds uncertainty around further leaderboard movement before resolution. Traders monitor Google DeepMind announcements for signs of rapid iteration that might close the gap.
Polymarketデータを参照したAI生成の実験的な要約。これは取引アドバイスではなく、このマーケットの解決方法には一切関係ありません。 · 更新日
外部リンクに注意してください。
外部リンクに注意してください。
よくある質問