Google DeepMind's May 11 announcement of the AI Co-Mathematician—a multi-agent workbench powered by Gemini 3.1 Pro—doubled the base model's 19% score to 48% on FrontierMath Tier 4, surpassing OpenAI's GPT-5.5 Pro at 39.6% via iterative literature search, code execution, and self-review loops, though using extended compute. Raw Gemini models trail the leaderboard led by OpenAI's GPT-5.4 at 47.6%, with prior Gemini 3 Pro at ~38% on Tiers 1-3. Traders eye Google I/O on May 19-20 for potential Gemini 4 previews amid fierce AI math reasoning competition, with June 30 allowing evaluation time but hinging on new model releases and Epoch AI verifications to challenge the frontier benchmark's research-level problems.
Polymarket 데이터를 참조하는 실험적 AI 생성 요약입니다. 이것은 거래 조언이 아니며 이 마켓의 정산에 영향을 미치지 않습니다. · 업데이트$136,324 거래량
40%+
86%
45%+
64%
50% 이상
62%
60% 이상
54%
$136,324 거래량
40%+
86%
45%+
64%
50% 이상
62%
60% 이상
54%
This market will resolve according to the Epoch AI’s Frontier Math benchmarking leaderboard (https://epoch.ai/frontiermath) for Tier 1-3. Studies which are not included in the leaderboard (e.g. https://x.com/EpochAIResearch/status/1945905796904005720) will not be considered.
The primary resolution source will be information from EpochAI; however, a consensus of credible reporting may also be used.
마켓 개설일: Feb 6, 2026, 6:03 PM ET
Resolver
0x65070BE91...This market will resolve according to the Epoch AI’s Frontier Math benchmarking leaderboard (https://epoch.ai/frontiermath) for Tier 1-3. Studies which are not included in the leaderboard (e.g. https://x.com/EpochAIResearch/status/1945905796904005720) will not be considered.
The primary resolution source will be information from EpochAI; however, a consensus of credible reporting may also be used.
Resolver
0x65070BE91...Google DeepMind's May 11 announcement of the AI Co-Mathematician—a multi-agent workbench powered by Gemini 3.1 Pro—doubled the base model's 19% score to 48% on FrontierMath Tier 4, surpassing OpenAI's GPT-5.5 Pro at 39.6% via iterative literature search, code execution, and self-review loops, though using extended compute. Raw Gemini models trail the leaderboard led by OpenAI's GPT-5.4 at 47.6%, with prior Gemini 3 Pro at ~38% on Tiers 1-3. Traders eye Google I/O on May 19-20 for potential Gemini 4 previews amid fierce AI math reasoning competition, with June 30 allowing evaluation time but hinging on new model releases and Epoch AI verifications to challenge the frontier benchmark's research-level problems.
Polymarket 데이터를 참조하는 실험적 AI 생성 요약입니다. 이것은 거래 조언이 아니며 이 마켓의 정산에 영향을 미치지 않습니다. · 업데이트
외부 링크에 주의하세요.
외부 링크에 주의하세요.
자주 묻는 질문