OpenAI's GPT-5.5, released April 23, 2026, currently leads the FrontierMath benchmark—a rigorous test of advanced mathematical reasoning on unpublished expert-level problems—with scores up to 52% overall and 39.6% on the hardest Tier 4, outpacing rivals like Anthropic's Opus 4.7 and Google's Gemini 3.1. However, Epoch AI halted evaluations on May 11 after GPT-5.5 flagged fatal errors in about one-third of Tiers 1-4 problems, prompting a human review that could recalibrate leaderboards. DeepMind's multi-agent "AI co-mathematician" recently hit 47.9% on Tier 4, intensifying competition. Traders eye potential GPT-5.5 updates, Instant variant tweaks from May 5, or interim GPT-6 previews by June 30 amid ongoing benchmark refinements and rapid scaling gains.
Résumé expérimental généré par IA à partir des données Polymarket. Ceci n'est pas un conseil de trading et ne joue aucun rôle dans la résolution de ce marché. · Mis à jour$34,665 Vol.
60 %+
66%
70 %+
25%
$34,665 Vol.
60 %+
66%
70 %+
25%
This market will resolve according to the Epoch AI’s Frontier Math benchmarking leaderboard (https://epoch.ai/frontiermath) for Tier 1-3. Studies which are not included in the leaderboard (e.g. https://x.com/EpochAIResearch/status/1945905796904005720) will not be considered.
The primary resolution source will be information from EpochAI; however, a consensus of credible reporting may also be used.
Marché ouvert : Jan 29, 2026, 12:47 PM ET
Resolver
0x65070BE91...This market will resolve according to the Epoch AI’s Frontier Math benchmarking leaderboard (https://epoch.ai/frontiermath) for Tier 1-3. Studies which are not included in the leaderboard (e.g. https://x.com/EpochAIResearch/status/1945905796904005720) will not be considered.
The primary resolution source will be information from EpochAI; however, a consensus of credible reporting may also be used.
Resolver
0x65070BE91...OpenAI's GPT-5.5, released April 23, 2026, currently leads the FrontierMath benchmark—a rigorous test of advanced mathematical reasoning on unpublished expert-level problems—with scores up to 52% overall and 39.6% on the hardest Tier 4, outpacing rivals like Anthropic's Opus 4.7 and Google's Gemini 3.1. However, Epoch AI halted evaluations on May 11 after GPT-5.5 flagged fatal errors in about one-third of Tiers 1-4 problems, prompting a human review that could recalibrate leaderboards. DeepMind's multi-agent "AI co-mathematician" recently hit 47.9% on Tier 4, intensifying competition. Traders eye potential GPT-5.5 updates, Instant variant tweaks from May 5, or interim GPT-6 previews by June 30 amid ongoing benchmark refinements and rapid scaling gains.
Résumé expérimental généré par IA à partir des données Polymarket. Ceci n'est pas un conseil de trading et ne joue aucun rôle dans la résolution de ce marché. · Mis à jour
Méfiez-vous des liens externes.
Méfiez-vous des liens externes.
Questions fréquentes