OpenAI's GPT-5.5, released April 23, 2026, currently leads the FrontierMath benchmark—a rigorous test of advanced mathematical reasoning on unpublished expert-level problems—with scores up to 52% overall and 39.6% on the hardest Tier 4, outpacing rivals like Anthropic's Opus 4.7 and Google's Gemini 3.1. However, Epoch AI halted evaluations on May 11 after GPT-5.5 flagged fatal errors in about one-third of Tiers 1-4 problems, prompting a human review that could recalibrate leaderboards. DeepMind's multi-agent "AI co-mathematician" recently hit 47.9% on Tier 4, intensifying competition. Traders eye potential GPT-5.5 updates, Instant variant tweaks from May 5, or interim GPT-6 previews by June 30 amid ongoing benchmark refinements and rapid scaling gains.
Экспериментальная сводка, созданная ИИ на основе данных Polymarket. Это не является торговой рекомендацией и не влияет на то, как разрешается этот рынок. · ОбновленоОценка OpenAI GPT по FrontierMath Benchmark к 30 июня?
Оценка OpenAI GPT по FrontierMath Benchmark к 30 июня?
$34,665 Объем
60%+
66%
70%+
25%
$34,665 Объем
60%+
66%
70%+
25%
This market will resolve according to the Epoch AI’s Frontier Math benchmarking leaderboard (https://epoch.ai/frontiermath) for Tier 1-3. Studies which are not included in the leaderboard (e.g. https://x.com/EpochAIResearch/status/1945905796904005720) will not be considered.
The primary resolution source will be information from EpochAI; however, a consensus of credible reporting may also be used.
Открытие рынка: Jan 29, 2026, 12:47 PM ET
Resolver
0x65070BE91...This market will resolve according to the Epoch AI’s Frontier Math benchmarking leaderboard (https://epoch.ai/frontiermath) for Tier 1-3. Studies which are not included in the leaderboard (e.g. https://x.com/EpochAIResearch/status/1945905796904005720) will not be considered.
The primary resolution source will be information from EpochAI; however, a consensus of credible reporting may also be used.
Resolver
0x65070BE91...OpenAI's GPT-5.5, released April 23, 2026, currently leads the FrontierMath benchmark—a rigorous test of advanced mathematical reasoning on unpublished expert-level problems—with scores up to 52% overall and 39.6% on the hardest Tier 4, outpacing rivals like Anthropic's Opus 4.7 and Google's Gemini 3.1. However, Epoch AI halted evaluations on May 11 after GPT-5.5 flagged fatal errors in about one-third of Tiers 1-4 problems, prompting a human review that could recalibrate leaderboards. DeepMind's multi-agent "AI co-mathematician" recently hit 47.9% on Tier 4, intensifying competition. Traders eye potential GPT-5.5 updates, Instant variant tweaks from May 5, or interim GPT-6 previews by June 30 amid ongoing benchmark refinements and rapid scaling gains.
Экспериментальная сводка, созданная ИИ на основе данных Polymarket. Это не является торговой рекомендацией и не влияет на то, как разрешается этот рынок. · Обновлено
Не доверяй внешним ссылкам.
Не доверяй внешним ссылкам.
Часто задаваемые вопросы