Recent advances in frontier large language models have driven OpenAI GPT variants to 41–44 percent accuracy on Humanity’s Last Exam, a 2,500-question benchmark spanning expert-level topics where early 2025 models scored below 3 percent. GPT-5.5 variants with extended thinking modes now trail only Google’s Gemini 3.1 Pro Preview on public leaderboards, reflecting rapid gains from improved reasoning chains and post-training techniques. With June 30 just weeks away, traders are watching for any new GPT-5 iteration, internal capability jump, or benchmark update that could push an OpenAI model past key thresholds before the deadline. Competitive pressure from Google and Anthropic continues to accelerate release cycles, though exact timelines remain uncertain and dependent on internal testing results.
Experimental AI-generated summary based on Polymarket data. This is not trading advice and plays no role in the resolution of this market. · Updated
OpenAI GPT score on Humanity’s Last Exam by June 30?
$23,212 Vol.
50%+
33%
The resolution source will be the official Humanity’s Last Exam leaderboard https://scale.com/leaderboard/humanitys_last_exam.
Market Opened: Jan 30, 2026, 12:00 AM ET
Resolver
0x65070BE91...
Frequently Asked Questions