OpenAI's latest frontier model, GPT-5.5 released in late April 2026, has propelled trader sentiment by scoring 44-57% on Humanity's Last Exam—a grueling 2,500-question benchmark spanning expert-level math, sciences, and humanities—placing it near the top alongside Google's Gemini 3.1 Pro (45-46%) but trailing Anthropic's Claude Mythos Preview (65%). This advance stems from enhanced reasoning chains and self-improvement during training, as highlighted in OpenAI's announcements, though high calibration errors reveal persistent overconfidence on novel problems. Competitive pressure intensifies with rivals saturating easier benchmarks, positioning HLE as a key AGI progress signal. Traders eye June releases like GPT-5.6 or early GPT-6 previews per OpenAI's 2026 roadmap, which could break 60% amid rapid iteration cycles.
Polymarket 데이터를 참조하는 실험적 AI 생성 요약입니다. 이것은 거래 조언이 아니며 이 마켓의 정산에 영향을 미치지 않습니다. · 업데이트$22,990 거래량
50%+
34%
$22,990 거래량
50%+
34%
The resolution source will be the official Humanity’s Last Exam leaderboard https://scale.com/leaderboard/humanitys_last_exam.
마켓 개설일: Jan 30, 2026, 12:00 AM ET
Resolver
0x65070BE91...The resolution source will be the official Humanity’s Last Exam leaderboard https://scale.com/leaderboard/humanitys_last_exam.
Resolver
0x65070BE91...OpenAI's latest frontier model, GPT-5.5 released in late April 2026, has propelled trader sentiment by scoring 44-57% on Humanity's Last Exam—a grueling 2,500-question benchmark spanning expert-level math, sciences, and humanities—placing it near the top alongside Google's Gemini 3.1 Pro (45-46%) but trailing Anthropic's Claude Mythos Preview (65%). This advance stems from enhanced reasoning chains and self-improvement during training, as highlighted in OpenAI's announcements, though high calibration errors reveal persistent overconfidence on novel problems. Competitive pressure intensifies with rivals saturating easier benchmarks, positioning HLE as a key AGI progress signal. Traders eye June releases like GPT-5.6 or early GPT-6 previews per OpenAI's 2026 roadmap, which could break 60% amid rapid iteration cycles.
Polymarket 데이터를 참조하는 실험적 AI 생성 요약입니다. 이것은 거래 조언이 아니며 이 마켓의 정산에 영향을 미치지 않습니다. · 업데이트
외부 링크에 주의하세요.
외부 링크에 주의하세요.
자주 묻는 질문