OpenAI’s GPT models have shown steady gains on Humanity’s Last Exam, a 2,500-question benchmark spanning expert-level topics in science, math, and the humanities, with the latest GPT-5.4 and GPT-5.5 variants posting scores between 41% and 58% as of mid-May 2026. This marks substantial progress from the 2–8% range recorded by GPT-4o and o1 when the benchmark launched in early 2025. Iterative releases, including higher-compute “Pro” and “Thinking” variants, along with internal scaling of training and inference, have driven these lifts, though Google’s Gemini 3.1 currently leads the overall leaderboard. Traders are watching for any June model update or fine-tune that could push an OpenAI entry past the current frontier before the June 30 cutoff.
Експериментальне резюме, згенероване ШІ з посиланням на дані Polymarket. Це не торгова порада і не впливає на вирішення цього ринку. · ОновленоOpenAI GPT score on Humanity’s Last Exam by June 30?
$23,212 Обс.
50%+
33%
$23,212 Обс.
50%+
33%
The resolution source will be the official Humanity’s Last Exam leaderboard https://scale.com/leaderboard/humanitys_last_exam.
Ринок відкрито: Jan 30, 2026, 12:00 AM ET
Resolver
0x65070BE91...The resolution source will be the official Humanity’s Last Exam leaderboard https://scale.com/leaderboard/humanitys_last_exam.
Resolver
0x65070BE91...OpenAI’s GPT models have shown steady gains on Humanity’s Last Exam, a 2,500-question benchmark spanning expert-level topics in science, math, and the humanities, with the latest GPT-5.4 and GPT-5.5 variants posting scores between 41% and 58% as of mid-May 2026. This marks substantial progress from the 2–8% range recorded by GPT-4o and o1 when the benchmark launched in early 2025. Iterative releases, including higher-compute “Pro” and “Thinking” variants, along with internal scaling of training and inference, have driven these lifts, though Google’s Gemini 3.1 currently leads the overall leaderboard. Traders are watching for any June model update or fine-tune that could push an OpenAI entry past the current frontier before the June 30 cutoff.
Експериментальне резюме, згенероване ШІ з посиланням на дані Polymarket. Це не торгова порада і не впливає на вирішення цього ринку. · Оновлено
Обережно з зовнішніми посиланнями.
Обережно з зовнішніми посиланнями.
Часті запитання