Google's Gemini 3.1 Pro Preview currently leads the official Scale Labs Humanity’s Last Exam leaderboard—the key resolution source for this market—with a 46.44% score using high-thinking mode, edging out OpenAI's GPT-5.4 Pro at 44.32% in this no-tools, closed-ended benchmark of 2,500 PhD-level questions spanning frontier AI capabilities. This positioning reflects DeepMind's iterative advances in reasoning and chain-of-thought prompting since early 2026 releases like Gemini 3 Pro at 37.5%. Trader sentiment hinges on potential score gains before the June 30, 2026, deadline, with Google I/O on May 19-20 poised to announce Gemini updates or new models that could prompt leaderboard reevaluations amid intensifying competition from Anthropic and xAI. Delays in submissions or evaluation could cap progress.
Tóm tắt AI thử nghiệm tham chiếu dữ liệu Polymarket. Đây không phải tư vấn giao dịch và không ảnh hưởng đến cách thị trường này được giải quyết. · Cập nhật$312,076 KL.
50%+
65%
55%+
30%
60%+
6%
$312,076 KL.
50%+
65%
55%+
30%
60%+
6%
The resolution source will be the official Humanity’s Last Exam leaderboard https://scale.com/leaderboard/humanitys_last_exam.
Thị trường mở: Jan 29, 2026, 12:50 PM ET
Resolver
0x65070BE91...The resolution source will be the official Humanity’s Last Exam leaderboard https://scale.com/leaderboard/humanitys_last_exam.
Resolver
0x65070BE91...Google's Gemini 3.1 Pro Preview currently leads the official Scale Labs Humanity’s Last Exam leaderboard—the key resolution source for this market—with a 46.44% score using high-thinking mode, edging out OpenAI's GPT-5.4 Pro at 44.32% in this no-tools, closed-ended benchmark of 2,500 PhD-level questions spanning frontier AI capabilities. This positioning reflects DeepMind's iterative advances in reasoning and chain-of-thought prompting since early 2026 releases like Gemini 3 Pro at 37.5%. Trader sentiment hinges on potential score gains before the June 30, 2026, deadline, with Google I/O on May 19-20 poised to announce Gemini updates or new models that could prompt leaderboard reevaluations amid intensifying competition from Anthropic and xAI. Delays in submissions or evaluation could cap progress.
Tóm tắt AI thử nghiệm tham chiếu dữ liệu Polymarket. Đây không phải tư vấn giao dịch và không ảnh hưởng đến cách thị trường này được giải quyết. · Cập nhật
Cẩn thận với liên kết bên ngoài.
Cẩn thận với liên kết bên ngoài.
Câu hỏi thường gặp