Frontier AI labs remain locked in a tight race for benchmark leadership heading into late June, with OpenAI’s recent GPT-5.5 release, Anthropic’s Claude Opus 4.6/4.7 updates, Google’s Gemini 3.1 series, and xAI’s Grok 4.20 iterations all posting strong results across reasoning, coding, and agentic tasks. As of mid-May 2026, composite leaderboards show these four companies clustered within a narrow margin on metrics such as GPQA Diamond and SWE-Bench, making any single firm’s claim to the top spot highly sensitive to the next capability jump or fine-tune. Traders are watching for potential model drops or major benchmark shifts in the remaining six weeks, as even incremental gains in large language model performance can quickly reorder market-implied odds.
Polymarket डेटा का संदर्भ देने वाला प्रयोगात्मक AI-जनरेटेड सारांश। यह ट्रेडिंग सलाह नहीं है और इस बाज़ार के समाधान में कोई भूमिका नहीं निभाता। · अपडेट किया गया$1,563,416 वॉल्यूम

OpenAI
11%

xAI
5%

मिस्त्राल
4%

मेटा
4%

Z.ai
2%

DeepSeek
2%

Nvidia
2%

अलीबाबा
2%

Baidu
2%

मीटुआन
1%
$1,563,416 वॉल्यूम

OpenAI
11%

xAI
5%

मिस्त्राल
4%

मेटा
4%

Z.ai
2%

DeepSeek
2%

Nvidia
2%

अलीबाबा
2%

Baidu
2%

मीटुआन
1%
Results from the "Arena Score" section on the Leaderboard tab of https://lmarena.ai/ with the style control unchecked will be used to resolve this market.
If a listed model ties for #1 Arena score, it will suffice to resolve this market to "Yes."
The resolution source for this market is the Chatbot Arena LLM Leaderboard found at https://lmarena.ai/. If this resolution source becomes unavailable, the market will remain open until it is accessible again. If it becomes permanently unavailable, resolution will be based on another credible source.
बाज़ार खुला: Dec 22, 2025, 5:28 PM ET
Resolver
0x65070BE91...Results from the "Arena Score" section on the Leaderboard tab of https://lmarena.ai/ with the style control unchecked will be used to resolve this market.
If a listed model ties for #1 Arena score, it will suffice to resolve this market to "Yes."
The resolution source for this market is the Chatbot Arena LLM Leaderboard found at https://lmarena.ai/. If this resolution source becomes unavailable, the market will remain open until it is accessible again. If it becomes permanently unavailable, resolution will be based on another credible source.
Resolver
0x65070BE91...Frontier AI labs remain locked in a tight race for benchmark leadership heading into late June, with OpenAI’s recent GPT-5.5 release, Anthropic’s Claude Opus 4.6/4.7 updates, Google’s Gemini 3.1 series, and xAI’s Grok 4.20 iterations all posting strong results across reasoning, coding, and agentic tasks. As of mid-May 2026, composite leaderboards show these four companies clustered within a narrow margin on metrics such as GPQA Diamond and SWE-Bench, making any single firm’s claim to the top spot highly sensitive to the next capability jump or fine-tune. Traders are watching for potential model drops or major benchmark shifts in the remaining six weeks, as even incremental gains in large language model performance can quickly reorder market-implied odds.
Polymarket डेटा का संदर्भ देने वाला प्रयोगात्मक AI-जनरेटेड सारांश। यह ट्रेडिंग सलाह नहीं है और इस बाज़ार के समाधान में कोई भूमिका नहीं निभाता। · अपडेट किया गया
बाहरी लिंक से सावधान रहें।
बाहरी लिंक से सावधान रहें।
अक्सर पूछे जाने वाले प्रश्न