xAI's rapid iteration on Grok models, powered by the Colossus 2 supercluster with over 550,000 GPUs, drives trader focus on whether the next release—likely Grok 4.4 or Grok 5 targeting 6-10 trillion parameters—will debut anonymously on the LMSYS Chatbot Arena leaderboard for blind Elo benchmarking against leaders like Anthropic's Claude Opus 4.6. Recent catalysts include the May 1 launch of Grok 4.3, excelling in agentic tool use and instruction-following at aggressive pricing, and Elon's April prediction of matching Opus performance by June amid multi-model training. While Grok-4.20 beta topped Arena's medical category in early April, xAI favors direct API rollouts over stealth tests, heightening uncertainty; watch for blind submissions signaling peak capability ahead of Q2 releases.
Experimental AI-generated summary referencing Polymarket data. This is not trading advice and plays no role in how this market resolves. · Updated$30,551 Vol.
1440+
62%
1460+
42%
1480+
16%
$30,551 Vol.
1440+
62%
1460+
42%
1480+
16%
Results from the "Score" column under the "Text Arena | Overall" Leaderboard tab at https://lmarena.ai/leaderboard/text with style control off will be used to resolve this market.
If no qualifying score for the specified model is available on the Arena.AI Leaderboard at 12:00 PM ET following the date of the release, this market will resolve based on the first subsequent instance at which such a score becomes available on the leaderboard. If no qualifying score becomes available by the end of the seventh day following the day of the model’s release, or if no qualifying model release occurs by December 31, 2026, 11:59 PM ET, this market will resolve to "No".
If multiple models are released on the same calendar date or if multiple variants of the specified model appear on the Arena.AI Leaderboard at the relevant check time (e.g., base, “Thinking,” or “Instant”), the highest-scoring variant will be used for resolution.
A qualifying model must be launched and publicly accessible, including via open beta or open rolling waitlist signups. A closed beta or any form of private access will not suffice. The release must be either clearly defined and publicly announced as being accessible to the general public or otherwise made publicly accessible and explicitly labeled within the company’s official website. Labeling errors, placeholder text, or version names displayed on the website that do not correspond to a model that is actually accessible to the general public will not qualify.
The resolution source for this market is the Chatbot Arena LLM Leaderboard found at arena.ai/leaderboard/text. If this resolution source is unavailable at 12:00 PM ET following the date of the release, this market will resolve based on the first subsequent instance at which such a score becomes available on the leaderboard. If it remains unavailable through the end of the seventh day after a qualifying release, it will resolve to "No".
Market Opened: May 1, 2026, 6:09 PM ET
Resolver
0x65070BE91...Results from the "Score" column under the "Text Arena | Overall" Leaderboard tab at https://lmarena.ai/leaderboard/text with style control off will be used to resolve this market.
If no qualifying score for the specified model is available on the Arena.AI Leaderboard at 12:00 PM ET following the date of the release, this market will resolve based on the first subsequent instance at which such a score becomes available on the leaderboard. If no qualifying score becomes available by the end of the seventh day following the day of the model’s release, or if no qualifying model release occurs by December 31, 2026, 11:59 PM ET, this market will resolve to "No".
If multiple models are released on the same calendar date or if multiple variants of the specified model appear on the Arena.AI Leaderboard at the relevant check time (e.g., base, “Thinking,” or “Instant”), the highest-scoring variant will be used for resolution.
A qualifying model must be launched and publicly accessible, including via open beta or open rolling waitlist signups. A closed beta or any form of private access will not suffice. The release must be either clearly defined and publicly announced as being accessible to the general public or otherwise made publicly accessible and explicitly labeled within the company’s official website. Labeling errors, placeholder text, or version names displayed on the website that do not correspond to a model that is actually accessible to the general public will not qualify.
The resolution source for this market is the Chatbot Arena LLM Leaderboard found at arena.ai/leaderboard/text. If this resolution source is unavailable at 12:00 PM ET following the date of the release, this market will resolve based on the first subsequent instance at which such a score becomes available on the leaderboard. If it remains unavailable through the end of the seventh day after a qualifying release, it will resolve to "No".
Resolver
0x65070BE91...xAI's rapid iteration on Grok models, powered by the Colossus 2 supercluster with over 550,000 GPUs, drives trader focus on whether the next release—likely Grok 4.4 or Grok 5 targeting 6-10 trillion parameters—will debut anonymously on the LMSYS Chatbot Arena leaderboard for blind Elo benchmarking against leaders like Anthropic's Claude Opus 4.6. Recent catalysts include the May 1 launch of Grok 4.3, excelling in agentic tool use and instruction-following at aggressive pricing, and Elon's April prediction of matching Opus performance by June amid multi-model training. While Grok-4.20 beta topped Arena's medical category in early April, xAI favors direct API rollouts over stealth tests, heightening uncertainty; watch for blind submissions signaling peak capability ahead of Q2 releases.
Experimental AI-generated summary referencing Polymarket data. This is not trading advice and plays no role in how this market resolves. · Updated
Beware of external links.
Beware of external links.
Frequently Asked Questions