TOTAL VOLUME:
$66b
24H VOL:
$398,877,831
24H TRANSACTIONS:
647,445,881
OPEN INTEREST:
$1,477,629,845
622,934
Markets across
14,083
events
MATCHED EVENTS:
1,257
PLATFORM COVERAGE:
4
Polymarket:
49%
VS.
Kalshi:
51%
$
$20
$50
$100
$500
This event group tracks whether xAI's next model release will debut on the Chatbot Arena leaderboard and meet specific performance thresholds. Kalshi markets focus on release timing (before various 2026 dates), while Polymarket markets focus on the debut score achieved on Arena.AI's leaderboard (1440, 1460, or 1480+).
This market will resolve to "Yes" if the next model released by xAI has at least the specified score on the Arena.AI Leaderboard (arena.ai/leaderboard/text) when the table under the "Leaderboard" tab is checked on the date after the day of the release, 12:00 PM ET. Otherwise, this market will resolve to "No". Results from the "Score" column under the "Text Arena | Overall" Leaderboard tab at https://lmarena.ai/leaderboard/text with style control off will be used to resolve this market. If no qualifying score for the specified model is available on the Arena.AI Leaderboard at 12:00 PM ET following the date of the release, this market will resolve based on the first subsequent instance at which such a score becomes available on the leaderboard. If no qualifying score becomes available by the end of the seventh day following the day of the model’s release, or if no qualifying model release occurs by December 31, 2026, 11:59 PM ET, this market will resolve to "No". If multiple models are released on the same calendar date or if multiple variants of the specified model appear on the Arena.AI Leaderboard at the relevant check time (e.g., base, “Thinking,” or “Instant”), the highest-scoring variant will be used for resolution. A qualifying model must be launched and publicly accessible, including via open beta or open rolling waitlist signups. A closed beta or any form of private access will not suffice. The release must be either clearly defined and publicly announced as being accessible to the general public or otherwise made publicly accessible and explicitly labeled within the company’s official website. Labeling errors, placeholder text, or version names displayed on the website that do not correspond to a model that is actually accessible to the general public will not qualify. The resolution source for this market is the Chatbot Arena LLM Leaderboard found at arena.ai/leaderboard/text. If this resolution source is unavailable at 12:00 PM ET following the date of the release, this market will resolve based on the first subsequent instance at which such a score becomes available on the leaderboard. If it remains unavailable through the end of the seventh day after a qualifying release, it will resolve to "No".
Resolution requires public release of Grok 5 outside of closed beta testing, though availability limited to premium subscription tiers qualifies as public release. The event is structured around four sequential release windows: before April 1, 2026; before July 1, 2026; before October 1, 2026; and before January 1, 2027. Each window resolves independently based on whether a public release has occurred by that date.
Prediction market odds reflect real-time trader conviction rather than published analyst reports. Current market prices on Kalshi and Polymarket embed collective expectations about xAI's engineering timeline and model performance, but they may diverge from traditional equity research or AI benchmarking forecasts. Markets often price in tail risks and insider sentiment that analysts miss, while analysts may incorporate longer-term strategic context. Comparing market odds to public statements from xAI leadership, industry benchmarks, and competing model releases provides useful calibration for assessing whether markets are pricing optimistically or conservatively.
Kalshi and Polymarket can show different implied probabilities for the same outcome because of liquidity, fee structure, participant mix, and how each venue defines the contract. Kalshi and Polymarket serve different trader bases, regulatory jurisdictions, and liquidity pools, which can create temporary price gaps. Kalshi currently shows 11.0% for its top outcome, while Polymarket reflects 0.1%, a spread of 10.9 percentage points. Differences arise from varying order flow timing, user demographics, fee structures, and contract specifications. Arbitrage traders typically exploit these gaps, but frictions like withdrawal delays and platform-specific rules can prevent instant convergence. Monitoring both venues helps identify which market is pricing the event more efficiently.
Key catalysts include official xAI product announcements, Elon Musk statements about Grok development timelines, and competitive model releases from OpenAI, Anthropic, or Google. Arena leaderboard updates showing emerging models will influence expectations around performance thresholds. Regulatory changes affecting AI deployment, funding news for xAI, or technical breakthroughs in reasoning or multimodal capabilities could shift trader sentiment. Industry conferences, benchmark publications, and hiring announcements from xAI also signal development progress. Real-time monitoring of these events helps traders adjust positions before market prices fully reflect new information.
Follow the signals, not the noise
Get insights on market conviction, notable shifts, and what the data is quietly signaling.