TOTAL VOLUME:
$66b
24H VOL:
$398,877,831
24H TRANSACTIONS:
647,445,881
OPEN INTEREST:
$1,477,629,845
622,934
Markets across
14,083
events
MATCHED EVENTS:
1,257
PLATFORM COVERAGE:
4
Polymarket:
49%
VS.
Kalshi:
51%
Time left: 12d:18h:51m
$
$20
$50
$100
$500
This market tracks which company owns the AI model ranked first on the Chatbot Arena LLM Leaderboard's Math category. Across Polymarket and Kalshi, the consensus probability for the leading outcome stands at 55.0%, with Google's model at 41.5%. Both platforms reference the Chatbot Arena LLM Leaderboard at https://lmarena.ai/ as the resolution source, checking the Math category rankings with style control off. Watch the leaderboard standings as they stand on June 30, 2026, at 12:00 PM ET, when this market will resolve based on which company's model occupies the #1 rank at that specific checkpoint.
This market will resolve according to the company that owns the model that has the highest arena rank based on the Chatbot Arena LLM Leaderboard (https://lmarena.ai/) when the table under the "Leaderboard" tab for "Math" is checked on June 30, 2026, 12:00 PM ET. Results from the "Rank" column under the "Text Arena | Math" Leaderboard tab at https://arena.ai/leaderboard/text/math-no-style-control with style control off will be used to resolve this market. Models will be ordered primarily by their leaderboard rank at the market’s check time. If two or more models are tied on rank, they will be ordered by their Arena score, including any underlying, unrounded, granular values reflected in the data below the leaderboard. If a tie still remains, alphabetical order of company names as listed in this market group will be used as a final tiebreaker (e.g., if the two models are tied by exact arena score, “Google” would be ranked ahead of “xAI”). This market will resolve based on the company that occupies first place under this ranking. The resolution source for this market is the Chatbot Arena LLM Leaderboard found at https://lmarena.ai/. If this resolution source is unavailable at check time, this market will remain open until the leaderboard comes back online and will resolve based on the first check after it becomes available. If it becomes permanently unavailable, this market will resolve based on another resolution source.
The top-ranked model on the Arena AI Text Arena Leaderboard (Math) Style Control Off at 10:00AM ET on June 30, 2026 determines the resolution. When models are tied under Rank, the tiebreaker hierarchy is applied in sequence: first, the model with the highest Arena Score; second, the model with the most votes; third, the model that was released earlier. This sequential tiebreaker process ensures a single definitive outcome.
Prediction market odds often diverge from traditional analyst reports because traders incorporate real-time information, leaked benchmarks, and community intelligence that formal research may lag. This market prices in live sentiment from AI researchers, investors, and technologists who trade on proprietary insights or early product signals. Unlike analyst forecasts published quarterly or annually, market odds update continuously as new evidence emerges. Prediction markets also aggregate dispersed knowledge across thousands of participants, sometimes surfacing consensus before mainstream media coverage. However, analyst reports may include deeper technical context or longer-term strategic views that spot markets haven't yet priced in, making both sources complementary for a complete picture.
Kalshi and Polymarket may show different odds on the same outcome due to variations in user base composition, liquidity depth, and settlement rule interpretation. Kalshi and Polymarket can show different implied probabilities for the same outcome because of liquidity, fee structure, participant mix, and how each venue defines the contract. Each platform attracts traders with distinct risk appetites and information sources, so one may react faster to news than the other. Liquidity imbalances can also widen spreads; if one platform has deeper order books, its price may be more resilient to large trades. Additionally, subtle differences in how each platform defines or verifies the winning company can create pricing gaps. Savvy traders monitor both venues to identify arbitrage opportunities, though transaction costs and withdrawal delays typically prevent perfect convergence.
This market resolves around Jun 30, 2026, at which point the outcome is confirmed based on credible public reporting and benchmark results available at that time. The winning company will be identified through peer-reviewed publications, official model releases, standardized math competition leaderboards, or widely recognized AI evaluation frameworks. Resolution hinges on verifiable evidence of which organization's model demonstrates superior mathematical reasoning capabilities as of the end date. Once the outcome is clear from public sources, the market settles and traders receive payouts proportional to their positions. The exact timing of settlement may vary slightly depending on how quickly definitive information becomes available.
Major catalysts include new model releases from leading AI labs, published benchmark results on standardized math tests, and announcements of research breakthroughs in mathematical reasoning. Academic papers demonstrating novel approaches to symbolic reasoning or theorem proving could shift trader conviction significantly. Hiring announcements, funding rounds, or partnerships that signal capability investments may also move odds. Leaked internal evaluations or early access reports from researchers can trigger rapid repricing. Competitive product launches or public demonstrations at conferences will likely drive volatility. Additionally, regulatory developments or safety concerns affecting specific companies could indirectly influence market sentiment. Traders monitor AI research communities, preprint servers, and industry news closely for any signal suggesting which company is pulling ahead in this critical capability race.
Follow the signals, not the noise
Get insights on market conviction, notable shifts, and what the data is quietly signaling.