CapitalBench Score
Max possible = best eligible asset in each included round. Every ranked model has the same included rounds. Calculation.
Benchmark score Higher benchmark score is better.
CapitalBench
We give frontier AI models the same market brief, freeze their portfolios, and score them against real market returns.
Study how AI models behave in capital-allocation rounds, and which ones actually perform.
Read the CapitalBench ManifestoCurrent benchmarks rank models only inside equal-run comparison sets. Switch monthly/weekly.
Max possible = best eligible asset in each included round. Every ranked model has the same included rounds. Calculation.
Benchmark score Higher benchmark score is better.
Average portfolio return across the same finished rounds.
Max possible = best eligible asset in each included round. Every ranked model has the same included rounds. Calculation.
Benchmark score Higher benchmark score is better.
Average portfolio return across the same finished rounds.
Every model reads the same market report.
Every model chooses from the same 70 assets.
Each model's saved portfolio is frozen before results are known.
The frozen portfolio sits untouched for 7 days or 1 month.
Real ending prices decide which model did best.
Live rounds only. This is the current market condition as expressed by frozen model portfolios, before final scores are known.
A compact readout from the insight engine, focused on current positioning, live marks, model agreement, and latest official results.
Across the newest live weekly and monthly portfolios, Semiconductors (SMH) is the largest aggregate allocation at +28.50%.
Aggregate allocation averages the newest live model portfolios before final scores are known.
The newest live portfolios have a deterministic risk-taking score of 77.8 out of 100.
Risk-taking score is allocation-based, not performance-based: higher means more weight in growth, momentum, cyclical, and higher-risk assets.
The newest weekly portfolios allocate +55.00% to growth and technology, while the newest monthly portfolios allocate +43.00%.
Horizon agreement compares the newest weekly and monthly live portfolios to see whether short- and longer-window model stances line up.
CapitalBench tracks whether each model behaves like a risk-seeker, concentrator, defensive allocator, consensus follower, or distinctive outlier across official frozen portfolios.
Latest monthly and weekly portfolios, equal-weighted by track. This measures current model positioning, not market returns or a trading recommendation.
18.9% points to Semiconductors (SMH), while US Equity accounts for 60.8% of open portfolios.
Live rounds marked to the latest available close. These are not final scores.
Interim returns use live rounds only. Completed rounds move to official scored results.
Switch between monthly and weekly results, then move backward or forward through completed rounds in that track.
Frozen model portfolios scored after the one-month window. Live rounds stay out until final prices are available.
Shows each model's saved portfolio weights.
Ranked in the same order as the chart.
Not model portfolios.
Benchmark return over the same scoring window
100% Biotechnology (XBI) hindsight ceiling
Audit ID: CB-2026-06-01-1M
Frozen model portfolios scored after the one-month window. Live rounds stay out until final prices are available.
Shows each model's saved portfolio weights.
Ranked in the same order as the chart.
Not model portfolios.
Benchmark return over the same scoring window
100% Biotechnology (XBI) hindsight ceiling
Audit ID: CB-2026-05-29-1M
Frozen model portfolios scored after the one-month window. Live rounds stay out until final prices are available.
Shows each model's saved portfolio weights.
Ranked in the same order as the chart.
Not model portfolios.
Benchmark return over the same scoring window
100% Biotechnology (XBI) hindsight ceiling
Audit ID: CB-2026-05-28-1M
Frozen model portfolios scored after the one-month window. Live rounds stay out until final prices are available.
Shows each model's saved portfolio weights.
Ranked in the same order as the chart.
Not model portfolios.
Benchmark return over the same scoring window
100% Biotechnology (XBI) hindsight ceiling
Audit ID: CB-2026-05-24-1M
Frozen model portfolios scored after the one-month window. Live rounds stay out until final prices are available.
Shows each model's saved portfolio weights.
Ranked in the same order as the chart.
Not model portfolios.
Benchmark return over the same scoring window
100% Taiwan Equities (EWT) hindsight ceiling
Audit ID: CB-2026-05-17-1M
Frozen model portfolios scored after the one-month window. Live rounds stay out until final prices are available.
Shows each model's saved portfolio weights.
Ranked in the same order as the chart.
Not model portfolios.
Benchmark return over the same scoring window
100% Healthcare Sector (XLV) hindsight ceiling
Audit ID: CB-2026-05-10-1M
Frozen model portfolios scored after the one-week window. Live rounds stay out until final prices are available.
Shows each model's saved portfolio weights.
Ranked in the same order as the chart.
Not model portfolios.
Benchmark return over the same scoring window
100% Biotechnology (XBI) hindsight ceiling
Audit ID: CB-2026-06-22-1W
Frozen model portfolios scored after the one-week window. Live rounds stay out until final prices are available.
Shows each model's saved portfolio weights.
Ranked in the same order as the chart.
Not model portfolios.
Benchmark return over the same scoring window
100% Biotechnology (XBI) hindsight ceiling
Audit ID: CB-2026-06-18-1W
Frozen model portfolios scored after the one-week window. Live rounds stay out until final prices are available.
Shows each model's saved portfolio weights.
Ranked in the same order as the chart.
Not model portfolios.
Benchmark return over the same scoring window
100% Biotechnology (XBI) hindsight ceiling
Audit ID: CB-2026-06-17-1W
Frozen model portfolios scored after the one-week window. Live rounds stay out until final prices are available.
Shows each model's saved portfolio weights.
Ranked in the same order as the chart.
Not model portfolios.
Benchmark return over the same scoring window
100% Biotechnology (XBI) hindsight ceiling
Audit ID: CB-2026-06-16-1W
Frozen model portfolios scored after the one-week window. Live rounds stay out until final prices are available.
Shows each model's saved portfolio weights.
Ranked in the same order as the chart.
Not model portfolios.
Benchmark return over the same scoring window
100% Biotechnology (XBI) hindsight ceiling
Audit ID: CB-2026-06-15-1W
Frozen model portfolios scored after the one-week window. Live rounds stay out until final prices are available.
Shows each model's saved portfolio weights.
Ranked in the same order as the chart.
Not model portfolios.
Benchmark return over the same scoring window
100% South Korea Equities (EWY) hindsight ceiling
Audit ID: CB-2026-06-13-1W
Frozen model portfolios scored after the one-week window. Live rounds stay out until final prices are available.
Shows each model's saved portfolio weights.
Ranked in the same order as the chart.
Not model portfolios.
Benchmark return over the same scoring window
100% South Korea Equities (EWY) hindsight ceiling
Audit ID: CB-2026-06-12-1W
Frozen model portfolios scored after the one-week window. Live rounds stay out until final prices are available.
Shows each model's saved portfolio weights.
Ranked in the same order as the chart.
Not model portfolios.
Benchmark return over the same scoring window
100% South Korea Equities (EWY) hindsight ceiling
Audit ID: CB-2026-06-09-1W
Frozen model portfolios scored after the one-week window. Live rounds stay out until final prices are available.
Shows each model's saved portfolio weights.
Ranked in the same order as the chart.
Not model portfolios.
Benchmark return over the same scoring window
100% South Korea Equities (EWY) hindsight ceiling
Audit ID: CB-2026-06-08-1W
Frozen model portfolios scored after the one-week window. Live rounds stay out until final prices are available.
Shows each model's saved portfolio weights.
Ranked in the same order as the chart.
Not model portfolios.
Benchmark return over the same scoring window
100% South Korea Equities (EWY) hindsight ceiling
Audit ID: CB-2026-06-05-1W
Frozen model portfolios scored after the one-week window. Live rounds stay out until final prices are available.
Shows each model's saved portfolio weights.
Ranked in the same order as the chart.
Not model portfolios.
Benchmark return over the same scoring window
100% Healthcare Sector (XLV) hindsight ceiling
Audit ID: CB-2026-06-03-1W
Frozen model portfolios scored after the one-week window. Live rounds stay out until final prices are available.
Shows each model's saved portfolio weights.
Ranked in the same order as the chart.
Not model portfolios.
Benchmark return over the same scoring window
100% Healthcare Sector (XLV) hindsight ceiling
Audit ID: CB-2026-06-02-1W
Frozen model portfolios scored after the one-week window. Live rounds stay out until final prices are available.
Shows each model's saved portfolio weights.
Ranked in the same order as the chart.
Not model portfolios.
Benchmark return over the same scoring window
100% Crude Oil (USO) hindsight ceiling
Audit ID: CB-2026-06-01-1W
Frozen model portfolios scored after the one-week window. Live rounds stay out until final prices are available.
Shows each model's saved portfolio weights.
Ranked in the same order as the chart.
Not model portfolios.
Benchmark return over the same scoring window
100% Crude Oil (USO) hindsight ceiling
Audit ID: CB-2026-05-29-1W
Frozen model portfolios scored after the one-week window. Live rounds stay out until final prices are available.
Shows each model's saved portfolio weights.
Ranked in the same order as the chart.
Not model portfolios.
Benchmark return over the same scoring window
100% Semiconductors (SMH) hindsight ceiling
Audit ID: CB-2026-05-28-1W
Frozen model portfolios scored after the one-week window. Live rounds stay out until final prices are available.
Shows each model's saved portfolio weights.
Ranked in the same order as the chart.
Not model portfolios.
Benchmark return over the same scoring window
100% Software (IGV) hindsight ceiling
Audit ID: CB-2026-05-27-1W
Frozen model portfolios scored after the one-week window. Live rounds stay out until final prices are available.
Shows each model's saved portfolio weights.
Ranked in the same order as the chart.
Not model portfolios.
Benchmark return over the same scoring window
100% South Korea Equities (EWY) hindsight ceiling
Audit ID: CB-2026-05-24-1W
Models get the same report, choose from the same assets, and wait for weekly or monthly scoring.
Allocation-weighted from every official frozen portfolio, including live and completed rounds. It does not use future returns and is separate from the current AI Risk Appetite signal.
These are the saved model portfolios for the newest monthly and weekly rounds. They are waiting for final prices.
2026-06-29 to 2026-07-29
2026-06-29 to 2026-07-06
Every model gets the same market report.
Every model allocates from the same asset list.
Model portfolios are locked before results are known.
After 7 days or 1 month, real prices decide the result.
A 1-month round and a 7-day round are different contests. They get separate scores and separate overall results.
These are the open tests you can inspect now. Models already submitted portfolios; official scores wait for final closing prices.
Longer test of AI allocation over one month.
Portfolios locked; scoring pending.
Short-term test of AI positioning over one market week.
Portfolios locked; scoring pending.
Internal IDs and full reproducibility files are inside each audit packet.
Models have already picked portfolios. Official scores publish only after the market window ends and final closing prices are available.
5 model portfolios locked and waiting for official scoring.
5 model portfolios locked and waiting for official scoring.
Round pages show the report, prompt, model portfolios, starting prices, source reports, hashes, and result status behind each public benchmark round.
CapitalBench keeps the comparison narrow: same report, same asset list, frozen portfolios, and no final result before the round ends.
The public score uses the saved portfolio, not private retries or experiments.
Each round keeps the exact asset list, report, model output, starting prices, and audit hashes.
Final results appear only after ending prices are available.