Question 1

What is LM Council?

Accepted Answer

LM Council is a multi-model workspace that lets you orchestrate simultaneous responses from GPT-5, Claude Opus 4, Gemini 2.5, Grok 4, and other frontier AI systems inside a single conversation.

Question 2

What are Projects and how do they work?

Accepted Answer

Projects are topic workspaces that group chats and store reusable context. Project notes and up to 25 project files are included in project replies, and saved model slots can act as the project default models. Adding a model reply or chat summary saves it as a project file for future context.

Question 3

How do I customize which models participate?

Accepted Answer

You can add, replace, or remove models from each slot, letting you mix GPT-5, Claude Opus 4, Gemini 2.5 Pro, Grok 4, O3, and other providers in one coordinated exchange.

Question 4

Can I direct a prompt to a specific model?

Accepted Answer

Use the at-mention picker to target individual models. If no model is selected, every member of the council responds and you can compare their reasoning side-by-side.

Question 5

How do shared conversations work?

Accepted Answer

Any transcript can be saved as a read-only link, making it easy to showcase comparisons or hand off results to teammates without exposing private data. When someone creates and verifies an account directly from a page you shared, you also progress toward Rewards like custom emoji defaults and the Mystery Model category.

Question 6

What rewards can I unlock by sharing?

Accepted Answer

Sharing chats or code previews that lead to a verified signup unlocks new perks: one verification lets you set the default emoji on the composer button, and five verifications unlock the gold Mystery Model category that loads a random model you can reveal on demand.

Question 7

What plans are available?

Accepted Answer

A free tier provides community models and baseline credits, while Plus ($8/month, 350K credits), Pro ($14/month, 700K credits), and MAX ($49/month, 2.8M credits) unlock premium systems including GPT-5, Claude Opus 4, Gemini 2.5, automation features, and higher monthly allowances.

Question 8

What is the Leader feature?

Accepted Answer

Leader mode assigns one model to synthesize the council, producing a consolidated summary or recommendation once every other model has finished responding.

Question 9

What are Credit Blocks?

Accepted Answer

Credit Blocks are one-time credit purchases available to Plus, Pro, and MAX subscribers. Four packs are offered: Spark (500K credits, $18), Charge (1M credits, $35), Surge (5M credits, $170), and Vault (10M credits, $330). Purchased credits are added to a reserve balance that is consumed only after your monthly allowance is exhausted. Reserve credits never expire while subscribed; if you cancel, a 30-day grace period applies before credits are frozen until re-subscription or a new Credit Block purchase.

Question 10

Why do Past Chats and Audio sometimes show only recent items first?

Accepted Answer

Past Chats now loads recent history first for faster app startup. You can press Load older chats (or Load older chats for audio) to continue scanning older items, and chat search can continue finding matches as older pages load.

	Models (no tools)	Score
1	Gemini 3.1 Pro Preview (high thinking)	46.4% ±2.0
2	GPT-5.4 Pro	44.3% ±2.0
3	Muse Spark	40.6% ±1.9
4	Gemini 3 Pro Preview	37.5% ±1.9
5	GPT-5.4 (xhigh)	36.2% ±1.9

	Model	Score
1	Claude Fable 5	81.9%
2	Gemini 3.1 Pro Preview	79.6%
3	GPT-5.5 Pro	76.9%
4	Gemini 3.5 Flash	76.7%
5	Gemini 3 Pro Preview	76.4%

	Model	Minutes
1	Claude Mythos Preview	1044.8
2	Claude Opus 4.6 (unknown thinking)	718.8
3	Gemini 3.1 Pro Preview	384.1
4	GPT-5.2 (high)	352.2
5	GPT-5.3 Codex	349.5

	Model	Score
1	Claude Opus 4.7 (max)	83.5% ±1.7
2	GPT-5.5 (xhigh)	80.6% ±1.8
3	Gemini 3.5 Flash (high)	79.3% ±1.8
4	Claude Opus 4.6 (no thinking)	78.7% ±1.9
5	GPT-5.4 (high)	76.9% ±1.9

	Model	Score
1	GPT-5.4 Pro (xhigh)	94.6% ±1.6
2	Gemini 3.1 Pro Preview	94.1% ±1.7
3	GPT-5.5 (xhigh)	94.0% ±1.5
4	GPT-5.5 Pro (xhigh)	93.9% ±1.6
5	GPT-5.4 (xhigh)	93.3% ±1.8

AI Model Benchmarks Jun 2026

Compare Models

Humanity's Last Exam

SimpleBench

METR Time Horizons

SWE-bench Verified

GPQA Diamond

GDPval

Text Arena (Coding)

GSO (General Speedup Optimization)

Fiction.liveBench

BALROG

OTIS Mock AIME 2024-25

MATH Level 5

FrontierMath Tiers 1-3 (v2)

FrontierMath Tier 4 (v2)

WeirdML v2

Terminal-Bench 2.0

VPCT (Visual Physics Comprehension Test)

GeoBench

	Model	Score
1	GPT-5.2	49.7%
2	Claude Opus 4.5	45.5%
3	Claude Opus 4.1	43.6%
4	Claude Sonnet 4.5	42.5%
5	Gemini 3 Pro Preview	40.3%

	Model	Score
1	Claude Opus 4.7	1566.9
2	Claude Opus 4.6	1556.3
3	Claude Opus 4.8	1552.2
4	Qwen3.7-Max	1540.8
5	GLM-5.1	1534.0

	Model	Score
1	Claude Opus 4.7	44.1%
2	Claude Opus 4.6 (high)	41.2%
3	GPT-5.5 (xhigh)	40.2%
4	GPT-5.4 (xhigh)	31.4%
5	GPT-5.2 (high)	27.4%

	Model	Score
1	o3 (medium)	100.0%
2	GPT-5 (medium)	96.9%
3	Grok 4	96.9%
4	Gemini 2.5 Pro Exp (Mar '25)	90.6%
5	o3-pro	88.9%

	Model	Score
1	Gemini 3 Pro Preview	58.1% ±2.1
2	Gemini 3.1 Pro Preview	57.0% ±2.0
3	Gemini 3 Flash	48.1% ±2.4
4	Grok 4	43.6% ±2.2
5	Claude Opus 4.5	43.5% ±2.3

	Model	Score
1	GPT-5.5 Pro (xhigh)	100.0% ±0.0
2	GPT-5.5 (xhigh)	100.0% ±0.0
3	Claude Fable 5 (max)	99.7% ±0.3
4	Claude Opus 4.8	98.3% ±1.4
5	Claude Opus 4.7 (xhigh)	97.8% ±2.2

	Model	Score
1	GPT-5 (high)	98.1% ±0.3
2	GPT-5 (medium)	97.9% ±0.3
3	GPT-5 mini (high)	97.8% ±0.3
4	o4-mini (high)	97.8% ±0.3
5	o3 (high)	97.8% ±0.3

	Model	Score
1	GPT-5.5 Pro (xhigh)	87.7% ±1.9
2	Claude Fable 5 (max)	87.0% ±2.0
3	GPT-5.5 (xhigh)	85.3% ±2.1
4	Claude Opus 4.8	80.0% ±2.4
5	GPT-5.4 (xhigh)	78.6% ±2.4

	Model	Score
1	Claude Fable 5 (max)	87.8% ±5.2
2	GPT-5.5 Pro (xhigh)	78.0% ±6.5
3	AI co-mathematician	75.6% ±6.7
4	GPT-5.5 (xhigh)	72.5% ±7.1
5	Claude Opus 4.8	56.1% ±7.8

	Model	Score
1	Claude Fable 5 (high)	87.9%
2	GPT-5.5 (xhigh)	84.9%
3	Claude Opus 4.8 (xhigh)	82.9%
4	GPT-5.3 Codex	79.3%
5	Claude Opus 4.6 (high)	78.0%

	Model	Score
1	Claude Opus 4.7	90.2% ±2.1
2	GPT-5.5	84.7% ±2.1
3	GPT-5.4	81.8% ±2.0
4	Gemini 3.1 Pro Preview	80.2% ±2.6
5	Claude Opus 4.6	79.8% ±1.6

	Model	Score
1	Gemini 3 Pro Preview	91.0%
2	GPT-5.2 (xhigh)	84.0%
3	Gemini 3 Flash	72.6%
4	GPT-5.2 (high)	67.0%
5	GPT-5 (high)	66.0%

	Model	Score
1	Gemini 3 Pro Preview	3893
2	Gemini 2.5 Pro Preview (May '25)	3836
3	o3 (high)	3789
4	Gemini 2.0 Flash (Feb '25)	3659
5	GPT-5 (medium)	3498