← Blog
explainx / blog

OpenRouter Fusion API: Fable-Level AI at Half the Price (2026)

OpenRouter launched Fusion—a compound model that queries multiple frontier LLMs in parallel with a judge synthesizer. Near Fable 5 benchmarks at ~50% cost, timed as Claude Fable 5 goes offline.

5 min readYash Thakker
OpenRouterFusion APIFable 5AI ModelsCompound ModelsAPI

MDX restores the committed source plus an HTML comment attribution; plain text bundles the rendered markdown body with the explainx.ai attribution footer.

OpenRouter Fusion API: Fable-Level AI at Half the Price (2026)

On June 12–13, 2026, two stories collided on X: Anthropic suspended Claude Fable 5 under a US government directive—and OpenRouter launched Fusion, a compound model API that @OpenRouter bills as "Fable-level intelligence at half the price."

CEO Alex Atallah framed the thesis:

The future of AI is neurodiversity, not single-model takeovers.

Fusion is not one bigger model. It is a panel of frontier models answering in parallel, a judge synthesizing consensus and contradictions, and a final writer producing one answer—accessible as "model": "openrouter/fusion" on any OpenAI-compatible client.

This post explains how Fusion works, benchmark claims, pricing mechanics, and when to use it—especially if Fable 5 is offline in your stack.


TL;DR

QuestionAnswer
WhatMulti-model deliberation API (panel + judge + final answer)
Model aliasopenrouter/fusion
Default panelClaude Opus + GPT + Gemini Pro (Quality preset)
Performance claim~Fable 5 level on deep research; ~50% cost vs premium solo models
Benchmarks cited~69% premium panel / ~64.7% budget on tough research suites
Best forDeep research, expert critique, high-stakes analysis
Avoid forQuick chat, low-latency tasks, simple Q&A
Docsopenrouter.ai/docs/guides/features/plugins/fusion
Playgroundopenrouter.ai/labs/fusion

The Fable 5 Timing

Fable 5 and Mythos 5 were suspended June 12, 2026 after a Commerce Department directive. API calls to claude-fable-5 error; new Claude sessions fall back to Opus 4.8.

OpenRouter's launch landed the same week. @jjacky summarized the developer mood:

fable 5 down for 12 hours… fear not — @OpenRouter fusion is here. we combined a panel of models and came within 1% of fable 5's perf at half the cost. simply "model": "openrouter/fusion".

Fusion does not replicate Fable 5—it routes around single-vendor dependence by combining Opus, GPT-5.x-class, and Gemini outputs. See when Fable 5 might return for restoration timelines.


How Fusion Works

From OpenRouter's Fusion documentation:

Your request → Your model decides whether to invoke fusion
            → Panel (1–8 models) answers in parallel + web_search + web_fetch
            → Judge compares → structured JSON (consensus, contradictions, blind spots)
            → Final model writes answer from analysis

Judge output structure

The judge does not merge text blindly. It returns:

FieldMeaning
ConsensusPoints most models agree on (higher confidence)
ContradictionsDirect disagreements between panel members
Partial coverageTopics only some models addressed
Unique insightsIdeas from individual models
Blind spotsGaps none of the panel covered

Default Quality panel

RoleDefault models
Panel~anthropic/claude-opus-latest, ~openai/gpt-latest, ~google/gemini-pro-latest
JudgeFirst panel model (or configured via plugin)
Final answerWith openrouter/fusion alias, judge also writes final response

Each panel member runs with openrouter:web_search and openrouter:web_fetch enabled (up to 8 tool-calling steps by default).

Recursion protection

Inner calls carry x-openrouter-fusion-depth. Panel and judge models cannot invoke fusion recursively—deliberation stays one level deep.


Two Ways to Call Fusion

Option 1 — Model alias (simplest):

{
  "model": "openrouter/fusion",
  "messages": [
    { "role": "user", "content": "Compare ridge, lasso, and elastic-net regression." }
  ]
}

Option 2 — Server tool on your own model:

{
  "model": "~anthropic/claude-opus-latest",
  "messages": [{ "role": "user", "content": "..." }],
  "tools": [{ "type": "openrouter:fusion" }]
}

Both hit the same pipeline. Your model decides when fusion is worth the extra cost.


Full API Example (TypeScript)

const response = await fetch('https://openrouter.ai/api/v1/chat/completions', {
  method: 'POST',
  headers: {
    Authorization: `Bearer ${process.env.OPENROUTER_API_KEY}`,
    'Content-Type': 'application/json',
  },
  body: JSON.stringify({
    model: 'openrouter/fusion',
    messages: [
      {
        role: 'user',
        content: 'What are the strongest arguments for and against carbon taxes?',
      },
    ],
    plugins: [
      {
        id: 'fusion',
        analysis_models: [
          '~anthropic/claude-opus-latest',
          '~openai/gpt-latest',
        ],
      },
    ],
  }),
});

const data = await response.json();
console.log(data.choices[0].message.content);

OpenRouter bills each panel completion + judge call separately—check Activity for which models ran.


Performance and Pricing Claims

ClaimSource
Fable-level at ~50% costOpenRouter launch posts, Alex Atallah
Within 1% of Fable 5 (Budget panel)@jjacky, community benchmarks
69% premium / 64.7% budget on tough researchGrok/X news aggregation, third-party tests
Beats solo GPT-5.5 and Opus 4.8 on 100 complex research queriesValueTheMarkets reporting

Treat benchmark numbers as directional until you reproduce on your task mix. Fusion optimizes for analytical depth, not latency.

When Fusion wins

  • Multi-step research with web grounding
  • Decisions where wrong answers are costly
  • Tasks benefiting from model diversity (legal, medical, financial analysis— with human verification)

When Fusion loses

  • Sub-second chat responses
  • Simple code completion
  • High-volume batch jobs where token multiplication hurts

Fusion vs Other Multi-Model Patterns

ApproachMechanismVendor
OpenRouter FusionPanel + judge + web toolsOpenRouter (500+ models)
Claude Fable 5Single Anthropic frontier modelAnthropic (currently suspended)
Manual LLM councilYou orchestrate prompts yourselfAny
OpenAI deep researchSingle-vendor agentic searchOpenAI

OpenRouter's pitch: drop-in via openrouter/fusion on existing OpenAI-compatible stacks—no custom orchestration code.

For routing without fusion, OpenRouter also offers Auto Router and Pareto Code Router (coding-optimized model selection).


Who Should Try Fusion Now

AudienceWhy
Devs blocked on Fable 5Ensemble may cover depth gap until restoration
Research pipelinesBuilt-in web search per panel member
Cost-conscious teamsBudget preset vs premium solo frontier
Multi-vendor strategistsReduce single-point model failure

If you need Anthropic-specific tooling (Claude Code, MCP, managed agents), Fusion is API-only—it does not replace Claude Code workflows.


Summary

OpenRouter Fusion is a compound-model API: parallel frontier panels, structured judge analysis, and a single synthesized answer via openrouter/fusion. OpenRouter claims near-Fable 5 research performance at roughly half the cost—launching as Fable 5 went offline makes it the most timely alternative for developers who depended on Anthropic's top tier.

The tradeoff is explicit: more tokens, more latency, more intelligence per dollar on hard questions. Try the Fusion lab playground before wiring production pipelines.


Related Reading

Fusion API behavior and defaults cited from OpenRouter Fusion docs as of June 13, 2026. Benchmark figures from OpenRouter launch materials and third-party reporting—verify on your workloads.

Related posts