TL;DR

Anthropic s Advisor Strategy lets Sonnet consult Opus on hard decisions within one API call. 2.7% SWE-bench gain, 11.9% cost cut. Full implementation guide.

Every developer using the Claude API faces the same dilemma. Opus 4.6 is measurably better at complex reasoning and ambiguous decision-making, but it costs significantly more per token than Sonnet or Haiku. Running everything through Opus is expensive. Running everything through Sonnet means occasionally getting weaker outputs on the hard problems where Opus would have excelled. Until April 9, 2026, the only solution was manual model routing — deciding upfront which tasks go to which model and hardcoding that logic into your application. Anthropic just shipped a better answer: the Advisor Strategy.

What Is the Advisor Strategy?

The Advisor Strategy is a new server-side tool — advisor_20260301 — built into the Claude Platform that lets an executor model (Sonnet 4.6 or Haiku 4.5) silently consult Opus 4.6 on the hard parts of a task, all within a single /v1/messages API call. Your user never sees the consultation happen. Your token costs stay mostly at Sonnet or Haiku rates. But the reasoning quality on genuinely difficult decisions approaches Opus-level.

Anthropic’s published benchmark results validate the claim: Sonnet 4.6 with an Opus 4.6 advisor scores 74.8% on SWE-bench Multilingual, up from 72.1% for Sonnet alone — a 2.7 percentage point gain — while costing 11.9% less per agentic task than running on pure Opus. That combination is unusual: most AI capability improvements trade cost for quality, or quality for cost. The Advisor Strategy claims to improve both simultaneously, which is worth understanding in detail.

The Architecture: Executor and Advisor

The mental model is straightforward. Your agent operates in two roles:

Executor — The model the user interacts with. It calls tools, reads results, iterates, and writes the final answer. This runs on Sonnet or Haiku, handling everything routine and consuming the bulk of the tokens.
Advisor — Opus 4.6, watching silently. When the executor encounters something genuinely hard — a tricky architectural decision, ambiguous requirements, conflicting context — it calls the advisor_20260301 tool and escalates. Opus reads the shared context and returns a plan, a correction, or a stop signal. It does not call tools. It does not write the final answer. It just advises.

The key design insight is that Opus does not need to process every step. The expensive reasoning only activates when the executor signals uncertainty. In a twenty-step coding task, perhaps four steps require Opus-level judgment: the initial architectural decision, a tricky edge case midway through, an unexpected error, and a final review. The Advisor Strategy structures computation around that reality rather than paying Opus rates for all twenty steps.

This maps closely to how expert consultants work in human organizations. A junior developer handles routine implementation and escalates to a senior architect only when the problem is genuinely hard. The senior architect does not rewrite every function — they provide targeted judgment at the right moments. The Advisor Strategy automates that escalation pattern within a single API call.

Configuration	SWE-bench Multilingual	Cost vs Pure Sonnet	Cost vs Pure Opus
Pure Sonnet 4.6	72.1%	1x (baseline)	~0.30x
Sonnet 4.6 + Opus Advisor	74.8%	~1.05x	~0.88x
Pure Opus 4.6	~76.2%	~3.30x	1x (baseline)

What Is the Advisor Strategy?

The Architecture: Executor and Advisor

Try Our Free Tools

JSON Formatter & Validator

cURL to Code Converter

More from AI Tools & Tutorials

OpenAI Codex Goal Mode Is Now GA — Multi-Hour Autonomous Coding Sessions

How to Implement the Advisor Strategy

Step 1: Add the Beta Header

Step 2: Declare the Advisor Tool

Step 3: Tune Your System Prompt

SWE-bench Benchmarks: What the Numbers Mean

When to Use the Advisor Strategy

High-Value Applications

Where It Adds Less Value

Cost Analysis: The Real Numbers

How the Advisor Strategy and Claude Managed Agents Fit Together

Developer Recommendation

Ready to ship faster?

One insight, every Monday. 7am IST. Zero fluff.

Comments · 0

Key takeaways · 4

Topics

Article stats

Regex Playground

Base64 Encoder / Decoder

UUID Generator

GitHub Copilot Token Billing Week 1: What Developers Are Actually Paying

Claude Sonnet 4.8 Evidence Found in Anthropic Source Maps — What We Know

xAI Launched Grok Build — A Terminal Coding Agent to Fight Claude Code and Codex

OpenAI Dreaming V3: ChatGPT Now Learns While You Sleep

DeepSeek V4 Pro Just Got 75% Cheaper — What It Means for Your AI Stack