What is an AI agent orchestrator? Patterns and costs explained | Tom & Co

An AI agent orchestrator is the control layer that coordinates several AI agents so they work as one system. It decides which agent runs, in what order, what context each one gets, and how their outputs combine into a single result. Think of it as the conductor: the individual agents play, the orchestrator sets the sequence, manages handoffs, and keeps the whole thing coherent. Anthropic's orchestrator-worker research system scored 90.2% higher than a single agent on its internal eval.

What does an AI agent orchestrator actually do?

A single AI agent takes a goal, picks tools, and acts in a loop until it is done. That works until the job gets too big for one context window, needs several specialisms, or benefits from running tasks in parallel. At that point you need something coordinating multiple agents. That something is the orchestrator.

The orchestrator sits above the agents and owns four jobs. It plans the work, delegates each part to the right agent, passes the context that agent needs, and aggregates the results. Microsoft's Azure AI agent orchestration patterns guide (updated 12 February 2026) frames it as managing work distribution, context sharing, and result aggregation across specialised agents.

Here is the distinction that trips people up. An agent is a worker that reasons and acts. An orchestrator is the coordination logic that manages several workers. In Anthropic's multi-agent research system, a lead agent analyses the query, develops a strategy, and spawns subagents to explore different aspects at once. The lead agent is the orchestrator.

Crucially, an orchestrator is not always a separate piece of software. It can be a dedicated planner agent, a framework's routing logic, or a fixed pipeline you code by hand. What makes it an orchestrator is the role it plays, not the technology. It is whatever holds the plan and decides what happens next.

How is an orchestrator different from a single AI agent?

The cleanest way to see the difference is to compare the two side by side on the dimensions that matter when you are deciding which to build. A single agent is simpler and cheaper. An orchestrated multi-agent system is more capable on large, parallel, multi-skill tasks, and it costs a great deal more to run.

Dimension	Single AI agent	AI agent orchestrator (multi-agent)
Structure	One agent, one context window, one tool loop	A coordinating layer over several specialised agents
Best for	Well-scoped tasks a single reasoning loop can finish	Heavy parallelisation, work exceeding one context window, many complex tools
Token cost	~4x a normal chat (Anthropic figure)	~15x a normal chat (Anthropic figure)
Reliability risk	Context overload, tool overload on one agent	Coordination overhead, handoff failures, harder to debug
Latency	Lower, sequential by nature	Can be lower on parallel work, higher when agents depend on each other
When to avoid	Task needs several specialisms or parallel search	All agents need the same shared context, or heavy inter-agent dependencies

Anthropic is direct about the trade-off. Its multi-agent system used about 15x the tokens of a chat, and roughly 4x for a single agent. Token usage alone explained 80% of the performance variance on its BrowseComp evaluation. More tokens, spread across agents with their own context windows, buys more reasoning capacity, at a price.

What are the main AI agent orchestration patterns?

Orchestration is not one shape. The Azure architecture guide names five patterns, each suited to a different kind of coordination. Picking the right one is most of the design work. Use the simplest pattern that reliably does the job, because every extra layer adds latency, cost, and new ways to fail.

Sequential (pipeline)

Agents run in a fixed line, each working on the previous one's output. Also called prompt chaining or linear delegation. Good for step-by-step jobs with clear dependencies, like a contract that goes template, then clauses, then compliance check, then risk review. The next agent is chosen deterministically, not by the agents themselves.

Concurrent (parallel)

Several agents work the same task at once from different angles, and their results are aggregated. Also called fan-out/fan-in or scatter-gather. This is the pattern behind Anthropic's research system, where subagents search different aspects in parallel. It cuts run time and widens coverage, at the cost of more tokens.

Group chat

Agents collaborate in one shared thread, with a chat manager deciding who speaks next. Also called roundtable or multi-agent debate. It suits maker-checker loops and human-in-the-loop review, because everything lands in one auditable conversation. Azure's guidance suggests limiting it to three or fewer agents to keep control.

Handoff (routing)

One agent assesses a task and either handles it or transfers it to a better-suited agent. Also called triage or dispatch. It is the standard pattern for customer support: a triage agent routes billing questions to a billing agent and technical questions to a technical agent. Only one agent works the task at a time.

Magentic (dynamic planning)

For open-ended problems with no set plan. A manager agent builds a task ledger, invokes tool-using agents, checks progress against the goal, and refines the plan as it goes. Azure's example is a site-reliability team automating incident response, where the remediation steps are not known upfront and have to be worked out live.

What does the data show on orchestration cost versus payoff?

The payoff is real and so is the bill. On Anthropic's internal research eval, a system using Claude Opus 4 as the orchestrator with Claude Sonnet 4 subagents beat a single Claude Opus 4 agent by 90.2%. The task was finding every board member across S&P 500 IT companies, exactly the kind of wide, parallel search a single agent struggles with.

Tom & Co analysis of Anthropic's published token figures puts the orchestration premium at roughly 3.75x a single agent (about 15x a chat for the multi-agent system, divided by about 4x a chat for one agent). So an orchestrated system that would cost a single agent £100 in model spend runs closer to £375 for the same underlying request.

That premium is the number to hold in your head. Orchestration is worth paying for when the task genuinely needs parallel work, more context than one window holds, or several tool-heavy specialisms. Anthropic is explicit that it is a poor fit when every agent needs the same shared context or when agents depend heavily on each other.

The wider market is moving fast regardless. Gartner predicts that 40% of enterprise applications will feature task-specific AI agents by the end of 2026, up from under 5% in 2025, and that by 2027 a third of agentic AI deployments will combine agents with different skills. Combining agents is exactly what an orchestrator is for.

How does orchestration fit a UK business's AI stack in 2026?

Most UK firms are nowhere near needing this yet, and that is fine. The DSIT AI Adoption Research (fieldwork February to May 2025, 3,500 businesses) found only 16% of UK businesses use any AI technology, and agentic AI was the least adopted at 7%. Orchestration sits a layer above even that.

There is sector texture, though. The same DSIT research found agentic AI adoption reached 12% in agriculture, mining, manufacturing and energy, and 10% in information and communication, against 5% across businesses overall. Firms running physical operations and data-heavy workflows are further ahead, and they are the ones most likely to hit the ceiling of a single agent first.

McKinsey's State of AI 2025 (published November 2025) found 23% of organisations are scaling an agentic system in at least one function, yet no more than 10% of respondents report scaling agents in any given function. The gap between experimenting and scaling is where orchestration decisions actually bite, because a single agent can carry a pilot but rarely carries production breadth.

For a UK SME the honest sequence is: prove value with one well-scoped agent first, measure it, and only reach for orchestration when a real task defeats a single agent. The 3.75x cost premium means an orchestrator you do not need is an expensive way to add fragility. Build up to it when the work demands it, not before.

What should a UK leader do about agent orchestration this quarter?

You do not need to commission a multi-agent platform to make progress. You need to understand where a single agent stops being enough, so you spend on orchestration only when it earns its keep.

1. Find your single-agent ceiling. List the AI tasks you already run or plan to. Flag any that need parallel work, more context than one prompt holds, or three-plus specialisms. Those are your orchestration candidates. Everything else stays a single agent, cheaper and easier to trust.

2. Match the task to a pattern. Step-by-step work fits sequential. Wide research fits concurrent. Support routing fits handoff. Open-ended problem-solving fits magentic. Review-heavy work fits group chat. Naming the pattern before you build stops you over-engineering.

3. Budget for the premium. Assume an orchestrated version of a task costs roughly 3.75x a single agent in model spend. If the capability gain does not clearly beat that, the single agent wins. Put the number in the business case, not a footnote.

4. Build in observability from day one. Orchestrated systems fail in the seams: bad handoffs, lost context, agents looping. Decide how you will log each agent's inputs and outputs before you connect them, or debugging a live multi-agent system becomes guesswork.

The firms that get value from orchestration are not the ones that adopt it earliest. They are the ones that keep single agents single until a real task forces the upgrade, then pick the pattern that fits. Getting that judgement right is worth more than any framework choice.