The assumption behind multi-agent parallelism is that independent agents will catch each other’s mistakes. The assumption is wrong. Google Research put a number on the failure mode: independent parallel agents amplify errors 17x compared to centralized orchestrator topologies. A bad shared context doesn’t get corrected by adding more agents — it gets replicated to every agent simultaneously. The reliability math works in the opposite direction from what the architecture implies.

Situation

Multi-agent systems have become a standard approach for parallelizing complex LLM-backed workflows. The logic is intuitive: if one agent can complete a task in some time, ten agents working in parallel should complete ten tasks in the same time, and errors one agent makes should be caught by the others. This mirrors how teams work in practice — distribute work, verify in parallel, surface disagreements.

The parallel to human team dynamics is part of why the architecture feels sound. Engineers building distributed systems apply the same instinct: independent components with independent failure modes produce more reliable systems than single components with single failure modes.

Both intuitions are correct when the failures are independent. They break down when failures are correlated.

Human parallel teamsIndependent parallel agents
Shared contextIndependently interpreted briefingIdentical prompt and context window
Error from bad inputFiltered by independent judgmentReplicated to every agent
Disagreement mechanismDifferent backgrounds, different priorsSame model, same temperature, same weights
Correction mechanismPeer review surfaces disagreementsNo peer review — agents don’t see each other’s outputs

The Problem

A multi-agent system where each agent operates independently on shared context has a structural property that is easy to miss: the agents are not independent. They share the same prompt, the same context window contents, the same base model weights. When the shared context contains a defect — a misleading instruction, a factual error, a misconfigured tool definition — every agent processes that defect identically.

The result is not error cancellation. It is error replication.

Google Research’s work on multi-agent coordination quantified this directly. Across studied configurations, independent parallel agents amplified errors 17x compared to centralized orchestrator topologies. The mechanism is straightforward: in an independent topology, a single defect in shared context corrupts every agent simultaneously, and there is no correction mechanism because no agent has visibility into what the others are producing.

Architecture typeError propagationCorrection mechanism
Independent parallel agentsDefect replicates to all N agents simultaneouslyNone — agents operate without visibility into each other
Centralized orchestratorDefect contained to orchestrator before task dispatchOrchestrator can catch failures before propagating downstream
Sequential chainError propagates forward through the chainEach step can validate prior output before proceeding

The core question this forces: if you are adding agents to improve reliability, what specifically is the mechanism by which the additional agents correct errors rather than replicate them?

Centralized Orchestrator as an Error Containment Boundary

flowchart TD
    subgraph independent["Independent Topology"]
        I1[shared context] --> A1[agent 1]
        I1 --> A2[agent 2]
        I1 --> A3[agent N]
        A1 --> R1[result — defect replicated]
        A2 --> R1
        A3 --> R1
    end

    subgraph centralized["Centralized Orchestrator Topology"]
        C1[shared context] --> O[orchestrator — validates and routes]
        O --> B1[agent 1 — bounded task]
        O --> B2[agent 2 — bounded task]
        B1 --> O
        B2 --> O
        O --> R2[result — defect contained]
    end

The difference between the two topologies is not parallelism — both can dispatch tasks in parallel. The difference is where context flows and where errors can be caught.

In an independent topology, each agent receives the full shared context directly and returns results that are aggregated without an intermediate validation step. A defect in the context reaches all agents before anyone can catch it.

In a centralized orchestrator topology, the orchestrator receives the shared context, validates it, and dispatches bounded tasks to agents. Agents operate on task-scoped subsets of the context, not the full shared state. Results return to the orchestrator before aggregation. A defect in the shared context hits the orchestrator first — a single failure point rather than N simultaneous failures.

  1. Route all context through the orchestrator before task dispatch. Agents should receive task-scoped context prepared by the orchestrator, not raw shared state.
    Confirm: no agent has direct access to the full shared context; all context is mediated.

  2. Require results to return to the orchestrator before aggregation. Results should flow back through the orchestrator, not directly to a shared output store.
    Confirm: the orchestrator can reject or flag anomalous results before they influence downstream steps.

  3. Treat orchestrator failures as high-priority signals, not noise. In a centralized topology, the orchestrator is the error containment boundary — its failures surface defects that would otherwise be silently replicated across all agents.
    Confirm: orchestrator errors trigger investigation, not just retry.

In Practice

Google Research’s findings on multi-agent error amplification document this as a structural property of independent topologies, not a tuning problem. The 17x amplification factor is not something that can be reduced by adjusting temperature, improving prompts, or using a better base model — it follows directly from the architecture. If agents share context and operate without mutual visibility, a shared context defect will reach every agent.

The centralized orchestrator pattern outperforms independent topologies specifically because it localizes the error surface. An error in shared context is a single orchestrator failure before it becomes N simultaneous agent failures. This is the same principle as a firewall or a circuit breaker: the value is not in preventing errors from entering, but in containing them before they propagate to the full system.

The practical implication is that choosing between independent and centralized topologies is an architectural decision with reliability consequences, not just a throughput optimization. Independent topologies can be faster to implement and easier to scale horizontally — but they trade error containment for that simplicity.

Where It Breaks

Failure modeTriggerFix
Orchestrator becomes bottleneckHigh agent count with low orchestrator throughputShard orchestrators by domain — but maintain containment within each shard
Orchestrator failure propagates everywhereSingle orchestrator with no redundancyRun redundant orchestrators with state synchronization
Orchestrator passes defect to all agentsDefect in orchestrator logic, not in shared contextTest orchestrator validation logic independently from agent execution
Context mediation adds latencyOrchestrator adds a round-trip to every task dispatchBatch task dispatch; pre-validate context before dispatch starts

The centralized orchestrator pattern addresses correlated failure from shared context. It does not address orchestrator-level defects — those require their own validation layer. The architecture shifts the error surface; it does not eliminate it.

What to Do Next

  • Problem: Independent parallel agents appear to add reliability through redundancy, but a defect in shared context reaches every agent simultaneously with no correction mechanism — amplifying errors instead of canceling them.
  • Solution: Use a centralized orchestrator topology where all context flows through the orchestrator before task dispatch and all results return through it before aggregation, containing defects to a single boundary rather than replicating them fleet-wide.
  • Proof: Google Research’s multi-agent coordination work documents the 17x amplification factor as a structural property of independent topologies. The mechanism — shared context, no mutual visibility — is reproducible across different tasks and models.
  • Action: For any multi-agent system currently in design or production, draw the context flow: does shared context reach agents directly, or does it pass through an orchestrator that can validate it first? If agents receive raw shared context directly, that topology will amplify errors under any shared context defect.

The instinct to add more agents to improve reliability is sound when failures are independent. When failures are correlated — when they trace back to a single shared context, a single bad prompt, a single misconfigured tool — more agents make things worse. Reliability in multi-agent systems comes from the structure of context flow and result aggregation, not from agent count.