Agent Autonomy Ladder: Manual, Confirm, Auto-Approve, Supervised

Autonomy is not a switch; it is a ladder with different rungs for read, draft, approve, execute, and recover.

Situation

Teams adopting coding agents quickly discover that full manual control wastes the agent’s value, while full auto-approval is irresponsible for production infrastructure. Database and cloud work makes the boundary sharper because the same agent that reads a schema can also generate a migration or edit IAM.

The pattern matters for database, cloud, and platform teams because agents do not operate in a vacuum. They inherit repository rules, tool permissions, deployment workflows, incident history, and the quality of the evidence available to them.

Operating layer	Default approach	Better alternative
Context	Rely on a long prompt or chat history	Give the agent task-specific evidence and rules
Tooling	Expose broad tools and inspect later	Expose narrow tools with clear approval boundaries
Verification	Read the final answer	Check the artifact, trace, and final state

The Problem

Without an autonomy model, every task becomes an argument. One engineer lets the agent apply changes freely. Another blocks every shell command. The organization ends up with inconsistent risk handling instead of a repeatable operating model.

The practical question is not whether an agent can produce a convincing response. The question is whether the engineering system around that response makes the work observable, reversible, and reviewable.

Failure point	What breaks	Why it matters
Weak boundary	Agent authority is broader than the task	A diagnostic run can become an unsafe change
Missing evidence	The agent cannot cite the state it used	Review becomes opinion instead of verification
No lifecycle	The workflow ends at a message	Ownership, audit, cleanup, and rollback disappear

Autonomy Ladder

Use four modes: manual for exploration, confirm for draft changes, auto-approve for reversible low-risk reads, and supervised execution for bounded production actions with audit trails.

flowchart TD
    A[task request — bounded intent] --> B[autonomy ladder — controls]
    B --> C[tool execution — evidence collected]
    C --> D[verification — final state checked]
    D --> E[human handoff — audit retained]

Define the operating boundary: Write down the task class, allowed tools, environment, data class, and approval mode before the agent runs.
Shape the evidence: Return compact observations instead of raw dumps. The agent should see enough to reason, but not so much that context is wasted.
Require proof of completion: Completion should be an artifact or state check, such as a passing test, a reviewed plan, a valid rollback, a trace, or a linked ticket.
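The three steps above can be sketched as plain data structures. This is a minimal illustration, not a prescribed schema: the class names, field values, proof kinds, and the `compact_rows` helper are all assumptions made for the example.

```python
from dataclasses import dataclass

# The four approval modes named in this post.
APPROVAL_MODES = ("manual", "confirm", "auto-approve", "supervised")
# Hypothetical labels for the kinds of completion proof listed above.
PROOF_KINDS = ("passing_test", "reviewed_plan", "rollback", "trace", "ticket")


@dataclass(frozen=True)
class OperatingBoundary:
    """Step 1: written down before the agent runs."""
    task_class: str
    allowed_tools: frozenset
    environment: str
    data_class: str
    approval_mode: str

    def __post_init__(self):
        if self.approval_mode not in APPROVAL_MODES:
            raise ValueError(f"unknown approval mode: {self.approval_mode}")

    def permits(self, tool: str) -> bool:
        # Any tool not listed is outside the boundary for this task.
        return tool in self.allowed_tools


def compact_rows(rows, limit=5):
    """Step 2: return a row count and a small sample instead of a raw dump."""
    return {"row_count": len(rows), "sample": rows[:limit]}


@dataclass(frozen=True)
class CompletionProof:
    """Step 3: completion is an artifact or state check, not a message."""
    kind: str       # one of PROOF_KINDS
    reference: str  # hypothetical pointer: a run ID, URL, or ticket key


def is_complete(proofs) -> bool:
    # Done only if at least one recognized proof points at something.
    return any(p.kind in PROOF_KINDS and p.reference.strip() != "" for p in proofs)


boundary = OperatingBoundary(
    task_class="schema-diagnosis",
    allowed_tools=frozenset({"replica_query", "explain_plan"}),
    environment="staging",
    data_class="internal",
    approval_mode="auto-approve",
)
```

With this boundary, `boundary.permits("apply_migration")` is false, so a diagnostic task cannot drift into a change without someone rewriting the boundary first.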

Map each tool and workflow to a rung. Read-only replica queries may auto-approve. Migration PR creation may require confirm. Production DDL should require supervised execution with explicit rollback.
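One way to express that mapping is a lookup keyed by tool and environment. The sketch below uses hypothetical tool names (`replica_query`, `create_migration_pr`, `apply_ddl`) and assumes that an unmapped tool falls back to manual, the most restrictive rung. That fail-closed default is a choice made for the example, not something the pattern dictates.

```python
from enum import Enum
from typing import Optional


class Rung(Enum):
    MANUAL = "manual"
    CONFIRM = "confirm"
    AUTO_APPROVE = "auto-approve"
    SUPERVISED = "supervised"


# Hypothetical tool names; the rung assignments follow the examples in the text.
RUNGS = {
    ("replica_query", "production"): Rung.AUTO_APPROVE,
    ("create_migration_pr", "production"): Rung.CONFIRM,
    ("apply_ddl", "production"): Rung.SUPERVISED,
}


def rung_for(tool: str, environment: str) -> Rung:
    # Assumption: fail closed, so anything unmapped is treated as manual.
    return RUNGS.get((tool, environment), Rung.MANUAL)


def may_run(tool: str, environment: str, *, human_approved: bool = False,
            rollback_plan: Optional[str] = None) -> bool:
    """Decide whether the agent itself may execute the tool call."""
    rung = rung_for(tool, environment)
    if rung is Rung.AUTO_APPROVE:
        return True
    if rung is Rung.CONFIRM:
        return human_approved
    if rung is Rung.SUPERVISED:
        # Supervised execution needs a human and an explicit rollback.
        return human_approved and bool(rollback_plan)
    return False  # MANUAL: the agent proposes, a human executes
```

Because the decision depends only on the tool, the environment, and the recorded approval, a reviewer can replay it later without reading the agent's transcript.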

In Practice

Context: Anthropic’s autonomy reporting frames agent behavior in terms of how much work proceeds without human intervention and where users interrupt or approve. That framing is useful for infrastructure because approvals should depend on blast radius. Source: Anthropic, Measuring AI agent autonomy in practice.

Action: Map each tool and workflow to a rung. Read-only replica queries may auto-approve. Migration PR creation may require confirm. Production DDL should require supervised execution with explicit rollback.

Result: When the rung is attached to the tool, reviewers can inspect whether the agent had the correct authority before judging the result.

Learning: Use four modes: manual for exploration, confirm for draft changes, auto-approve for reversible low-risk reads, and supervised execution for bounded production actions with audit trails. This pattern follows from documented behavior of the named systems; it is not a production case study.

Where It Breaks

Failure mode	Trigger	Fix
One-size autonomy	All commands require approval or none do	Assign autonomy by tool and environment
Approval fatigue	Humans approve low-risk read commands repeatedly	Auto-approve bounded read-only actions
Silent write path	Draft task receives write credentials	Separate read, draft, and execute modes
No interrupt path	Long-running task cannot be stopped safely	Require cancellation and state checkpointing
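The last row, cancellation with state checkpointing, can be illustrated with a small step runner. This is a sketch under simple assumptions: steps are plain callables, the checkpoint is a local JSON file, and cancellation is only honored between steps. The `run_steps` name and the checkpoint format are invented for the example.

```python
import json
import threading
from pathlib import Path


def run_steps(steps, checkpoint: Path, cancel: threading.Event) -> int:
    """Run steps in order, resuming from the checkpoint.

    Returns the number of steps completed so far.
    """
    done = json.loads(checkpoint.read_text())["done"] if checkpoint.exists() else 0
    for index in range(done, len(steps)):
        if cancel.is_set():
            break  # stop between steps, never in the middle of one
        steps[index]()
        done = index + 1
        # Record progress after every step so a later run can resume here.
        checkpoint.write_text(json.dumps({"done": done}))
    return done
```

A supervisor sets the `cancel` event to stop the task. Calling `run_steps` again with the same checkpoint file resumes at the first unfinished step instead of repeating completed work.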

What to Do Next

Problem: Without an autonomy model, every task becomes an argument. One engineer lets the agent apply changes freely. Another blocks every shell command. The organization ends up with inconsistent risk handling instead of a repeatable operating model.
Solution: Use four modes: manual for exploration, confirm for draft changes, auto-approve for reversible low-risk reads, and supervised execution for bounded production actions with audit trails.
Proof: When the rung is attached to the tool, reviewers can inspect whether the agent had the correct authority before judging the result.
Action: Inventory agent tools and label each one manual, confirm, auto-approve, or supervised for dev, staging, and production.
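A first pass at that inventory can be a small coverage check that reports every tool and environment pair without a valid label. The inventory contents and the `inventory_gaps` helper below are hypothetical; a real inventory would come from whatever registry holds the agent's tool definitions.

```python
ENVIRONMENTS = ("dev", "staging", "production")
LABELS = {"manual", "confirm", "auto-approve", "supervised"}


def inventory_gaps(inventory):
    """List every tool/environment pair with a missing or unknown label."""
    gaps = []
    for tool in sorted(inventory):
        for env in ENVIRONMENTS:
            label = inventory[tool].get(env)
            if label not in LABELS:
                gaps.append(f"{tool}/{env}: {label or 'unlabeled'}")
    return gaps


# Hypothetical inventory: apply_ddl has no production label yet.
inventory = {
    "replica_query": {"dev": "auto-approve", "staging": "auto-approve", "production": "auto-approve"},
    "apply_ddl": {"dev": "confirm", "staging": "supervised"},
}

print(inventory_gaps(inventory))  # ['apply_ddl/production: unlabeled']
```

Running a check like this in CI would keep a newly added tool from reaching production without a rung.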

The teams that get value from agents will not be the teams with the longest prompts. They will be the teams that turn agent work into a controlled engineering workflow.

Rajiv
