What are the main types of AI agents?

The most useful taxonomy uses four axes: autonomy level (reactive vs goal-directed vs fully autonomous), loop architecture (ReAct, plan-and-execute, reflexion, hierarchical), deployment domain (coding, research, support, browser, workflow), and human involvement (copilot, supervised, autonomous). Most production agents combine traits from multiple categories — a coding agent is typically a deliberative ReAct agent with MCP tool access and human gates on irreversible actions.

What is the difference between a reactive agent and a deliberative agent?

A reactive agent maps the current observation directly to an action with no internal planning — fast but brittle on multi-step tasks. A deliberative agent maintains a goal, reasons about sub-steps, and chooses tools sequentially until the goal is met. Modern LLM agents (Claude Code, Devin, Cursor Agent) are deliberative: they plan implicitly through chain-of-thought and explicitly through tool calls in a loop.

When should I use a multi-agent system instead of a single agent?

Use multiple agents when tasks benefit from parallelism (independent subtasks), specialization (different prompts or tools per role), or scale (work exceeds one context window). If the task is sequential, narrow, and fits one context, a single well-tooled agent is simpler and cheaper. See the multi-agent orchestration guide for production patterns.

What type of AI agent is Claude Code?

Claude Code is a deliberative coding agent with a ReAct-style loop, terminal and file-system tool access, optional MCP servers, subagent spawning for parallel work, and human-in-the-loop gates on destructive actions. It sits in the "autonomous coding agent with supervised irreversible actions" category — high autonomy on read/test/edit, lower autonomy on git push and production deploy.

What is a plan-and-execute agent?

A plan-and-execute agent separates planning from execution: a planner model produces a step list upfront, then an executor model runs each step with tools. This reduces drift on long tasks compared to pure ReAct, but replanning is slower when the environment changes mid-task. LangGraph and CrewAI both support this pattern; Claude Code uses implicit replanning each loop iteration instead.

How do I choose the right agent type for my use case?

Start with three questions: (1) Is the task reversible? Irreversible actions need human gates regardless of agent type. (2) How many steps? Under ~10 steps, a single ReAct agent suffices; beyond ~20, consider plan-and-execute or multi-agent decomposition. (3) Does it need live data or external systems? If yes, you need tool access (MCP or API) — a RAG-only agent will not work.

Types of AI Agents: Complete Taxonomy and When to Use Each (2026) | explainx.ai Blog

explainx.ainewsletter3.5k

workshops ↗

Types of AI Agents: Complete Taxonomy and When to Use Each (2026) | explainx.ai Blog | explainx.ai

Every vendor in 2026 calls their product an "AI agent." That label now covers a chat sidebar with web search, a terminal coding assistant that runs for an hour unsupervised, a customer-support bot that opens tickets, and a fleet of twelve specialized subagents coordinating a research report. Those are not the same architecture — and picking the wrong type is the fastest way to waste tokens, miss deadlines, or ship something unsafe.

This guide gives you a practical taxonomy: the major types of AI agents, how they differ, real examples of each, and a decision matrix for choosing the right design.

If you need the foundational definition first, start with What Are AI Agents?. If you already know the basics and want the full four-layer stack (prompt → context → loop → harness), see Context vs Prompt vs Loop vs Harness Engineering.

TL;DR — agent types at a glance

Type (axis)	Subtypes	Best for	Example products
Autonomy	Reactive, deliberative, fully autonomous	Speed vs planning depth	Reactive: autocomplete; Deliberative: Claude Code; Autonomous: background schedulers
Loop architecture	ReAct, plan-and-execute, reflexion, hierarchical	Task length and error recovery	ReAct: most coding agents; Plan-execute: LangGraph workflows
Domain	Coding, research, support, browser, workflow, voice	Matching tools to task	Coding: Cursor; Research: Perplexity Deep Research; Browser: Computer Use
Agent count	Single, multi-agent (orchestrator/worker, pipeline, fan-out)	Parallelism and specialization	Single: most CLI agents; Multi: Claude Code subagents, CrewAI teams
Tool access	RAG-only, API/MCP, computer use, code execution	External system integration	MCP: Claude Desktop; Computer use: Anthropic Computer Use API

Product	Loop	Tool access	Typical autonomy
Claude Code	ReAct + subagents	Shell, files, MCP, git	High on edit; gated on push
Cursor Agent	ReAct	IDE, terminal, web	Supervised
OpenAI Codex CLI	ReAct	Shell, files	Configurable
Devin	ReAct + plan	Full dev environment	High

Pattern	Structure	Best for	Cost multiplier
Single agent	One loop, one context	Sequential tasks, under 20 steps	1x
Orchestrator/worker	Manager decomposes, workers execute	Parallel independent subtasks	2–5x
Pipeline	Agent A → Agent B → Agent C	Sequential specialization (research → draft → edit)	3x
Fan-out/fan-in	N workers in parallel, aggregator synthesizes	Search across many sources simultaneously	Nx
Debate / critique	Two agents challenge each other	High-stakes decisions, code review	2x

Level	Human role	Example
Copilot	Human initiates every action; AI suggests	Inline autocomplete, chat sidebar
Supervised	Agent proposes; human approves irreversible steps	Claude Code with permission prompts
Checkpointed	Agent runs autonomously until a gate	Approve-before-email, approve-before-deploy
Fully autonomous	Human reviews output after the fact	Scheduled report generation, log monitoring

Your answers	Recommended type
Reversible, under 10 steps, needs APIs, sequential, code	Single ReAct coding agent (Claude Code, Cursor)
Reversible, 20+ steps, needs APIs, sequential, code	Plan-and-execute or hierarchical coding agent
Reversible, parallel research, needs web	Multi-agent fan-out research system
Irreversible customer comms	Reactive support agent + human gate on every send
Repeatable business process	Workflow agent with LLM at decision nodes

Types of AI Agents: Complete Taxonomy and When to Use Each (2026)

TL;DR — agent types at a glance

Related posts

Multi-Agent Orchestration Patterns: A Production Guide (2026)

Eric Xing Critique of Agent Model: Agentic vs Agentive AI and the GIC Architecture

What Are AI Agents? The Complete Explainer for 2026

Axis 1: Autonomy level — reactive vs deliberative vs autonomous

Reactive agents

Deliberative (goal-based) agents

Fully autonomous (background) agents

Axis 2: Loop architecture — how the agent thinks step by step

ReAct (Reason + Act)

Plan-and-execute

Reflexion (self-critique)

Hierarchical agents

Axis 3: Domain — what the agent is built to do

Coding agents

Research agents

Customer support agents

Browser / computer-use agents

Workflow / automation agents

Voice agents

Axis 4: Single agent vs multi-agent

Axis 5: Tool and memory access

RAG-only agents

API / MCP agents

Code execution agents

Memory-augmented agents

Axis 6: Human involvement — the safety spectrum

Decision matrix — which agent type should you build?

How the types connect to the four-layer stack

Summary

TL;DR — agent types at a glance

Related posts

Multi-Agent Orchestration Patterns: A Production Guide (2026)

Eric Xing Critique of Agent Model: Agentic vs Agentive AI and the GIC Architecture

What Are AI Agents? The Complete Explainer for 2026

Axis 1: Autonomy level — reactive vs deliberative vs autonomous

Reactive agents

Deliberative (goal-based) agents

Fully autonomous (background) agents

Axis 2: Loop architecture — how the agent thinks step by step

ReAct (Reason + Act)

Plan-and-execute

Reflexion (self-critique)

Hierarchical agents

Axis 3: Domain — what the agent is built to do

Coding agents

Research agents

Customer support agents

Browser / computer-use agents

Workflow / automation agents

Voice agents

Axis 4: Single agent vs multi-agent

Axis 5: Tool and memory access

RAG-only agents

API / MCP agents

Code execution agents

Memory-augmented agents

Axis 6: Human involvement — the safety spectrum

Decision matrix — which agent type should you build?

How the types connect to the four-layer stack

Summary

Related reading on explainx.ai