What is ReAct prompting?

ReAct (Reasoning + Acting) is a prompting pattern introduced by Yao et al. in 2022 where a language model interleaves reasoning steps (Thought) with actions (Action) and incorporates external feedback (Observation). This loop repeats until the model reaches a final answer. It is the foundational pattern behind most production AI agents.

How is ReAct different from chain-of-thought prompting?

Chain-of-thought prompting has the model reason in a single pass before giving an answer — no external tools, no feedback loop. ReAct extends this with real-world actions: the model can call tools, search the web, run code, or read files, then incorporate the results into its next reasoning step. CoT stays inside the model; ReAct reaches out to the world.

Do I need a framework to use ReAct?

No. ReAct is a prompting pattern, not a library. You can implement it with a plain API call loop in about 50 lines of Python. Frameworks like LangChain and LangGraph implement ReAct loops for you, but understanding the pattern lets you debug and customize them effectively.

What are the most common ReAct failure modes?

The three most common failures are observation hallucination (the model invents tool outputs instead of waiting for real ones), reasoning loops (the model cycles through the same thoughts without progress), and action explosion (the model makes dozens of tool calls where two would suffice). All three can be mitigated with good system prompts and iteration limits.

How many tools should a ReAct agent have?

Start with three to five. Each additional tool increases the chance the model will pick the wrong one or chain unnecessary calls. Add tools only when you have a specific task that genuinely requires them and your eval set confirms the agent uses them correctly.
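
One way to check that a tool earns its place is a small tool-selection eval: tasks paired with the tool you expect the agent to call first, scored against what it actually chose. The sketch below assumes a hypothetical `choose_tool(task)` callable that wraps your agent and returns the name of the first tool it calls. The keyword-router stub and the three cases are made up for illustration.

```python
from collections import Counter
from typing import Callable

def tool_selection_accuracy(
    cases: list[tuple[str, str]],
    choose_tool: Callable[[str], str],
) -> tuple[float, Counter]:
    """Return overall accuracy and a count of (expected, chosen) mistakes."""
    confusions: Counter = Counter()
    correct = 0
    for task, expected in cases:
        chosen = choose_tool(task)
        if chosen == expected:
            correct += 1
        else:
            confusions[(expected, chosen)] += 1
    accuracy = correct / len(cases) if cases else 0.0
    return accuracy, confusions

# Hypothetical stand-in for a real agent: a naive keyword router.
def stub_choose_tool(task: str) -> str:
    if any(ch.isdigit() for ch in task):
        return "calculator"
    if task.startswith("http"):
        return "read_url"
    return "web_search"

CASES = [
    ("What is 17 * 23?", "calculator"),
    ("Who introduced ReAct?", "web_search"),
    ("Summarize the ReAct paper from 2022", "web_search"),
]

accuracy, confusions = tool_selection_accuracy(CASES, stub_choose_tool)
print(f"accuracy={accuracy:.2f}", dict(confusions))
```

The confusion counts are the useful part: a pair such as `("web_search", "calculator")` shows which tool is stealing calls from which, and that is usually the signal to tighten a tool description or remove a tool.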

Is ReAct the same as an agent loop?

ReAct is the most common structure for an agent loop, but not the only one. A ReAct loop specifically interleaves explicit Thought reasoning with Actions and Observations. Some agent architectures (like pure tool-calling APIs) skip the explicit reasoning step. ReAct loops tend to produce more auditable and debuggable agent behavior.

When should I NOT use ReAct?

Skip ReAct for simple single-turn tasks where the model has all the information it needs to answer directly — classification, summarization of a provided document, translation, and most formatting tasks. ReAct adds latency and cost; use it only when the task genuinely requires external information or multi-step actions.

ReAct Prompting Guide 2026: Reasoning + Acting for AI Agents | explainx.ai Blog


If you have used LangChain agents, Claude Code, or almost any production AI agent in the last two years, you have very likely been using a ReAct-style loop. You may not have known the name, but the pattern is everywhere: the model thinks out loud, takes an action, gets a result back, thinks again. That loop (Thought, Action, Observation, repeat) is ReAct.

This article explains the pattern from first principles, shows you how to write a ReAct prompt from scratch, walks through real examples, and covers the failure modes you will hit in production.

What ReAct Is

ReAct stands for Reasoning + Acting. It was introduced in the paper "ReAct: Synergizing Reasoning and Acting in Language Models" by Yao et al. in 2022. The core insight is deceptively simple: language models get better at tool use when they reason out loud before each action, not just before the final answer.

Before ReAct, tool-calling approaches gave models a list of tools and let them call them — but without any structured reasoning step in between. The model would call a search API, get results, and either answer or call another tool. This worked, but it produced brittle, opaque behavior that was hard to debug.

ReAct added an explicit reasoning step. Instead of jumping straight to a tool call, the model first writes a Thought explaining why it is making that call. This does two things: it keeps the model's reasoning grounded (harder to hallucinate when you have to explain yourself), and it makes the agent's behavior legible to you.

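The paper reported that ReAct beat chain-of-thought prompting on fact verification (FEVER) and was competitive with it on multi-hop question answering (HotpotQA), with the best results on both coming from combining ReAct with chain-of-thought. On the decision-making benchmarks ALFWorld and WebShop, it outperformed imitation-learning and reinforcement-learning baselines. Those results are a large part of why the pattern became the default architecture for production agents.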

The ReAct Loop Structure

The core loop has three steps that repeat:

┌─────────────────────────────────────────────────────────┐
│                        REACT LOOP                        │
│                                                         │
│   ┌──────────┐    ┌──────────┐    ┌───────────────┐    │
│   │  THOUGHT │───▶│  ACTION  │───▶│  OBSERVATION  │    │
│   │          │    │          │    │               │    │
│   │ Model    │    │ Tool call│    │ Tool result   │    │
│   │ reasons  │    │ or step  │    │ comes back    │    │
│   └──────────┘    └──────────┘    └───────┬───────┘    │
│        ▲                                   │            │
│        └───────────────────────────────────┘            │
│                                                         │
│   Loop repeats until model outputs: Final Answer        │
└─────────────────────────────────────────────────────────┘

Thought: The model reasons about the current state. What does it know? What does it need? What is the best next action? This step happens inside the context window — no external call yet.

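Action: The model specifies a concrete action, usually a tool call. It names the tool and provides the parameters. The orchestrating code then invokes the tool; the model itself only emits text and does not run anything directly.

Observation: The tool's result comes back and is appended to the context, where it feeds the model's next Thought.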
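
To make that division of labor concrete: the model only emits text naming a tool, and the orchestrating code looks the name up and runs it. Below is a minimal sketch, assuming the Action has already been parsed into a tool name and keyword arguments. The `TOOLS` registry, `invoke_tool`, and the toy `calculator` are hypothetical names used for illustration.

```python
from typing import Any, Callable

def calculator(expression: str) -> float:
    """Toy tool: supports only 'a + b' style addition, for illustration."""
    left, right = expression.split("+")
    return float(left) + float(right)

# Registry mapping tool names the model may emit to real Python callables.
TOOLS: dict[str, Callable[..., Any]] = {"calculator": calculator}

def invoke_tool(name: str, arguments: dict[str, Any]) -> str:
    """Run a tool on the model's behalf and return the Observation text."""
    tool = TOOLS.get(name)
    if tool is None:
        return f"Error: unknown tool '{name}'. Available: {sorted(TOOLS)}"
    try:
        return str(tool(**arguments))
    except Exception as exc:  # surface failures to the model as text
        return f"Error: {type(exc).__name__}: {exc}"

print(invoke_tool("calculator", {"expression": "2 + 3"}))
print(invoke_tool("get_weather", {"city": "Paris"}))
```

Returning errors as Observation text rather than raising keeps the loop alive and gives the model a chance to correct itself on the next Thought.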

ReAct vs Chain-of-Thought

Aspect               Chain-of-Thought          ReAct
Reasoning steps      Yes                       Yes
External tool calls  No                        Yes
External feedback    No                        Yes (Observations)
Number of passes     Single                    Multiple
Best for             Self-contained problems   Tasks requiring external info
Latency              Low                       Higher (multiple API calls)
Cost                 Lower                     Higher
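
The cost difference compounds because of how a loop is usually driven. Assuming a stateless chat API where every call resends the full transcript so far, input tokens grow with each iteration. The back-of-the-envelope sketch below uses made-up token counts, and `total_input_tokens` is a hypothetical helper.

```python
def total_input_tokens(prompt_tokens: int, step_tokens: int, iterations: int) -> int:
    """Input tokens sent across a loop that resends the full transcript.

    prompt_tokens: system prompt plus user question.
    step_tokens:   tokens added per iteration (Thought + Action + Observation).
    iterations:    number of model calls.
    """
    # Call k (0-based) sees the prompt plus the k steps that came before it.
    return sum(prompt_tokens + k * step_tokens for k in range(iterations))

single_pass = total_input_tokens(prompt_tokens=500, step_tokens=300, iterations=1)
react_run = total_input_tokens(prompt_tokens=500, step_tokens=300, iterations=5)
print(single_pass, react_run)  # 500 5500
```

Under these assumptions a five-iteration run sends eleven times the input tokens of a single pass, which is why iteration limits matter for cost as well as for correctness.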

A short ReAct trace:

User: What is the stock price of NVIDIA and why did it move today?

Thought: The user wants two pieces of information: the current price and
the reason for today's movement. I should get the price first, then search
for news about NVIDIA today.

Action: get_stock_price("NVDA")
Observation: {"price": 142.50, "change": "+4.2%", "volume": "high"}

Thought: NVIDIA is up 4.2% on high volume. I need to find out why.
Let me search for recent NVIDIA news.

Action: web_search("NVIDIA stock news today June 2026")
Observation: "NVIDIA announced new AI chip partnership with [company]..."

Thought: I now have both the price and the reason for the movement.
I can give a complete answer.

Final Answer: NVIDIA is trading at $142.50, up 4.2% today on high volume.
The move is driven by an announcement of a new AI chip partnership...

System Prompt Structure

You are an AI assistant that can use tools to answer questions.

You have access to the following tools:
- web_search(query: str) -> str: Search the web and return relevant results
- read_url(url: str) -> str: Read the content of a webpage
- calculator(expression: str) -> float: Evaluate a math expression

Use the following format EXACTLY:

Thought: [Your reasoning about what to do next]
Action: [tool_name(parameters)]
Observation: [The result of the action — this will be filled in for you]

Repeat Thought/Action/Observation as many times as needed.
When you have enough information to answer, write:
Final Answer: [Your complete answer to the user's question]

Rules:
- Always write a Thought before every Action
- Never make up an Observation — wait for the real result
- Stop as soon as you have enough information
- If a tool returns an error, try a different approach
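
The tool list in a prompt like the one above tends to drift from the code when it is maintained by hand. One option is to generate it from function signatures and docstrings with the standard-library `inspect` module. The sketch below uses hypothetical stub tools, and `render_tool_list` is an illustrative name rather than a library function.

```python
import inspect

def web_search(query: str) -> str:
    """Search the web and return relevant results"""
    raise NotImplementedError  # stub: only the signature and docstring matter here

def calculator(expression: str) -> float:
    """Evaluate a math expression"""
    raise NotImplementedError  # stub

def render_tool_list(tools) -> str:
    """Format each tool as '- name(params) -> return_type: description'."""
    lines = []
    for tool in tools:
        signature = inspect.signature(tool)
        description = inspect.getdoc(tool) or "No description"
        lines.append(f"- {tool.__name__}{signature}: {description}")
    return "\n".join(lines)

print(render_tool_list([web_search, calculator]))
```

The rendered lines can be spliced into the "You have access to the following tools" section, so adding or renaming a tool in code updates the prompt automatically.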

Example 1: Web Research Task

Thought: I need two things: the current NVIDIA stock price and today's
news explaining the price movement. Let me start with the price.

Action: get_stock_price("NVDA")
Observation: {"ticker": "NVDA", "price": 142.50, "change_pct": 4.2,
              "change_abs": 5.75, "volume": 95000000}

Thought: NVIDIA is up 4.2% on volume of 95M shares. I need to find
out what drove this. Let me search for NVIDIA news from today.

Action: web_search("NVIDIA stock news today 2026-06-27")
Observation: "NVIDIA announces expanded partnership with Microsoft Azure
for next-gen AI training clusters. Deal estimated at $2B annually.
CEO Jensen Huang presented at Microsoft Build."

Thought: I have everything I need. The price is $142.50, up 4.2%, driven
by the Microsoft Azure partnership announcement.

Final Answer: NVIDIA (NVDA) is trading at $142.50, up 4.2% ($5.75) today
on volume of 95 million shares. The move is driven by
NVIDIA's announcement of an expanded partnership with Microsoft Azure
for next-generation AI training infrastructure, reported to be worth
approximately $2 billion annually. CEO Jensen Huang presented the deal
at Microsoft Build.

Example 2: Code Debugging Task

Thought: I need to analyze this function for bugs. It calculates an
average by summing all numbers and dividing by count. The obvious
failure case is an empty list — dividing by zero. Let me verify.

Action: run_code("calculate_average([])")
Observation: ZeroDivisionError: division by zero

Thought: Confirmed — the function crashes on empty input. I should also
check if it handles non-numeric input.

Action: run_code("calculate_average([1, 'two', 3])")
Observation: TypeError: unsupported operand type(s) for +=: 'int' and 'str'

Thought: Two bugs: empty list causes ZeroDivisionError, and non-numeric
values cause TypeError. The fix should handle both. For the empty list,
return None or raise a descriptive error. For non-numeric values, either
filter them or raise a TypeError with a clear message.

Final Answer: The function has two bugs: it raises ZeroDivisionError on
an empty list and an unhelpful TypeError on non-numeric values. Here is
the fixed version:

def calculate_average(numbers):
    if not numbers:
        raise ValueError("Cannot calculate average of empty list")
    if not all(isinstance(n, (int, float)) for n in numbers):
        raise TypeError("All elements must be numeric")
    return sum(numbers) / len(numbers)
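
Observations such as the `ZeroDivisionError` line in this trace are plain strings: the orchestrating code has to catch the exception and format it before the model sees anything. Below is a toy sketch of a `run_code` tool built on `eval`. It is not sandboxed, and a real agent would need an isolated process or container. The buggy `calculate_average` is a reconstruction, since the original function is not shown in the trace.

```python
def calculate_average(numbers):
    """Reconstructed buggy version (assumption: the original is not shown)."""
    total = 0
    for n in numbers:
        total += n
    return total / len(numbers)

def run_code(expression: str, namespace: dict) -> str:
    """Evaluate one expression and return its result or error as text.

    Toy only: calling eval() on model output is unsafe outside a sandbox.
    """
    try:
        return repr(eval(expression, dict(namespace)))
    except Exception as exc:
        # The model sees the same "ErrorType: message" text a developer would.
        return f"{type(exc).__name__}: {exc}"

ns = {"calculate_average": calculate_average}
print(run_code("calculate_average([])", ns))
print(run_code("calculate_average([1, 'two', 3])", ns))
print(run_code("calculate_average([1, 2, 3])", ns))
```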

Example 3: Multi-Step Task

Thought: I need to do this in order: (1) get tomorrow's calendar,
(2) find a free 30-minute slot in the afternoon, (3) draft the invite.

Action: get_calendar_events("2026-06-28")
Observation: [
  {"start": "09:00", "end": "10:00", "title": "Standup"},
  {"start": "11:00", "end": "12:00", "title": "Design review"},
  {"start": "14:00", "end": "15:00", "title": "1:1 with manager"}
]

Thought: Tomorrow I have meetings at 9-10am, 11am-12pm, and 2-3pm.
Free afternoon windows: 12:00-2:00pm and 3:00-5:00pm (assuming the workday
ends at 5pm). Keeping 12:00-1:00pm clear for lunch, the earliest 30-minute
slot is 1:00-1:30pm.

Action: draft_calendar_invite({
  "title": "Team Sync",
  "date": "2026-06-28",
  "start": "13:00",
  "end": "13:30",
  "description": "Weekly team sync to align on priorities and blockers."
})
Observation: {"draft_id": "inv_abc123", "status": "drafted"}

Final Answer: Found a free slot at 1:00-1:30pm tomorrow (June 28).
I have drafted a meeting invite titled "Team Sync" for that time with
description "Weekly team sync to align on priorities and blockers."
Draft ID: inv_abc123. Ready to send when you confirm.
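
Interval arithmetic like the slot search in this trace is easy for a model to get wrong in free text, so it can be worth moving into a deterministic helper that the agent calls as a tool. The sketch below assumes events carry zero-padded "HH:MM" start and end times, as in the observation above. The function name `find_free_slots` and the 12:00 to 17:00 window are assumptions.

```python
from datetime import datetime, timedelta

def find_free_slots(events, window_start, window_end, minutes):
    """Return (start, end) 'HH:MM' gaps of at least `minutes` inside the window."""
    fmt = "%H:%M"

    def parse(value):
        return datetime.strptime(value, fmt)

    need = timedelta(minutes=minutes)
    cursor, end = parse(window_start), parse(window_end)
    free = []
    # Zero-padded HH:MM strings sort chronologically.
    for event in sorted(events, key=lambda e: e["start"]):
        ev_start, ev_end = parse(event["start"]), parse(event["end"])
        if ev_end <= cursor:
            continue  # event is entirely before the point we have reached
        if ev_start >= end:
            break  # event is entirely after the window
        if ev_start - cursor >= need:
            free.append((cursor.strftime(fmt), ev_start.strftime(fmt)))
        cursor = max(cursor, ev_end)
    if end - cursor >= need:
        free.append((cursor.strftime(fmt), end.strftime(fmt)))
    return free

events = [
    {"start": "09:00", "end": "10:00", "title": "Standup"},
    {"start": "11:00", "end": "12:00", "title": "Design review"},
    {"start": "14:00", "end": "15:00", "title": "1:1 with manager"},
]
print(find_free_slots(events, "12:00", "17:00", minutes=30))
```

With a helper like this, the Thought only has to pick among the returned gaps instead of computing them.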

Reasoning Loops

Thought: I need to find X. Let me search.
Action: web_search("X")
Observation: No results found.
Thought: I need to find X. Let me search again.
Action: web_search("X")
Observation: No results found.
[repeats]
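
A loop like this can be caught mechanically by remembering which Actions have already been issued and replacing the tool result with a corrective Observation on a repeat. The `LoopGuard` class below is a hypothetical sketch; normalizing case and whitespace before comparing is an assumption about what should count as the same action.

```python
from typing import Optional

class LoopGuard:
    """Flags an Action once it has been issued more than `max_repeats` times."""

    def __init__(self, max_repeats: int = 1):
        self.max_repeats = max_repeats
        self.counts: dict[str, int] = {}

    def check(self, action: str) -> Optional[str]:
        """Return a corrective Observation if the action is a repeat, else None."""
        key = " ".join(action.lower().split())
        self.counts[key] = self.counts.get(key, 0) + 1
        if self.counts[key] > self.max_repeats:
            return (
                f"You already ran {action} and it did not help. "
                "Try a different query or tool, or give a Final Answer "
                "stating what could not be found."
            )
        return None

guard = LoopGuard()
for action in ['web_search("X")', 'web_search("X")']:
    warning = guard.check(action)
    print(action, "->", warning or "ok")
```

Feeding the warning back as the Observation gives the model a reason to change course, whereas an iteration limit alone only stops the run.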

A minimal ReAct loop in Python, with an iteration limit:

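import re

# Matches a single-line action such as: Action: web_search("query")
ACTION_RE = re.compile(r"Action:\s*(\w+)\((.*)\)")

def react_loop(llm, system_prompt, user_query, tools, max_iterations=10):
    """Run Thought/Action/Observation turns until a Final Answer appears.

    Assumes `llm.complete(messages)` returns an object with a `.content`
    string, and that every tool takes one string argument.
    """
    messages = [
        {"role": "system", "content": system_prompt},
        {"role": "user", "content": user_query},
    ]

    for _ in range(max_iterations):
        content = llm.complete(messages).content
        messages.append({"role": "assistant", "content": content})

        if "Final Answer:" in content:
            return content.split("Final Answer:", 1)[1].strip()

        match = ACTION_RE.search(content)
        if match is None or match.group(1) not in tools:
            # Tell the model what went wrong instead of crashing the loop.
            observation = "Error: no valid Action found. Use one of the listed tools."
        else:
            argument = match.group(2).strip().strip("\"'")
            try:
                observation = tools[match.group(1)](argument)
            except Exception as exc:  # feed tool failures back as text
                observation = f"Error: {exc}"

        messages.append({"role": "user", "content": f"Observation: {observation}"})

    return "Max iterations reached without a final answer."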
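
One gap in a loop like this is that nothing stops the model from writing its own `Observation:` line and carrying on to a Final Answer. A common mitigation is to cut each completion at the first Observation marker before parsing it. Many chat APIs also accept stop sequences, which achieve the same thing on the server side. The helper name `truncate_at_observation` and the sample completion below are hypothetical.

```python
def truncate_at_observation(completion: str, marker: str = "Observation:") -> str:
    """Keep only the text the model is allowed to write: up to the first marker."""
    head, _, _ = completion.partition(marker)
    return head.rstrip()

raw = (
    "Thought: I need the price.\n"
    'Action: get_stock_price("NVDA")\n'
    'Observation: {"price": 999}\n'  # invented by the model, not by a tool
    "Final Answer: NVDA is at $999."
)
clean = truncate_at_observation(raw)
print(clean)
```

The truncation has to run before the check for "Final Answer:", otherwise an answer built on an invented observation would be returned as if it were real.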
