What is structured output in LLMs?

Structured output means the model returns data in a machine-readable format (usually JSON) rather than free-form prose. This is essential for any LLM output that code needs to parse — classification, extraction, data normalization, agent planning, and API responses all require structured output to be reliable.

What is the difference between JSON mode and structured outputs?

In OpenAI's API, JSON mode guarantees valid JSON syntax but does not enforce a specific schema. Structured Outputs (strict mode) additionally guarantee that the returned JSON matches a specific JSON Schema you provide. Claude's tool use approach achieves structured outputs by defining the desired schema as a tool, which forces the model to populate the schema fields.

Why does 'respond only in JSON' sometimes fail?

Models occasionally add prose before or after the JSON ("Here is the JSON you requested:"), produce invalid JSON syntax (trailing commas, unquoted keys), or hallucinate keys not in your schema. These failures become more common with longer schemas, complex nesting, or when the model is uncertain about a value.

Should I use Pydantic or Zod for LLM output validation?

Use Pydantic in Python pipelines and Zod in TypeScript/Node.js pipelines. Both can generate JSON Schema from your type definitions, which you can include in the prompt or pass to a native structured output API. Both handle validation errors you can feed back to the model for a retry.

How do I handle validation errors in structured output?

Catch the validation error, extract the error message, and send the model a follow-up prompt: "Your previous output failed validation with this error: [error]. Please correct the JSON and return only valid JSON." One retry usually resolves syntax errors. If it fails twice, log it and route to a fallback.

Does a shorter schema produce better JSON outputs?

Yes. The more fields in your schema, the higher the chance the model makes an error on at least one. Benchmark your actual task: often you can split a 20-field schema into two 10-field schemas called sequentially and get higher accuracy than one combined call.

Can I mix structured output with chain-of-thought reasoning?

Yes. Ask the model to reason first in a "reasoning" string field, then populate the answer fields. This gives you the accuracy benefits of chain-of-thought while still returning a parseable structure. Alternatively, make the first call free-form reasoning and the second call structured extraction from the reasoning.

Structured Output & JSON Mode Prompting Guide 2026 | explainx.ai Blog

explainx.ainewsletter3.5k

workshops ↗

Structured Output & JSON Mode Prompting Guide 2026 | explainx.ai Blog | explainx.ai

Every LLM feature that does something useful with model output eventually hits the same wall: the model returns beautiful prose, and your code needs a dictionary. You call json.loads() and get a JSONDecodeError. Or the JSON is valid but the keys are wrong. Or there is a paragraph of explanation before the opening brace.

Structured output is the discipline of making LLMs return machine-parseable data reliably enough to ship. This guide covers every approach — prompt-based, native API, schema-enforced — and shows you which to pick and how to validate what comes back.

Why Structured Output Matters

Most real LLM use cases are pipelines, not chatbots. A pipeline takes some input, runs it through a model, parses the output, and does something with the result. The pipeline breaks if the output format is unpredictable.

Consider entity extraction: you feed news articles through a model to extract company names, locations, and dates. Each extraction feeds a downstream database insert. If one article causes the model to return "No entities found in this text." instead of {"entities": []}, your pipeline crashes. Structured output is what keeps pipelines from crashing at 2am.

Beyond reliability, structured output enables:

Typed data models — parse directly into Pydantic models, eliminating manual field extraction
Agent planning — agentic systems need structured intermediate state to pass between steps
Chained prompts — output of one call becomes input to the next, so format must be predictable
Parallel processing — batch structured outputs can be parsed and stored in bulk

Three Approaches to Getting Structured Output

Approach 1: Prompt-Based JSON Instruction

The simplest approach: tell the model to return JSON in the prompt. Works with any model, any API.

snippet

System: You are a data extraction assistant. Always respond with valid JSON
and nothing else — no prose before or after the JSON.

User: Extract the company name, founding year, and CEO from this text:
"Anthropic was founded in 2021 by Dario Amodei and Daniela Amodei.
Dario Amodei serves as CEO."

Expected output:

json

{
  "company": "Anthropic",
  "founding_year": 2021,
  "ceo": "Dario Amodei"

python

from openai import OpenAI

client = OpenAI()

response = client.chat.completions.create(
    model="gpt-4o",
    response_format={"type": "json_object"},
    messages=[
        {
            "role": "system",
            "content": "Extract company info and return JSON with keys: "
                       "company, founding_year, ceo"
        },
        {
            "role": "user",
            "content": "Anthropic was founded in 2021 by Dario Amodei..."
        }
    ]
)

import json
result = json.loads(response.choices[0].message.content)

python

response = client.chat.completions.create(
    model="gpt-4o-2024-08-06",
    response_format={
        "type": "json_schema",
        "json_schema": {
            "name": "company_info",
            "strict": True,
            "schema": {
                "type": "object",
                "properties": {
                    "company": {"type": "string"},
                    "founding_year": {"type": "integer"},
                    "ceo": {"type": "string"}
                },
                "required": ["company", "founding_year", "ceo"],
                "additionalProperties": False
            }
        }
    },
    messages=[...]
)

json

{
  "type": "object",
  "properties": {
    "job_title": {
      "type": "string",
      "description": "The exact job title as listed"
    },
    "company": {
      "type": "string"
    },
    "location": {
      "type": "string",
      "description": "City, State or 'Remote'"
    },
    "employment_type": {
      "type": "string",
      "enum": ["full-time", "part-time", "contract", "internship"]
    },
    "salary_range": {
      "type": ["object", "null"],
      "properties": {
        "min": {"type": "integer"},
        "max": {"type": "integer"},
        "currency": {"type": "string", "default": "USD"}
      },
      "required": ["min", "max"]
    },
    "required_skills": {
      "type": "array",
      "items": {"type": "string"},
      "description": "List of required technical skills"
    },
    "years_experience_required": {
      "type": ["integer", "null"]
    }
  },
  "required": ["job_title", "company", "location", "employment_type",
               "required_skills"],
  "additionalProperties": false
}

python

import anthropic
import json

client = anthropic.Anthropic()

# Define the output schema as a tool
extract_tool = {
    "name": "extract_company_info",
    "description": "Extract structured company information from text",
    "input_schema": {
        "type": "object",
        "properties": {
            "company": {
                "type": "string",
                "description": "Company name"
            },
            "founding_year": {
                "type": "integer",
                "description": "Year the company was founded"
            },
            "ceo": {
                "type": "string",
                "description": "Current CEO full name"
            },
            "headquarters": {
                "type": ["string", "null"],
                "description": "City and country of headquarters, or null if not mentioned"
            }
        },
        "required": ["company", "founding_year", "ceo", "headquarters"]
    }
}

response = client.messages.create(
    model="claude-opus-4-5",
    max_tokens=1024,
    tools=[extract_tool],
    tool_choice={"type": "tool", "name": "extract_company_info"},
    messages=[
        {
            "role": "user",
            "content": "Anthropic was founded in 2021 by Dario Amodei and "
                       "Daniela Amodei. Dario serves as CEO. The company is "
                       "headquartered in San Francisco, USA."
        }
    ]
)

# Extract the structured result from the tool call
tool_use = next(
    block for block in response.content
    if block.type == "tool_use"
)
result = tool_use.input

print(result)
# {'company': 'Anthropic', 'founding_year': 2021,
#  'ceo': 'Dario Amodei', 'headquarters': 'San Francisco, USA'}

snippet

Extract information and return JSON matching this exact schema:

{
  "company": string,
  "founding_year": integer,
  "ceo": string,
  "headquarters": string or null
}

If a field is not mentioned in the text, use null.
Return ONLY the JSON object, no explanation before or after.

snippet

Extract company information from text.

Example input:
"Apple Inc. was founded in 1976 by Steve Jobs, Steve Wozniak, and Ronald Wayne.
Tim Cook is the current CEO. The company is headquartered in Cupertino, California."

Example output:
{
  "company": "Apple Inc.",
  "founding_year": 1976,
  "ceo": "Tim Cook",
  "headquarters": "Cupertino, California"
}

Now extract from this text:
[user input]

python

# Call 1: extract basic metadata
basic = extract_with_schema(doc, BASIC_SCHEMA)  # 5 fields

# Call 2: extract technical details  
technical = extract_with_schema(doc, TECHNICAL_SCHEMA)  # 8 fields

# Call 3: extract relationships
relationships = extract_with_schema(doc, RELATIONSHIP_SCHEMA)  # 6 fields

result = {**basic, **technical, **relationships}

snippet

Step 1: Extract the information and return JSON.
Step 2: Before finalizing, re-read your JSON and check: does every field have
a value or null? Is the JSON syntactically valid? Fix any issues.
Step 3: Return the final, verified JSON.

python

import re

def repair_json(raw: str) -> str:
    """Attempt basic JSON repair before parsing."""
    # Remove prose before the first { or [
    raw = re.sub(r'^[^{\[]*', '', raw.strip())
    # Remove prose after the last } or ]
    raw = re.sub(r'[^}\]]*$', '', raw.strip())
    # Replace Python-style None/True/False with JSON equivalents
    raw = raw.replace('None', 'null').replace('True', 'true').replace('False', 'false')
    return raw

python

from pydantic import BaseModel, ValidationError
from typing import Optional
import json

class CompanyInfo(BaseModel):
    company: str
    founding_year: int
    ceo: str
    headquarters: Optional[str] = None

def parse_and_validate(raw_output: str) -> CompanyInfo:
    """Parse LLM JSON output and validate against schema."""
    try:
        # Strip any prose wrapping the JSON
        json_start = raw_output.find('{')
        json_end = raw_output.rfind('}') + 1
        if json_start == -1:
            raise ValueError("No JSON object found in output")
        
        clean_json = raw_output[json_start:json_end]
        data = json.loads(clean_json)
        return CompanyInfo(**data)
    
    except json.JSONDecodeError as e:
        raise ValueError(f"Invalid JSON syntax: {e}")
    except ValidationError as e:
        raise ValueError(f"Schema validation failed: {e}")

python

def extract_with_retry(text: str, max_retries: int = 2) -> CompanyInfo:
    messages = [
        {
            "role": "user",
            "content": f"Extract company info from this text and return "
                       f"valid JSON matching this schema: "
                       f"{CompanyInfo.model_json_schema()}\n\nText: {text}"
        }
    ]
    
    for attempt in range(max_retries + 1):
        response = llm.complete(messages)
        raw_output = response.content
        
        try:
            return parse_and_validate(raw_output)
        except ValueError as e:
            if attempt == max_retries:
                raise
            
            # Feed error back to model
            messages.append({"role": "assistant", "content": raw_output})
            messages.append({
                "role": "user",
                "content": f"Your output failed validation: {e}. "
                           f"Please fix it and return only valid JSON."
            })
    
    raise RuntimeError("Unreachable")

json

{
  "type": "object",
  "properties": {
    "category": {
      "type": "string",
      "enum": ["positive", "negative", "neutral", "mixed"]
    },
    "confidence": {
      "type": "number",
      "minimum": 0,
      "maximum": 1,
      "description": "Confidence score between 0 and 1"
    },
    "reasoning": {
      "type": "string",
      "description": "One sentence explaining the classification"
    }
  },
  "required": ["category", "confidence", "reasoning"]
}

json

{
  "type": "object",
  "properties": {
    "people": {
      "type": "array",
      "items": {
        "type": "object",
        "properties": {
          "name": {"type": "string"},
          "role": {"type": ["string", "null"]},
          "organization": {"type": ["string", "null"]}
        },
        "required": ["name", "role", "organization"]
      }
    },
    "organizations": {
      "type": "array",
      "items": {"type": "string"}
    },
    "locations": {
      "type": "array",
      "items": {"type": "string"}
    },
    "dates": {
      "type": "array",
      "items": {
        "type": "object",
        "properties": {
          "text": {"type": "string"},
          "normalized": {"type": ["string", "null"],
                        "description": "ISO 8601 format if determinable"}
        },
        "required": ["text", "normalized"]
      }
    }
  },
  "required": ["people", "organizations", "locations", "dates"]
}

python

# Split into two focused calls instead of one large call
basic_info = extract_with_schema(document, schema=BASIC_INFO_SCHEMA)
financial_info = extract_with_schema(document, schema=FINANCIAL_SCHEMA)

# Merge results
result = {**basic_info, **financial_info}

json

{
  "type": "object",
  "properties": {
    "goal_understood": {
      "type": "string",
      "description": "Restate the goal in one sentence to confirm understanding"
    },
    "steps": {
      "type": "array",
      "items": {
        "type": "object",
        "properties": {
          "step_id": {"type": "integer"},
          "action": {"type": "string"},
          "tool": {"type": "string"},
          "parameters": {"type": "object"},
          "depends_on": {
            "type": "array",
            "items": {"type": "integer"},
            "description": "IDs of steps that must complete before this one"
          }
        },
        "required": ["step_id", "action", "tool", "parameters", "depends_on"]
      }
    },
    "can_parallelize": {
      "type": "boolean",
      "description": "True if any steps can run in parallel"
    }
  },
  "required": ["goal_understood", "steps", "can_parallelize"]
}

Scenario	Recommended Approach
Prototyping, any model	Prompt-based with Pydantic validation
OpenAI API, exact schema required	Structured Outputs (strict mode)
OpenAI API, valid JSON sufficient	JSON mode
Anthropic Claude	Tool use with `tool_choice` forced
Production pipeline, any model	Tool/function calling + validation + retry
Simple 2-3 field extraction	Prompt-based usually sufficient
Complex nested schemas	Native structured output APIs

Structured Output and JSON Mode Prompting: A Complete Guide for 2026

Why Structured Output Matters

Three Approaches to Getting Structured Output

Approach 1: Prompt-Based JSON Instruction

Related posts

What Is a System Prompt? The Hidden Instructions That Shape Every AI Response

Master Prompt Engineering with Claude: Complete Guide 2026

Thin Prompts, Thick Artifacts, Thin Skills: Thariq’s Claude Code Framework

Approach 2: Native JSON Mode (OpenAI)

Approach 3: Tool/Function Calling with Schemas

JSON Schema Basics for AI Prompting

Using Anthropic's Structured Output (Tool Use Pattern)

Prompting Techniques That Improve JSON Accuracy

Include the Schema in the Prompt

Show an Example Output

Use "null" Explicitly, Not Omission

Break Complex Schemas into Chains

Ask the Model to Verify Its Own Output

Debugging Structured Output Failures

Validating LLM JSON Output

Pydantic Validation (Python)

Retry Pattern with Validation Feedback

Common Structured Output Patterns

Classification with Confidence Scores

Entity Extraction

Multi-Field Document Parsing

Structured Output for Agentic Systems

Performance Tips

Choosing the Right Approach

Read next