What happened to OpenAI's fine-tuning API?

On May 7, 2026, OpenAI announced it was winding down its fine-tuning API and platform. Existing customers have until January 6, 2027 to create new training jobs. After that date, the fine-tuning platform will no longer accept new training jobs. OpenAI has not publicly announced a direct replacement service.

How much did GPT-5.5 API prices increase compared to GPT-5.4?

GPT-5.5 launched on April 23, 2026 at $5 per million input tokens and $30 per million output tokens. GPT-5.4 had launched just six weeks earlier at approximately half those rates. The GPT-5.5 Pro tier carries even higher pricing at $30 per million input tokens and $180 per million output tokens. OpenRouter's analysis found real-world cost increases of 49-92% depending on prompt length, despite GPT-5.5 generating 19-34% fewer output tokens on longer prompts.

What is GPT-Realtime-2 and how is it priced?

GPT-Realtime-2 is a voice intelligence model launched by OpenAI on May 7, 2026, with GPT-5-class reasoning capabilities. It's billed by token consumption through the Realtime API. Developers including Zillow (26-point improvement in call success rates) and BolnaAI (12.5% word error rate reduction) participated in early testing.

How does GitHub Copilot's new billing model affect Claude Opus 4.7 users?

GitHub Copilot is migrating to token-based billing effective June 1, 2026. The billing multiplier for Claude Opus 4.7 is jumping from 7.5x to 27x—a dramatic increase for developers who use Copilot with Anthropic's flagship model. This is part of a broader industry shift where AI infrastructure costs are being passed through more directly to end users.

What are cheaper alternatives to OpenAI's GPT-5.5 for developers?

DeepSeek announced on April 26, 2026 that it would slash prices for cached API inputs to approximately $0.14 per million tokens—making it dramatically cheaper than GPT-5.5 for high-volume workloads. Open-source alternatives continue to gain traction among cost-conscious teams. Anthropic's Claude Sonnet 4.6 at $3 input / $15 output per million tokens is also priced below GPT-5.5's rates.

Is the fine-tuning API deprecation permanent?

OpenAI's announcement gives customers until January 6, 2027 to create new training jobs. OpenAI has not publicly announced a replacement service. Teams that rely on fine-tuning for performance or customization will need to migrate to prompt engineering, retrieval-augmented generation (RAG), or evaluate alternatives from Anthropic, Google, or open-source providers before the deadline.

← Back to blog

explainx / blog

OpenAI Winds Down Fine-Tuning API: GPT-5.5 Pricing, Cost Hikes, and What Developers Should Do

OpenAI deprecated its fine-tuning API in May 2026, doubled GPT-5.5 API prices to $5/$30 per million tokens, and reshaped developer economics with compounding changes including GitHub Copilot token billing and the GPT-Realtime-2 launch. Here's what changed and how to respond.

May 10, 2026·8 min read·Yash Thakker

OpenAIGPT-5.5API PricingFine-tuningDeveloper ToolsAI Economics

In the first week of May 2026, OpenAI made a series of moves that are cumulatively reshaping the economics of building on frontier AI—none dramatic on its own, but compounding into a structural repricing of the developer ecosystem.

The changes include:

Fine-tuning API deprecated, with a January 6, 2027 deadline for new training jobs
GPT-5.5 pricing at double GPT-5.4 rates, with real-world cost increases of 49–92%
GPT-Realtime-2 launched for voice intelligence, billed per token via the Realtime API
GitHub Copilot shifting to token-based billing, with Claude Opus 4.7 multipliers jumping from 7.5x to 27x

Taken together, one industry analysis described the effect as: "2026 definitively closes the era of AI as a 'stable-rate API' and opens that of AI as an architectural component to be designed with the same care given to a multi-region database."

GPT-5.5: the price that changed the math

The most consequential single move came on April 23, when OpenAI launched GPT-5.5 at $5 per million input tokens and $30 per million output tokens—double the rates of GPT-5.4, which launched just six weeks earlier.

A GPT-5.5 Pro tier carries even steeper pricing: $30 per million input tokens and $180 per million output tokens.

OpenAI co-founder Greg Brockman positioned the model as "a new class of intelligence built specifically for real work and for powering agents"—but developers moved quickly to calculate what the capability gains actually cost.

OpenRouter's analysis of real-world workloads found:

Short (<2,000 tokens)	~92% increase	Full price doubling absorbed
Medium	~60-75% increase	Fewer output tokens partially offset input cost
Long	~49% increase	19-34% fewer output tokens on longer prompts

Fine-tuning use case	Alternative approach
Style/tone consistency	System prompt engineering with examples; few-shot prompting
Domain-specific knowledge	Retrieval-augmented generation (RAG); knowledge bases
Output format control	Structured outputs (JSON mode, tool use); prompt templates
Reducing token costs on repeated tasks	Prompt caching (Anthropic's implementation gives up to 90% discount on cached prompts)
Specialized task performance	Evaluate Claude Sonnet 4.6, Gemini 3.1 Pro, or open-weight alternatives

Model	Input ($/M tokens)	Output ($/M tokens)
GPT-5.5	$5.00	$30.00
GPT-5.5 Pro	$30.00	$180.00
Claude Sonnet 4.6	$3.00	$15.00
Claude Haiku 4.5	$1.00	$5.00
DeepSeek (cached input)	~$0.14	varies

OpenAI Winds Down Fine-Tuning API: GPT-5.5 Pricing, Cost Hikes, and What Developers Should Do

GPT-5.5: the price that changed the math

Related posts

OpenAI Codex Micro: $230 Work Louder Keyboard for Agent Dashboards

ChatGPT Work vs Codex: What Actually Changes (Help Center + r/codex Explained)

OpenAI Codex Plugin for Claude Code — Setup, Commands, and Who Benefits

Fine-tuning API: wind-down timeline

What fine-tuning was used for and what to replace it with

GPT-Realtime-2: voice at model-grade reasoning

GitHub Copilot: token billing with steep multipliers

The competitive response: DeepSeek and the open-weight shift

The wider picture: a week of industry-wide repricing

What this means for developers in practice

The agent economics shift

GPT-5.5: the price that changed the math

Related posts

OpenAI Codex Micro: $230 Work Louder Keyboard for Agent Dashboards

ChatGPT Work vs Codex: What Actually Changes (Help Center + r/codex Explained)

OpenAI Codex Plugin for Claude Code — Setup, Commands, and Who Benefits

Fine-tuning API: wind-down timeline

What fine-tuning was used for and what to replace it with

GPT-Realtime-2: voice at model-grade reasoning

GitHub Copilot: token billing with steep multipliers

The competitive response: DeepSeek and the open-weight shift

The wider picture: a week of industry-wide repricing

What this means for developers in practice

The agent economics shift

Related reading on explainx.ai