GPT-4o is gone. As of March 31, 2026, OpenAI shut down GPT-4o API access entirely, and the final Enterprise and Business Custom GPT access terminates on April 3, 2026. That is today. If your production code is still calling `gpt-4o-2024-08-06` or `chatgpt-4o-latest`, those requests are returning 404 errors as of this writing. Here is everything you need to do right now, and what you need to know going forward.
The Retirement Timeline
OpenAI rolled out the GPT-4o sunset in phases to give developers and businesses time to migrate:
- February 13, 2026: GPT-4o removed from ChatGPT for Free, Plus, and Pro users. GPT-5.2 becomes the new default across all consumer plans.
- February 16, 2026: The `chatgpt-4o-latest` ChatGPT API endpoint is deprecated with a hard cutoff. Calls begin returning errors.
- March 31, 2026: Full GPT-4o API retirement. Model versions `gpt-4o-2024-05-13` and `gpt-4o-2024-08-06` return 404. Azure OpenAI Service deployments also sunset on this date.
- April 3, 2026 (today): Final access ends. Business, Enterprise, and Edu customers lose GPT-4o access within Custom GPTs. After this date, no user on any OpenAI plan has access to GPT-4o in any form.
- August 26, 2026: Assistants API endpoints built on GPT-4o stop functioning entirely. All Threads, Runs, and Vector Store integrations will cease to work on this date.
Why OpenAI Retired Its Most Beloved Model
The numbers make the case plainly: only 0.1% of ChatGPT users were still actively choosing GPT-4o each day when OpenAI announced the retirement. The vast majority had already migrated to GPT-5.2 on their own, and the infrastructure cost of maintaining a parallel model architecture for a tiny fraction of users no longer made business sense.
There was also a counterintuitive pricing factor: despite being the older model, GPT-4o cost more relative to the value it delivered than GPT-5.1 did. With GPT-5.1 and GPT-5.2 offering superior performance at comparable or lower pricing, the financial incentive to stay on GPT-4o had largely disappeared for most production use cases.
Fine-tuned GPT-4o deployments received a one-year grace period from the retirement announcement date, giving teams with custom-trained models additional runway before needing to retrain on a newer base model. If you have fine-tuned models in production, verify your grace period deadline in the OpenAI platform dashboard.
The #Keep4o Backlash: What Happened and What Changed
OpenAI’s path to retiring GPT-4o was not smooth. In August 2025, when OpenAI first attempted to replace GPT-4o as the default ChatGPT model, it triggered a genuine user revolt. The #Keep4o hashtag trended across social media for days, and thousands of users organized to demand the model be restored as the primary option.
The backlash succeeded. OpenAI reversed course, restored GPT-4o as the default for Plus and Pro users, and publicly cited clear user feedback. The attachment was not purely utilitarian — many users had developed what researchers described as quasi-social connections with GPT-4o’s distinctive conversational warmth and personality, qualities that felt meaningfully different from what came before.
OpenAI took the feedback seriously. According to their announcement, preferences expressed during the #Keep4o episode directly shaped the personality design of GPT-5.1 and GPT-5.2, with intentional improvements to warmth, conversational continuity, and support for creative ideation. The retirement only proceeded once usage data confirmed that the vast majority of users had voluntarily transitioned to the newer models.
The Current OpenAI Model Landscape
OpenAI’s model lineup in April 2026 spans five active tiers, each designed for a different cost-performance tradeoff:
| Model | Best For | Status | Approx. Pricing (per million tokens) |
|---|---|---|---|
| GPT-5.2 | General purpose — the new default | Active | ~$8 in / ~$20 out |
| GPT-5.4 | Complex reasoning, long documents | Active | ~$15 in / ~$40 out |
| GPT-5.4 Thinking | Multi-step reasoning, math, code | Active | ~$20 in / ~$60 out |
| GPT-5.4 mini | High-volume, cost-sensitive tasks | Active | ~$0.40 in / ~$1.60 out |
| GPT-5.4 nano | Ultra-fast classification and extraction | Active | ~$0.10 in / ~$0.40 out |
| GPT-4o | — | Retired March 31 | — |
For most applications that were using GPT-4o for general tasks, GPT-5.2 is the natural replacement. It costs less per token than GPT-4o at peak pricing, delivers stronger output quality, and already powers the majority of active ChatGPT sessions globally. According to our analysis, teams that migrated to GPT-5.2 for general-purpose workloads saw output quality improve without any prompt changes in roughly 70% of cases.
Which Model Should You Migrate To?
Not all GPT-4o use cases should migrate to the same replacement. Here is a practical decision framework based on workload type:
- General chat, summarization, Q&A, customer support: Migrate to `gpt-5.2`. This is the direct drop-in replacement: better performance at lower cost with minimal architectural changes needed.
- Complex analysis, long documents, multi-document reasoning: Evaluate `gpt-5.4`. The expanded context window and improved reasoning handle edge cases where GPT-4o sometimes failed under heavy context load.
- Agentic workflows and tool calling: Use `gpt-5.4` or `gpt-5.4-thinking`. The GPT-5 series shows significantly better reliability on JSON schema adherence and multi-step instruction following, which directly reduces agent failure rates in production.
- High-volume production at scale: Evaluate `gpt-5.4-mini` first. For well-structured tasks, the performance gap versus GPT-4o is smaller than most teams expect, at a fraction of the cost.
- Simple extraction, classification, or routing: `gpt-5.4-nano` handles these with lower latency and near-zero cost. Most classification pipelines that were over-engineered to use GPT-4o can run on nano with no meaningful quality loss.
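The decision framework above can be captured as a small lookup table in an internal helper. This is a sketch only: the workload keys and model IDs are illustrative placeholders based on this article's examples, not an official mapping.

```javascript
// Hypothetical mapping from GPT-4o workload type to its suggested replacement.
// Model IDs follow this article's examples; confirm against the current model list.
const GPT4O_REPLACEMENTS = {
  "general-chat":   "gpt-5.2",
  "long-documents": "gpt-5.4",
  "agentic":        "gpt-5.4-thinking",
  "high-volume":    "gpt-5.4-mini",
  "classification": "gpt-5.4-nano",
};

// Resolve a workload type to a model ID, failing loudly on unknown types
// so a typo never silently routes traffic to the wrong model.
function replacementFor(workload) {
  const model = GPT4O_REPLACEMENTS[workload];
  if (!model) throw new Error(`No replacement mapping for workload: ${workload}`);
  return model;
}
```

Keeping this table in one file means a future model retirement is a one-line change per workload rather than a codebase-wide hunt.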
The Code Migration (Step by Step)
For most developers, the mechanical part of migration is a one-line change with a few important caveats. Here is the core update:
```javascript
// Before: GPT-4o call (now returns 404)
const response = await openai.chat.completions.create({
  model: "gpt-4o-2024-08-06",
  messages: [{ role: "user", content: prompt }],
  max_tokens: 1024
});

// After: GPT-5.2 migration
const response = await openai.chat.completions.create({
  model: "gpt-5.2-2026-02-15", // Pin to a date-stamped version in production
  messages: [{ role: "user", content: prompt }],
  max_completion_tokens: 1024 // Parameter renamed in the GPT-5 series
});
```

Two important changes beyond the model name:
- Parameter rename: The GPT-5 series uses `max_completion_tokens` instead of `max_tokens`. Both currently work, but `max_tokens` is deprecated and will trigger warnings in newer SDK versions.
- Version pinning: Never use an unversioned alias like `gpt-5.2` in production. When OpenAI updates the alias to point to a newer model snapshot, your prompts can drift in behavior without any deployment on your end. Always use a date-stamped version like `gpt-5.2-2026-02-15` so every upgrade is an explicit decision you make deliberately.
Three Migration Gotchas That Catch Developers Off Guard
The model ID swap is rarely sufficient on its own. Based on enterprise migration reports across dozens of teams, there are three failure modes that consistently catch developers by surprise after they flip the model name:
1. JSON Schema Strictness
GPT-5.x models have measurably stricter JSON output schema adherence than GPT-4o. If your prompts asked GPT-4o to "return JSON with a list of items" using a loosely specified schema, the GPT-5 series may reject the malformed schema or interpret the instruction differently, producing output that breaks downstream JSON parsers. Before migrating any workflow that depends on structured JSON output, explicitly validate your schema format in the system prompt and run representative inputs through the new model in a staging environment first.
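One practical safeguard is to pair an explicit schema with a parser-side check you run in staging. The sketch below is illustrative: the `strict`/`json_schema` shape mirrors OpenAI's structured-outputs request format, but verify the exact field names against the current SDK docs before relying on them.

```javascript
// Hypothetical strict schema for a "list of items" response, in roughly the
// shape used by structured-output request options (verify against SDK docs).
const itemListSchema = {
  name: "item_list",
  strict: true,
  schema: {
    type: "object",
    properties: {
      items: { type: "array", items: { type: "string" } }
    },
    required: ["items"],
    additionalProperties: false
  }
};

// Parser-side guard: does a raw model response parse as JSON and match the
// shape downstream code expects? Run representative inputs through this
// before flipping production traffic to the new model.
function validateItemList(raw) {
  let parsed;
  try {
    parsed = JSON.parse(raw);
  } catch (e) {
    return false;
  }
  return Array.isArray(parsed.items) && parsed.items.every(i => typeof i === "string");
}
```

Anything the guard rejects in staging is a prompt or schema you need to tighten before migration, not after.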
2. Prompt Drift
GPT-5.2 and GPT-5.4 respond to identical prompts with measurably different tone, verbosity, and phrasing compared to GPT-4o. Prompts carefully tuned for GPT-4o’s conciseness may produce longer or more formally structured outputs on GPT-5.2. Run your existing prompt suite against both models in parallel and compare output characteristics before switching production traffic. Adjust system prompts to add explicit constraints on length or tone where the differences matter for your use case.
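One lightweight way to run that parallel comparison is to score output characteristics rather than diffing text. A minimal sketch, checking only length (tone and structure checks would be added per use case; the 50% tolerance is an arbitrary starting point):

```javascript
// Flag a prompt as "drifted" when the new model's output length differs
// from the old model's by more than the given tolerance factor.
function flagLengthDrift(oldOutput, newOutput, tolerance = 0.5) {
  const ratio = newOutput.length / Math.max(oldOutput.length, 1);
  return ratio < 1 - tolerance || ratio > 1 + tolerance;
}
```

Run your prompt suite through both models, collect the outputs, and manually review any prompt this flags before switching traffic.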
3. Assistants API Has Its Own Separate Deadline
If your application was built on the Assistants API, your migration window is different and the stakes are higher. The Assistants API endpoints stop functioning entirely on August 26, 2026: Threads, Runs, and Vector Store integrations will all cease to work on that date. If you have production workflows on the Assistants API, begin planning the migration to the standard Chat Completions API now. Four months sounds like adequate runway, right up until the scope of the refactoring becomes clear.
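The core of that refactor is usually replacing server-side Threads with client-managed message history. A minimal sketch, assuming your stored thread messages carry `role` and `content` fields (the field names here are illustrative; map from your actual persisted shape):

```javascript
// Flatten stored Assistants-style thread messages into the messages array
// that Chat Completions expects, prepending the assistant's instructions
// as a system message.
function threadToChatMessages(threadMessages, systemPrompt) {
  const messages = [{ role: "system", content: systemPrompt }];
  for (const m of threadMessages) {
    messages.push({ role: m.role, content: m.content });
  }
  return messages;
}
```

The harder parts of the migration, such as replacing Vector Store retrieval and Run polling, still need their own design work; this only covers the conversation state.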
Beyond OpenAI: Alternatives Worth Evaluating
The GPT-4o retirement is also a natural moment to ask whether your architecture should remain OpenAI-only. The competitive landscape in April 2026 offers serious alternatives across every tier:
- Claude Opus 4.6 (Anthropic): Best-in-class on writing quality, nuanced instruction following, and long-document analysis. The right choice for content-heavy workflows where tone and factual accuracy matter most. Priced comparably to GPT-5.4 with a strong preference from professional writing and legal use cases.
- Gemini 3.1 Pro (Google DeepMind): Leads 13 of 16 major AI benchmarks as of Q1 2026, offers a 1M-token context window, and costs significantly less than equivalent OpenAI tiers. The strongest value choice for high-volume applications requiring complex reasoning.
- Meta Llama 4 Maverick (open-weight): 400 billion total parameters with 17 billion active per token, runs on a single NVIDIA H100 host, and costs nothing per token beyond your own infrastructure. Matches GPT-4o performance on most production benchmarks. The default choice for privacy-sensitive applications or teams that need to eliminate API costs entirely.
- DeepSeek V3.2 (open-source): Competitive with GPT-5.4 on code-specific benchmarks, fully open-source, and self-hostable. The best option for pure code generation workflows at high volume where cost is the primary constraint.
According to our analysis of production architectures, the most resilient AI stack in 2026 uses multiple models: a primary model for core tasks, a fallback for when the primary is unavailable or rate-limited, and smaller specialized models for high-volume preprocessing. Building this routing layer now means the next forced migration is an afternoon’s work rather than a multi-day incident.
Build Migration Resilience Going Forward
The GPT-4o retirement will not be the last forced migration. OpenAI’s release cadence in 2026 has already produced six major model versions in three months. Teams that treat each migration as a one-time emergency will face the same scramble every six to twelve months for the foreseeable future.
Three practices that make future migrations faster and lower-risk:
- Use an abstraction layer: Route all LLM calls through a single internal function that accepts a `task_type` parameter and maps it to the best current model. When you need to update a model assignment, you change one mapping in one file rather than hunting through hundreds of call sites across a large codebase.
- Build a regression test suite: Create a set of canonical prompts with expected output characteristics (not exact strings; check for structure, length range, and key information presence) that you run against any new model before switching production traffic. This single investment pays forward across every future migration.
- Subscribe to the deprecation feed: OpenAI's API changelog and deprecation announcements provide advance notice of upcoming retirements. Discovering a hard cutoff after it fires in production is a multi-day fire drill. Catching it with 90 days of notice is a sprint ticket.
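The regression-suite idea can start very small. Here is a sketch of a characteristic-based check; the case name, length bounds, and required substrings are invented for illustration and would come from your own canonical prompts:

```javascript
// Each case pins output characteristics, not exact strings: an acceptable
// length range plus key substrings that must appear in the response.
const REGRESSION_CASES = [
  { name: "refund-summary", minLen: 50, maxLen: 600, mustContain: ["refund", "order"] }
];

// Return true when a candidate model's output satisfies a case's
// length range and contains every required key phrase.
function passesCase(testCase, output) {
  const lenOk = output.length >= testCase.minLen && output.length <= testCase.maxLen;
  const keysOk = testCase.mustContain.every(k => output.toLowerCase().includes(k));
  return lenOk && keysOk;
}
```

Run every case against a candidate model and block the switch on any failure; the failing cases tell you exactly which prompts need retuning.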
The Bottom Line
GPT-4o was genuinely excellent — and its retirement reflects how quickly the field has advanced, not any failure of the model itself. The models that replaced it are faster, cheaper per token in most tiers, and more capable on nearly every benchmark that matters for production workloads.
If you are migrating an active production system today, the immediate priority list is: update model IDs to `gpt-5.2` or `gpt-5.4` based on your workload type, switch `max_tokens` to `max_completion_tokens`, pin to a date-stamped version, and run your top representative prompts to check for output drift. Most migrations take under an hour for teams with decent test coverage. The teams that wait until they hit a live 404 in production are the ones that spend two days on it.
For system prompt templates, AI workflow configurations, and prompt libraries verified against GPT-5.4, Claude Opus 4.6, and Gemini 3.1 Pro — browse our catalog at wowhow.cloud. Every template includes cross-model compatibility notes so your next migration starts from a stronger foundation.
Written by
Anup Karanjkar
Expert contributor at WOWHOW. Writing about AI, development, automation, and building products that ship.