release

GPT-5 System Card

For developers building AI workflows, GPT-5's routing system means they can get optimal performance automatically, saving engineering effort while improving user experience through faster responses and lower costs.

OpenAI Blog·August 6, 2025·1 min readrelease

releaseGPT-5 System Card

openai.com

What happened

OpenAI published a system card for GPT-5 detailing its new unified model routing system. Instead of a single model, GPT-5 consists of several specialized variants: gpt-5-main for general tasks, gpt-5-thinking for deeper reasoning, and lightweight versions like gpt-5-thinking-nano optimized for speed. The router automatically selects the best variant per request based on complexity and latency requirements. This architecture allows developers to balance accuracy and cost without manual model selection. For builders, the practical takeaway is that they can now leverage a single API that intelligently delegates to the most appropriate sub-model, reducing the need to maintain multiple endpoints or custom logic. The system card also covers safety and performance benchmarks, indicating OpenAI's focus on transparency and responsible deployment.

Key takeaways

GPT-5 introduces a unified routing system with multiple model variants: main, thinking, and thinking-nano.
The router matches each request to the best variant based on task complexity and speed needs.
Lightweight variants like gpt-5-thinking-nano are designed for low-latency, cost-sensitive applications.
OpenAI published a system card detailing safety evaluations and performance characteristics.
Developers can access GPT-5 through a single API without manual model selection.