GPT-5.4-nano
gpt-5.4-nano
OpenAI · chat · api · seen 13m ago
Context
400K
Max output
128K
Input $/1M
$0.20
Output $/1M
$1.25
Modalities
text
Released
14 Mar 2026
AI summary
● machine-written
OpenAI releases GPT-5.4 Nano, lightweight model for cost-efficient inference
GPT-5.4 Nano is the lightweight variant of the GPT-5.4 family, optimized for speed-critical and high-volume tasks. It supports text and image inputs with a 400K token context window and is designed for low-latency use cases including classification, data extraction, ranking, and sub-agent execution. The model prioritizes responsiveness and cost-efficiency over deep reasoning, making it suitable for real-time systems and distributed agent architectures.
What's new
- Released March 17, 2026
- 400K token context window with up to 128K token maximum output
- Priced at $0.20 per million input tokens and $1.25 per million output tokens
- Supports text and image inputs
- Optimized for low-latency classification, extraction, ranking, and agent tasks
Best for
Background tasks and real-time systemsClassification, data extraction, and ranking workflowsSub-agent execution in distributed architecturesCost-sensitive, high-volume inference pipelines