Skip to content

GPT-5.4-nano

gpt-5.4-nano
OpenAI · chat · api · seen 13m ago
GA Alert me on changes
Context
400K
Max output
128K
Input $/1M
$0.20
Output $/1M
$1.25
Modalities
text
Released
14 Mar 2026
AI summary
● machine-written

OpenAI releases GPT-5.4 Nano, lightweight model for cost-efficient inference

GPT-5.4 Nano is the lightweight variant of the GPT-5.4 family, optimized for speed-critical and high-volume tasks. It supports text and image inputs with a 400K token context window and is designed for low-latency use cases including classification, data extraction, ranking, and sub-agent execution. The model prioritizes responsiveness and cost-efficiency over deep reasoning, making it suitable for real-time systems and distributed agent architectures.

What's new
  • Released March 17, 2026
  • 400K token context window with up to 128K token maximum output
  • Priced at $0.20 per million input tokens and $1.25 per million output tokens
  • Supports text and image inputs
  • Optimized for low-latency classification, extraction, ranking, and agent tasks
Best for
Background tasks and real-time systemsClassification, data extraction, and ranking workflowsSub-agent execution in distributed architecturesCost-sensitive, high-volume inference pipelines
Sources

Source: https://platform.openai.com/docs/models