GPT-5 nano

Compact GPT-5 tier focused on low-latency responses and strong cost efficiency for high-volume workloads.

Context window

128k tokens

Peak context for this model.

Availability

OpenAI API, Responses API, Batch API

Where you can run it.

Modalities

Text · Code

Input/output coverage.

Pricing

$0.05 / 1M input tokens, $0.40 / 1M output tokens

Latency: Very low; optimized for high-throughput production traffic

Strengths

Best for

Summary

Other models

GPT-4.1

flagship · 2024

o3-mini

reasoning · 2024