OpenAI releases GPT-5.4 mini and nano models with 2x faster speeds and dramatically lower costs, targeting coding assistants and agentic AI systems. (Read More)OpenAI releases GPT-5.4 mini and nano models with 2x faster speeds and dramatically lower costs, targeting coding assistants and agentic AI systems. (Read More)

OpenAI Launches GPT-5.4 Mini and Nano for High-Volume AI Workloads

2026/03/18 02:05
3 min read
For feedback or concerns regarding this content, please contact us at crypto.news@mexc.com

OpenAI Launches GPT-5.4 Mini and Nano for High-Volume AI Workloads

Peter Zhang Mar 17, 2026 18:05

OpenAI releases GPT-5.4 mini and nano models with 2x faster speeds and dramatically lower costs, targeting coding assistants and agentic AI systems.

OpenAI Launches GPT-5.4 Mini and Nano for High-Volume AI Workloads

OpenAI dropped its most cost-efficient models yet on March 17, 2026—GPT-5.4 mini and nano—targeting developers building latency-sensitive applications where the flagship model's horsepower becomes overkill.

The mini variant runs more than twice as fast as GPT-5 mini while approaching the full GPT-5.4's performance on coding benchmarks. On SWE-Bench Pro, mini scored 54.4% compared to the flagship's 57.7%—a narrow gap that matters when you're paying 75 cents per million input tokens instead of premium rates.

Nano goes even cheaper at $0.20 per million input tokens and $1.25 per million output tokens. OpenAI positions it for classification, data extraction, and what they call "coding subagents"—smaller AI workers handling simpler tasks within larger systems.

The Subagent Play

Here's where this gets interesting for developers building agentic systems. OpenAI is explicitly pushing a tiered architecture: let GPT-5.4 handle planning and complex judgment while mini or nano subagents execute narrower tasks in parallel. In their Codex platform, mini uses only 30% of the GPT-5.4 quota.

The benchmark numbers back this up. Mini hit 72.1% on OSWorld-Verified for computer use tasks—nearly matching the flagship's 75%—while nano dropped to 39%. Translation: mini can interpret screenshots and navigate interfaces almost as well as the big model, but nano shouldn't touch those workflows.

Where Each Model Fits

The performance spread tells you exactly what OpenAI optimized for:

Mini excels at coding (54.4% SWE-Bench Pro, 60% Terminal-Bench 2.0) and tool-calling (93.4% on τ2-bench telecom tasks). It supports a 400k context window with text and image inputs, web search, and function calling.

Nano trades capability for cost efficiency. It scored 52.4% on SWE-Bench Pro and 46.3% on Terminal-Bench 2.0—respectable for a model at one-quarter mini's price point. But its long-context performance drops significantly, hitting just 33.1% on the 128K-256K needle retrieval test.

Hebbia's CTO Aabhas Sharma noted that mini "matched or exceeded competitive models on several output tasks and citation recall at a much lower cost" while achieving "stronger source attribution than the larger GPT-5.4 model."

Availability

Mini is live across the API, Codex, and ChatGPT. Free and Go users can access it through the Thinking feature; other tiers get it as a rate limit fallback for GPT-5.4 Thinking.

Nano remains API-only—a signal that OpenAI sees it primarily as infrastructure for developers rather than a consumer-facing product.

For teams running high-volume AI workloads, the math just changed. The question isn't whether to use smaller models anymore—it's figuring out which tasks actually need the flagship.

Image source: Shutterstock
  • openai
  • gpt-5.4
  • ai models
  • api pricing
  • machine learning
Market Opportunity
4 Logo
4 Price(4)
$0.007143
$0.007143$0.007143
-7.80%
USD
4 (4) Live Price Chart
Disclaimer: The articles reposted on this site are sourced from public platforms and are provided for informational purposes only. They do not necessarily reflect the views of MEXC. All rights remain with the original authors. If you believe any content infringes on third-party rights, please contact crypto.news@mexc.com for removal. MEXC makes no guarantees regarding the accuracy, completeness, or timeliness of the content and is not responsible for any actions taken based on the information provided. The content does not constitute financial, legal, or other professional advice, nor should it be considered a recommendation or endorsement by MEXC.

You May Also Like

IP Hits $11.75, HYPE Climbs to $55, BlockDAG Surpasses Both with $407M Presale Surge!

IP Hits $11.75, HYPE Climbs to $55, BlockDAG Surpasses Both with $407M Presale Surge!

The post IP Hits $11.75, HYPE Climbs to $55, BlockDAG Surpasses Both with $407M Presale Surge! appeared on BitcoinEthereumNews.com. Crypto News 17 September 2025 | 18:00 Discover why BlockDAG’s upcoming Awakening Testnet launch makes it the best crypto to buy today as Story (IP) price jumps to $11.75 and Hyperliquid hits new highs. Recent crypto market numbers show strength but also some limits. The Story (IP) price jump has been sharp, fueled by big buybacks and speculation, yet critics point out that revenue still lags far behind its valuation. The Hyperliquid (HYPE) price looks solid around the mid-$50s after a new all-time high, but questions remain about sustainability once the hype around USDH proposals cools down. So the obvious question is: why chase coins that are either stretched thin or at risk of retracing when you could back a network that’s already proving itself on the ground? That’s where BlockDAG comes in. While other chains are stuck dealing with validator congestion or outages, BlockDAG’s upcoming Awakening Testnet will be stress-testing its EVM-compatible smart chain with real miners before listing. For anyone looking for the best crypto coin to buy, the choice between waiting on fixes or joining live progress feels like an easy one. BlockDAG: Smart Chain Running Before Launch Ethereum continues to wrestle with gas congestion, and Solana is still known for network freezes, yet BlockDAG is already showing a different picture. Its upcoming Awakening Testnet, set to launch on September 25, isn’t just a demo; it’s a live rollout where the chain’s base protocols are being stress-tested with miners connected globally. EVM compatibility is active, account abstraction is built in, and tools like updated vesting contracts and Stratum integration are already functional. Instead of waiting for fixes like other networks, BlockDAG is proving its infrastructure in real time. What makes this even more important is that the technology is operational before the coin even hits exchanges. That…
Share
BitcoinEthereumNews2025/09/18 00:32
What To Expect From The Fed Rate Decision Tomorrow

What To Expect From The Fed Rate Decision Tomorrow

The post What To Expect From The Fed Rate Decision Tomorrow appeared on BitcoinEthereumNews.com. The Fed is likely to hold interest rates steady for a second consecutive
Share
BitcoinEthereumNews2026/03/18 06:22
Young pastor says entrenched conservatism 'made me question the whole system'

Young pastor says entrenched conservatism 'made me question the whole system'

Rural Alabama pastor Daniel Rogers refused to give up the church after being ousted by his home denomination, but it wasn’t an easy journey.Rogers is a member of
Share
Alternet2026/03/18 06:41