OpenAI's Responses API overhaul with WebSockets boosts agentic workflow speeds by 40%, enabling GPT-5.3 to hit 1,000 TPS. Here's how they did it. (Read More)OpenAI's Responses API overhaul with WebSockets boosts agentic workflow speeds by 40%, enabling GPT-5.3 to hit 1,000 TPS. Here's how they did it. (Read More)

OpenAI Speeds Up Codex Workflows with WebSockets in Responses API

2026/04/29 00:33
3분 읽기
이 콘텐츠에 대한 의견이나 우려 사항이 있으시면 crypto.news@mexc.com으로 연락주시기 바랍니다

OpenAI Speeds Up Codex Workflows with WebSockets in Responses API

Terrill Dicki Apr 28, 2026 16:33

OpenAI's Responses API overhaul with WebSockets boosts agentic workflow speeds by 40%, enabling GPT-5.3 to hit 1,000 TPS. Here's how they did it.

OpenAI Speeds Up Codex Workflows with WebSockets in Responses API

OpenAI has introduced WebSocket support to its Responses API, slashing latency in Codex agent workflows by 40%. This upgrade allows the latest GPT-5.3-Codex-Spark model to achieve speeds of over 1,000 tokens per second (TPS), a massive leap from the 65 TPS of earlier versions. These advancements are central to enabling faster and more efficient AI-driven coding assistance, a key feature of Codex’s functionality.

Codex, OpenAI's AI-powered coding assistant, operates by scanning codebases, building context, making edits, and running tests in rapid succession. Historically, these tasks involved numerous back-and-forth API calls, which added significant latency. The issue became more pronounced as model inference speeds improved, exposing API overhead as the bottleneck. Addressing this, OpenAI reengineered the Responses API, introducing a persistent WebSocket connection to minimize redundant processing.

Previously, the Responses API treated each request as independent, requiring full conversation histories to be sent and processed with every call. The new WebSocket implementation caches reusable state in memory, allowing only incremental changes to be sent during follow-up requests. This streamlined approach reduces overhead and makes better use of both server-side GPUs and client-side resources.

Key optimizations included:

  • Caching tokens and model configurations to bypass repetitive tokenization.
  • Eliminating unnecessary network hops to streamline API-to-inference communication.
  • Enhancing safety classifiers to process only new inputs instead of entire histories.
  • Overlapping non-blocking processes like billing with subsequent requests.

The decision to adopt WebSockets over alternatives like gRPC bidirectional streaming hinged on simplicity and minimal disruption to developers. The WebSocket mode preserved the familiar API interaction patterns, allowing developers to integrate it seamlessly without major changes to their workflows.

The results have been striking. In alpha tests with key partners, startups reported up to a 40% improvement in agentic workflow speeds. Codex users, including those leveraging platforms like Vercel and Cursor, saw significant reductions in latency. Additionally, Codex achieved its target of 1,000 TPS for GPT-5.3-Codex-Spark and even recorded bursts of up to 4,000 TPS in production traffic.

For AI developers and enterprises, these changes underscore the growing importance of optimizing API infrastructures to keep pace with accelerating model inference speeds. As OpenAI’s advancements demonstrate, improving the systems surrounding AI models is as critical as enhancing the models themselves. By addressing API bottlenecks, OpenAI ensures that performance gains from faster inference hardware, like the Cerebras GPUs used by GPT-5.3, are fully realized by end users.

Looking ahead, the implementation of WebSocket support in the Responses API sets a new benchmark for speed in AI agent workflows. It positions OpenAI strongly as demand for high-performance, real-time AI capabilities continues to grow across industries. With the API updates now in full production, developers can expect smoother, faster integrations—an essential step as AI tools become foundational to coding, productivity, and beyond.

Image source: Shutterstock
  • openai
  • websockets
  • api
  • codex
  • gpt-5.3
시장 기회
CodexField 로고
CodexField 가격(CODEX)
$17.3301
$17.3301$17.3301
0.00%
USD
CodexField (CODEX) 실시간 가격 차트
면책 조항: 본 사이트에 재게시된 글들은 공개 플랫폼에서 가져온 것으로 정보 제공 목적으로만 제공됩니다. 이는 반드시 MEXC의 견해를 반영하는 것은 아닙니다. 모든 권리는 원저자에게 있습니다. 제3자의 권리를 침해하는 콘텐츠가 있다고 판단될 경우, crypto.news@mexc.com으로 연락하여 삭제 요청을 해주시기 바랍니다. MEXC는 콘텐츠의 정확성, 완전성 또는 시의적절성에 대해 어떠한 보증도 하지 않으며, 제공된 정보에 기반하여 취해진 어떠한 조치에 대해서도 책임을 지지 않습니다. 본 콘텐츠는 금융, 법률 또는 기타 전문적인 조언을 구성하지 않으며, MEXC의 추천이나 보증으로 간주되어서는 안 됩니다.

Starter Gold Rush: Win $2,500!

Starter Gold Rush: Win $2,500!Starter Gold Rush: Win $2,500!

Start your first trade & capture every Alpha move