fromInfoQ
6 days agoOpenAI Introduces Websocket-Based Execution Mode to Reduce Latency in Agentic Workflows
OpenAI has introduced a WebSocket-based execution mode for its responses API to improve the performance of agentic workflows used in coding agents and real-time AI systems. The change replaces the traditional HTTP request-response pattern with a persistent, bidirectional connection between client and server, targeting latency and coordination overhead in multi-step reasoning workflows.
Artificial intelligence