OpenAI Realtime API - A real-time voice interaction API designed for low-latency, multimodal experiences.
## Purpose of OpenAI Realtime API
The primary purpose of the OpenAI Realtime API is to enable developers to create applications with fast, low-latency, and multimodal interactions, particularly focusing on seamless voice-to-voice communication.
## Technologies Integrated with OpenAI Realtime API
The OpenAI Realtime API integrates with large language models like GPT-4 and uses WebSockets for real-time streaming capabilities.
## Input and Output Types Supported by OpenAI Realtime API
The OpenAI Realtime API supports both text and audio input and output.
## Achieving Low-Latency Communication with OpenAI Realtime API
The OpenAI Realtime API achieves low-latency communication through the use of WebSockets, which enable real-time streaming and bidirectional communication.
## Applications Built with OpenAI Realtime API
Applications that can be built using the OpenAI Realtime API include voice assistants, real-time translation tools, and customer support agents.
## Initial Preset Voices in OpenAI Realtime API
Initially, the OpenAI Realtime API offered six preset voices.
## Additional Voices Added to OpenAI Realtime API in 2024
In 2024, the OpenAI Realtime API added five new voices: Ash, Ballad, Coral, Sage, and Verse.
## Steps to Use OpenAI Realtime API
To use the OpenAI Realtime API, developers need to generate an API key, set up a WebSocket connection, send and receive real-time data, handle API events, and optionally implement function calls to extend AI capabilities.
## Official Documentation for OpenAI Realtime API
Developers can find the official documentation for the OpenAI Realtime API at [platform.openai.com/docs](https://platform.openai.com/docs/).
### Citation sources:
- [OpenAI Realtime API](https://platform.openai.com/docs) - Official URL
Updated: 2025-03-26