OpenAI's new voice AI, explained — what it can do now
Smarter conversations, live translation and instant transcription. Here's the plain-English version.
On 7 May 2026, OpenAI launched three new voice AI models in its API that can reason, translate and transcribe live.
Talking to AI is getting a lot better, and OpenAI's latest update is a big reason why. Here's what it added, in plain English — and where you might actually notice it.
The three new tools
- **Smarter talking (GPT-Realtime-2):** can handle more complex requests in a spoken conversation, instead of getting confused or giving canned answers.
- **Live translation (GPT-Realtime-Translate):** translates speech from 70+ input languages into 13 output languages while keeping up with the speaker, which is handy for travel or cross-language calls.
- **Live transcription (GPT-Realtime-Whisper):** turns your speech into text in real time, as you talk.
Where you'll see it
You won't download these directly — they're for companies building apps. But you'll feel them in smarter voice assistants, customer-service lines that actually understand you, live-translation features, and meeting apps that transcribe as people speak. It's the plumbing behind the better voice experiences arriving across lots of apps.
Sources
- Advancing voice intelligence with new models in the API — OpenAI, 7 May 2026
- OpenAI launches new voice intelligence features in its API — TechCrunch, 7 May 2026