OpenAI has rolled out a new set of real-time audio models focused on making voice AI faster and more useful in live ...
Translate, and Realtime-Whisper split voice into discrete models, reducing the orchestration overhead that has made ...
GPT‑Realtime‑Whisper is a new streaming transcription model built for low-latency speech-to-text. It transcribes audio as ...
What’s new: OpenAI launched GPT‑Realtime‑2, GPT‑Realtime‑Translate, and GPT‑Realtime‑Whisper, enhancing voice AI in reasoning, translation, and transcription. Why it matters: The models enable more ...
May 7 (Reuters) - OpenAI introduced three audio models for its developer platform on Thursday, aiming to make voice-based ...
GPT-Realtime-2 is OpenAI’s first voice model with GPT-5-class reasoning designed for live conversational use cases. It ...
The new lineup includes GPT-Realtime-2, GPT-Realtime-Translate, and GPT-Realtime-Whisper. All three are available now through ...
OpenAI launched three new audio models that can reason, translate across 70+ languages, and transcribe speech in real time, ...
The new features could be handy for customer service systems, but OpenAI says they have applications that work across a ...
DeepL says its tech could be used for real-time translation with meeting tools like Zoom and Microsoft Teams ...
AI voice agents are getting closer to doing more than waiting their turn to speak. OpenAI announced Thursday that it is ...
ENCO has announced the release of enSpeak, a voice translation solution, allowing audiences to hear live programming in their preferred language.