GPT-Realtime-2 brings GPT-5-class reasoning to live voice. A separate translation model covers 70+ input languages. A streaming Whisper variant handles transcription. The pricing is aggressive enough ...
What’s been launched: OpenAI released GPT‑Realtime‑2, GPT‑Realtime‑Translate, and GPT‑Realtime‑Whisper via its API, adding advanced reasoning, live translation, and instant transcription capabilities.
Voice agents have been expensive to run and painful to orchestrate, not because the models can't handle conversation, but because context ceilings forced enterprises to build session resets, state ...
May 7 (Reuters) - OpenAI introduced ⁠three ⁠audio models for ⁠its developer platform on Thursday, aiming to make voice-based software agents more conversational ‌and capable of completing ‌tasks in ...