GPT‑Realtime‑Whisper is a new streaming transcription model built for low-latency speech-to-text. It transcribes audio as ...
OpenAI launched three new audio models that can reason, translate across 70+ languages, and transcribe speech in real time, ...
The new features could be handy for customer service systems, but OpenAI says they have applications that work across a ...
The three are GPT-Realtime-2, a successor to the company’s existing realtime voice model with what OpenAI describes as GPT-5-class reasoning; GPT-Realtime-Translate, a live translation model with more ...
OpenAI launches GPT Realtime 2 for advanced voice reasoning alongside a new Codex Chrome extension to automate browser ...
OpenAI has introduced three new audio models through its API, expanding its push into ...
New AI suite: OpenAI released three voice AI models with real-time reasoning, translation, and transcription capabilities for developers. Enhanced capabilities: GPT‑Realtime‑2 features GPT‑5‑class ...
The launch ⁠of ⁠the application programming interface (API) moves the ChatGPT-maker beyond ​transcription and chat toward ...
OpenAI advances recursive AI; new startup pursues self-improving systems amid leaked experimental model names.
Put simply: these agents can be created and accessed from ChatGPT, but users can also add them to third-party apps like Slack ...
New ChatGPT Images 2.0 claims a step up in thinking capabilities, detailed instruction following, and improved rendering of ...
OpenAI is shuttering Sora, its stand-alone AI video generation app and social network, and the availability for developers to access the Sora 2 video model family through its application programming ...