OpenAI on X: Our new voice models are now available in the Realtime API

OpenAI (@OpenAI) · May 7, 2026



TL;DR · AI Summary

OpenAI has released three new voice models in the Realtime API: GPT-Realtime-2, GPT-Realtime-Translate, and GPT-Realtime-Whisper. GPT-Realtime-Translate supports real-time translation across more than 70 input and 13 output languages.

Key Takeaways

GPT-Realtime-2 enables building production-ready voice agents capable of handling interruptions and keeping conversations flowing.
GPT-Realtime-Translate supports real-time translation across more than 70 input and 13 output languages.
GPT-Realtime-Whisper transcribes streaming audio in real time to generate captions and notes.

Outline


§New Voice Models Released
OpenAI has introduced three new voice models, expanding the application range of voice intelligence.
·GPT-Realtime-2
This model is used for creating voice agents capable of handling complex dialogues.
·GPT-Realtime-Translate
Supports multi-language real-time translation, breaking down language barriers.
·GPT-Realtime-Whisper
Transcribes streaming audio in real time, suitable for captioning and note-taking.

Mindmap


OpenAI's new voice models
- GPT-Realtime-2
  - Production-ready voice agents
- GPT-Realtime-Translate
  - Multilingual real-time translation
- GPT-Realtime-Whisper
  - Real-time transcription and caption generation

Highlights


GPT-Realtime-Translate: Translate while streaming across more than 70 input and 13 output languages.
GPT-Realtime-2: Build production-ready voice agents that can think harder, take action, handle interruptions, and keep conversations flowing.
GPT-Realtime-Whisper: Transcribe streaming audio as words are spoken to generate captions and notes in real time.

#OpenAI #Voice Models #Realtime API


OpenAI

@OpenAI

Our new voice models are now available in the Realtime API:

🎙️ GPT-Realtime-2: Build production-ready voice agents that can think harder, take action, handle interruptions, and keep conversations flowing.

🎙️ GPT-Realtime-Translate: Translate while streaming across more than 70 input and 13 output languages, breaking down language barriers and helping people communicate more naturally.

🎙️ GPT-Realtime-Whisper: Transcribe streaming audio as words are spoken to generate captions and notes in real time.

![Advancing voice intelligence with new models in the API](https://t.co/CLRyRfQmmf)

From openai.com

5:19 PM · May 7, 2026 · 251.9K Views
