T
traeai
Sign in
返回首页
OpenAI(@OpenAI)

OpenAI on X: "Our new voice models are now available in the Realtime API:

7.8Score
OpenAI on X: "Our new voice models are now available in the Realtime API:

TL;DR · AI Summary

OpenAI has released three new voice models: GPT-Realtime-2, GPT-Realtime-Translate, and GPT-Realtime-Whisper, supporting real-time translation across more than 70 input and 13 output languages.

Key Takeaways

  • GPT-Realtime-2 enables building production-ready voice agents capable of handlin
  • GPT-Realtime-Translate supports real-time translation across more than 70 input
  • GPT-Realtime-Whisper transcribes streaming audio in real time to generate captio

Outline

Jump quickly between sections.

  1. OpenAI has introduced three new voice models, expanding the application range of voice intelligence.

  2. This model is used for creating voice agents capable of handling complex dialogues.

  3. Supports multi-language real-time translation, breaking down language barriers.

  4. Transcribes streaming audio in real time, suitable for captioning and note-taking.

Mindmap

See how the topics connect at a glance.

查看大纲文本(无障碍 / 无 JS 友好)
  • OpenAI 新语音模型
    • GPT-Realtime-2
      • 生产就绪的语音代理
    • GPT-Realtime-Translate
      • 多语言实时翻译
    • GPT-Realtime-Whisper
      • 实时转录与字幕生成

Highlights

Key sentences worth saving and sharing.

  • GPT-Realtime-Translate: Translate while streaming across more than 70 input and 13 output languages.
    ⬇︎ 下载 PNG𝕏 分享到 X
  • GPT-Realtime-2: Build production-ready voice agents that can think harder, take action, handle interruptions, and keep conversations flowing.
    ⬇︎ 下载 PNG𝕏 分享到 X
  • GPT-Realtime-Whisper: Transcribe streaming audio as words are spoken to generate captions and notes in real time.
    ⬇︎ 下载 PNG𝕏 分享到 X
#OpenAI#Voice Models#Realtime API
Open original article

🎙️ GPT-Realtime-2: Build production-ready voice agents that can think harder, take action, handle interruptions, and keep conversations flowing.

🎙️ GPT-Realtime-Translate: Translate while streaming across more than 70" / X

OpenAI on X: "Our new voice models are now available in the Realtime API: 🎙️ GPT-Realtime-2: Build production-ready voice agents that can think harder, take action, handle interruptions, and keep conversations flowing. 🎙️ GPT-Realtime-Translate: Translate while streaming across more than 70" / X

Don’t miss what’s happening

Image 4: Square profile picture

OpenAI

@OpenAI

Our new voice models are now available in the Realtime API: Image 5: 🎙️ GPT-Realtime-2: Build production-ready voice agents that can think harder, take action, handle interruptions, and keep conversations flowing. Image 6: 🎙️ GPT-Realtime-Translate: Translate while streaming across more than 70 input and 13 output languages, breaking down language barriers and helping people communicate more naturally. Image 7: 🎙️ GPT-Realtime-Whisper: Transcribe streaming audio as words are spoken to generate captions and notes in real time.

![Image 8 Advancing voice intelligence with new models in the API](https://t.co/CLRyRfQmmf)

From openai.com

5:19 PM · May 7, 2026

·

251.9K Views

37

184

1.6K

457

Read 37 replies

AI may generate inaccurate information. Please verify important content.