What's new with GPT Realtime Whisper?

traeai has indexed 7 items related to GPT Realtime Whisper. The most recent is "Advancing voice intelligence with new models in the API", published by OpenAI Blog.

Model

GPT Realtime Whisper

Q: What is GPT Realtime Whisper?

A GPT model for real-time speech transcription.

7 highly relevant items tracked

TraeAI Observations

If you read only 3

Advancing voice intelligence with new models in the API

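OpenAI Blog · score 9.2

OpenAI introduces three real-time voice models to make voice applications more intelligent.

OpenAI has launched three new voice models in the Realtime API: GPT-Realtime-2, GPT-Realtime-Translate, and GPT-Realtime-Whisper, which respectively handle conversation...

宝玉(@dotey) · score 8.9

OpenAI released three new voice models: GPT-Realtime-2, GPT-Realtime-Translate, and GPT-Realtime-Whisper, significantly improving conversation, translation, and real-time transcription capabilities.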

Our new voice models are now available in the Realtime API: 🎙️ GPT-Realtime-2: Build production-re...

OpenAI(@OpenAI) · score 7.8

OpenAI released three new voice models: GPT-Realtime-2, GPT-Realtime-Translate, and GPT-Realtime-Whisper, with real-time translation supporting more than 70 input languages and 13 output languages.

Advancing voice intelligence with new models in the API

OpenAI Blog · May 8 · 1,767 words (about 8 minutes)

OpenAI introduces three real-time audio models to enhance voice applications.

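Why it was selected: GPT-Realtime-2 can handle complex requests and carry a conversation forward naturally.

Featured Article · #OpenAI #Audio Models #API · Chinese

OpenAI has launched three new voice models in the Realtime API: GPT-Realtime-2, GPT-Realtime-Translate, and GPT-Realtime-Whisper, which respectively handle conversation...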

OpenAI Released Three New Voice Models in Realtime API

宝玉(@dotey) · May 8 · 572 words (about 3 minutes)

OpenAI released three new voice models: GPT-Realtime-2, GPT-Realtime-Translate, and GPT-Realtime-Whisper, significantly enhancing dialogue, translation, and real-time transcription capabilities.

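Why it was selected: GPT-Realtime-2 improved from 81.4% to 96.6% on the Big Bench Audio intelligence benchmark and supports more complex task orchestration.

Featured Tweet · #OpenAI #voice model #API · Chinese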

Our new voice models are now available in the Realtime API:

🎙️ GPT-Realtime-2: Build production-re...

OpenAI on X: "Our new voice models are now available in the Realtime API:

OpenAI(@OpenAI) · May 8 · 216 words (about 1 minute)

OpenAI has released three new voice models: GPT-Realtime-2, GPT-Realtime-Translate, and GPT-Realtime-Whisper, supporting real-time translation across more than 70 input and 13 output languages.

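Why it was selected: GPT-Realtime-2 can be used to build production-ready voice agents that handle complex conversations while staying fluent.

Featured Tweet · #OpenAI #Voice Models #Realtime API · English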

Codex for transcribing and answering questions about a meeting in real time:

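Greg Brockman(@gdb) · May 27 · 94 words (about 1 minute)

Codex can now transcribe meetings in real time and answer questions about them as they happen, through the updated Codex Meeting Recorder skill.

Why it was selected: Codex uses GPT Realtime Whisper for real-time meeting transcription.

Featured Tweet · #Codex #GPT #Meeting Transcription #Real-time Q&A · Chinese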

OpenAI Developers on X: "GPT-Realtime-2, GPT-Realtime-Translate, and GPT-Realtime-Whisper are available in the Realtime API today.

OpenAI Developers(@OpenAIDevs) · May 8 · 74 words (about 1 minute)

OpenAI has released GPT-Realtime-2, GPT-Realtime-Translate, and GPT-Realtime-Whisper, now available in the Realtime API.

Why it was selected: GPT-Realtime-2 improves real-time processing capability.

Featured Tweet · #OpenAI #GPT #Realtime API · English

GPT-Realtime-Whisper brings low-latency streaming transcription to the Realtime API

OpenAI Developers(@OpenAIDevs) · May 8 · 94 words (about 1 minute)

GPT-Realtime-Whisper brings low-latency streaming transcription to the Realtime API, suitable for applications that need to continuously understand speech during interactions.

Why it was selected: GPT-Realtime-Whisper delivers low-latency speech-to-text.

Featured Tweet · #GPT-Realtime-Whisper #Speech Recognition #OpenAI · English

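The Third Model, GPT-Realtime-Whisper, Is a Streaming Speech-to-Text Model

The original Whisper was designed around processing a complete piece of audio: you finish recording, hand it over, and it returns the transcript. The new streaming version transcribes as you speak, with latency extremely...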

小互(@imxiaohu) · May 8 · 311 words (about 2 minutes)

GPT-Realtime-Whisper is a streaming speech-to-text model designed for real-time scenarios. It supports low-latency processing, unlike the original Whisper, which transcribes a complete recording in one batch.

Why it was selected: The new model supports streaming, so it can produce output without waiting for the whole recording to finish.

Featured Tweet · #AI #Speech Recognition #Streaming Processing #Whisper #Real-time Communication · Chinese
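
The batch-versus-streaming distinction described in the item above can be illustrated with a minimal sketch. It does not call the Realtime API or any real speech model: `fake_recognize`, `transcribe_batch`, and `transcribe_streaming` are hypothetical names invented for illustration, and plain text bytes stand in for audio chunks.

```python
from typing import Iterable, Iterator


def fake_recognize(audio: bytes) -> str:
    """Hypothetical stand-in for a recognizer: decodes bytes as text.

    A real model would consume audio samples; this is an assumption
    made only so the example runs without any external service.
    """
    return audio.decode("utf-8")


def transcribe_batch(chunks: Iterable[bytes]) -> str:
    """Batch style: wait for the whole recording, then transcribe once."""
    audio = b"".join(chunks)
    return fake_recognize(audio)


def transcribe_streaming(chunks: Iterable[bytes]) -> Iterator[str]:
    """Streaming style: yield a growing partial transcript per chunk."""
    heard = b""
    for chunk in chunks:
        heard += chunk
        yield fake_recognize(heard)


chunks = [b"hello ", b"from ", b"the meeting"]

# Batch: nothing is available until every chunk has arrived.
final_text = transcribe_batch(chunks)

# Streaming: a partial result is available after each chunk.
partials = list(transcribe_streaming(chunks))
```

In this toy version the last streaming partial equals the batch result; the difference is only in when text becomes available, which is the property the summaries above attribute to the streaming model.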

Cross-source Q&A · GPT Realtime Whisper

Answers are based on: 7 items related to GPT Realtime Whisper