---
title: "Last week, we launched Gemini 3.1 TTS, our latest and best text-to-speech model. This new model intr..."
source_name: "Google AI(@GoogleAI)"
original_url: "https://x.com/GoogleAI/status/2047377023656436013"
canonical_url: "https://www.traeai.com/articles/733ab0b7-f04c-4c85-b89e-2ec9794c4ee3"
content_type: "tweet"
language: "中文"
score: 5
tags: []
published_at: "2026-04-23T18:08:15+00:00"
created_at: "2026-04-23T23:00:32.096289+00:00"
---

# Last week, we launched Gemini 3.1 TTS, our latest and best text-to-speech model. This new model intr...

Canonical URL: https://www.traeai.com/articles/733ab0b7-f04c-4c85-b89e-2ec9794c4ee3
Original source: https://x.com/GoogleAI/status/2047377023656436013

## Summary

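traeai curates high-quality AI technical content for developers, researchers, and content teams, offering summaries, scoring, a trend radar, and one-click content generation.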

## Content

Last week, we launched Gemini 3.1 TTS, our latest and best text-to-speech model. This new model introduces [awe] audio tags, an intuitive way to guide vocal style, pace, and delivery.

Here are some tips on the best ways to use audio tags in your prompts:

1. All inline tags must be enclosed in square brackets, such as [screams] or [whispers].
2. Insert these tags exactly where you want the transition to occur, and make sure to avoid placing tags directly next to each other.
3. Use tags like [slow] or [fast] to control the pace of the delivery, or even [short pause] or [long pause] to ramp up the anticipation in dramatic moments.
4. The model also offers granular control over vocalizations, allowing you to direct the delivery with cues like [cackles] or [whispers].
5. An ideal audio tag formula could look something like:

   > [encouraging] Let’s try that last sentence again to make sure that you nailed it. [slow] "L'oiseau s'est envolé." [short pause] Perfect! [laughs] You're a natural.

No matter what you’re developing, from [scholarly] a language learning tool, to [mysterious] an interactive podcast app, to [friendly] more adaptive customer service offerings, and beyond, these prompting tips will equip you to start building with Gemini 3.1 TTS.
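The bracket and adjacency tips above can be checked mechanically before a prompt is sent to the model. The following is a minimal sketch in plain Python, not part of any Gemini SDK: the helper names `find_tags` and `lint_prompt` are hypothetical, and treating two tags separated only by whitespace as "directly next to each other" is an assumption about what the tip means.

```python
import re

# Matches one inline audio tag such as [whispers] or [short pause].
TAG_PATTERN = re.compile(r"\[([^\[\]]+)\]")

# Assumption: two tags separated only by optional whitespace count as
# "directly next to each other", e.g. "[slow][whispers]" or "[slow] [whispers]".
ADJACENT_PATTERN = re.compile(r"\[[^\[\]]+\]\s*\[[^\[\]]+\]")


def find_tags(prompt: str) -> list[str]:
    """Return the tag names in the order they appear in the prompt."""
    return [match.group(1) for match in TAG_PATTERN.finditer(prompt)]


def lint_prompt(prompt: str) -> list[str]:
    """Return warnings for the prompt; an empty list means nothing was flagged."""
    warnings = []
    for match in ADJACENT_PATTERN.finditer(prompt):
        warnings.append(f"adjacent tags: {match.group(0)!r}")
    # A mismatch in bracket counts suggests a tag that is not fully enclosed.
    if prompt.count("[") != prompt.count("]"):
        warnings.append("unbalanced square brackets")
    return warnings


# The example prompt from the post.
SAMPLE = (
    "[encouraging] Let’s try that last sentence again to make sure that you nailed it. "
    "[slow] \"L'oiseau s'est envolé.\" [short pause] Perfect! "
    "[laughs] You're a natural."
)

if __name__ == "__main__":
    print(find_tags(SAMPLE))
    print(lint_prompt(SAMPLE))
    print(lint_prompt("[slow] [whispers] Hello there."))
```

The check is purely syntactic: it does not know which tag names the model actually recognizes, so it cannot tell a supported tag from a misspelled one.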
