Google Launches Gemini 3.1 Flash TTS with Enhanced Emotional Expression and Multi-Speaker Capabilities

Gate News, April 17: On April 15, Google unveiled Gemini 3.1 Flash TTS, an advanced text-to-speech model with enhanced emotional expression and control features. The model will roll out progressively through developer APIs, Vertex AI for enterprise customers, and collaboration tools.

The model’s core capabilities include natural-language audio tags for adjusting speed, intonation, and emotion, plus a “Director Mode” for specifying scenes and character roles to produce more nuanced voice output. A multi-speaker feature generates dialogue for multiple speakers simultaneously, allowing more natural conversation flows suited to podcasts, audio content, and AI assistants. The model supports over 70 languages and dialects and reflects regional accents and expressions, enabling localized voice experiences worldwide.

Google emphasized performance and cost efficiency, saying the model achieved high scores in blind human evaluations while its Flash architecture, designed for large-scale enterprise adoption, reduces computational costs. Generated audio includes SynthID watermarking to identify AI-generated content and combat misinformation.

The move reflects intensifying competition in voice interfaces. OpenAI is combining real-time voice features with conversational AI for human-like interactions, while Meta is expanding investments in AI characters with voice-based social experiences. Industry observers note that while high-level acting and creative work may remain human-driven for now, repetitive and large-scale production markets could see gradual AI adoption in dubbing, advertising, and audiobook sectors.

