AUDIO MODEL Feb 2026 Google/DeepMind Last updated: Apr 29, 2026

Lyria 3 NEW

Google's Generative Music + Live Mode

Google's music generation model — Lyria 3 and the realtime Lyria RealTime, integrated into the Gemini app, YouTube Shorts, and NotebookLM Audio Overviews. RealTime variant generates music live as you guide it, similar to playing a synthesizer.

Try Gemini API Docs ↗

Official ↗

Why it matters

Made generative music a default Gemini-app feature for hundreds of millions of consumers. The realtime variant proved that generative music can be interactive performance, not just batch synthesis.

Core Capabilities

Audio

Speech, music, or other audio understanding/synthesis.

Generative

Produces images, video, audio, or other media.

Multimodal

Combines text, vision, and audio in one model.

Context Window

Context window not disclosed.

Availability

API

Available

Product / App

Available

Open Source

Not released

Enterprise

Contact sales

Pricing Model

Pay per token

Input and output billed separately.

Pay-per-token

What it feels like

Audio model from Google DeepMind — see the linked sources below for benchmark and review coverage
Audio synthesis or transcription per the published model card

Best use cases

Audio synthesis / transcription tasks per the model card
See the model spec and sources block for benchmarked use cases

Tools to try

Gemini app AI Studio Vertex AI

Not ideal for

Tasks far outside the modalities listed in this model's spec
Workflows where a more recent successor in the same family scores higher