AUDIO MODEL Mar 2026 Mistral AI Last updated: Apr 29, 2026

Voxtral TTS (v26.03) NEW

Voxtral TTS (v26.03) is an API model from Mistral AI. It’s positioned for audio tasks—work that benefits from iteration, not just one-shot answers.

Core Capabilities

Audio

Speech, music, or other audio understanding/synthesis.

Generative

Produces images, video, audio, or other media.

Context window not disclosed.

API

Available

Product / App

Not available

Open Source

Not released

Enterprise

Contact sales

Pay per token

Input and output billed separately.

Pay-per-token

Audio model from Mistral AI — see the linked sources below for benchmark and review coverage
Audio synthesis or transcription per the published model card