tech

February 4, 2026

Voxtral transcribes at the speed of sound.

Usage-based, starting around €5K/month for your size.

Voxtral transcribes at the speed of sound.

TL;DR

  • Voxtral Transcribe 2 includes Mini Transcribe V2 for batch and Realtime for live applications.
  • Models offer state-of-the-art transcription quality, speaker diarization, and ultra-low latency.
  • Voxtral Mini Transcribe V2 provides context biasing and word-level timestamps in 13 languages.
  • Voxtral Realtime features configurable latency down to sub-200ms for voice agents and real-time applications.
  • Voxtral Realtime is open-weights under Apache 2.0 for edge deployment and privacy.
  • Mistral Studio now includes an audio playground to test Voxtral Transcribe 2 features.
  • Voxtral Mini Transcribe V2 boasts industry-leading accuracy at a low cost, outperforming competitors like GPT-4o mini and Gemini 2.5 Flash.
  • Enterprise features include speaker diarization, context biasing, word-level timestamps, expanded language support, noise robustness, and longer audio support (up to 3 hours).
  • Voxtral Realtime is priced at $0.006/min via API and available as open weights.
  • Voxtral Mini Transcribe V2 is priced at $0.003/min via API.