VibeVoice

Category: AI Tools
Pricing: Open Source
Type: TOOL
Builder: @microsoft
GitHub: 50.5k stars
Added: Apr 12, 2026

About

Open-source voice AI framework with three components: automatic speech recognition (ASR) that transcribes 60-minute audio with speaker diarization, text-to-speech (TTS) that synthesizes 90 minutes of multi-speaker audio, and real-time streaming TTS. It operates at a low 7.5 Hz frame rate, which keeps long-form audio processing efficient.
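To put the 7.5 Hz figure in perspective, the arithmetic below counts how many frames cover the 60-minute and 90-minute durations quoted above. It is a back-of-the-envelope sketch, not VibeVoice code: the helper name `frames_for` is made up for illustration, and the 50 Hz comparison rate is an assumed value chosen only to show the scale of the difference.

```python
# Frame rate quoted in the listing (frames per second of audio).
FRAME_RATE_HZ = 7.5

# Hypothetical higher frame rate, used only for comparison (an assumption,
# not a figure from the listing).
COMPARISON_RATE_HZ = 50.0


def frames_for(minutes: float, frame_rate_hz: float = FRAME_RATE_HZ) -> int:
    """Return the number of frames needed to cover `minutes` of audio."""
    return round(minutes * 60 * frame_rate_hz)


asr_frames = frames_for(60)  # 60-minute transcription window
tts_frames = frames_for(90)  # 90-minute synthesis window
comparison_frames = frames_for(60, COMPARISON_RATE_HZ)

print(f"60 min at 7.5 Hz: {asr_frames} frames")
print(f"90 min at 7.5 Hz: {tts_frames} frames")
print(f"60 min at 50 Hz (assumed): {comparison_frames} frames")
```

At 7.5 Hz, an hour of audio is 27,000 frames and 90 minutes is 40,500 frames. A sequence that short is what makes single-pass processing of long recordings plausible.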

Why it made the leaderboard

Microsoft Research's open-source family of voice models, built on 7.5 Hz continuous speech tokenizers: ASR that transcribes 60 minutes of audio in a single pass with speaker diarization, TTS that sustains 90-minute multi-speaker synthesis, and real-time streaming TTS. Long-form audio without the chunking hacks.

Tech Stack

Python

Featured in Intel

explainer
Microsoft's VibeVoice Just Made Enterprise-Grade Voice AI Free
Microsoft open-sourced their internal voice AI stack, and it's crushing commercial alternatives with 60-minute transcription and 90-minute speech synthesis.

Indexed by a proprietary survey. Corrections welcome.
