jamiepine/voicebox
The open-source, local-first voice synthesis studio for voice cloning, speech generation, and audio effects.
Core Features
Quick Start
docker compose upDetailed Introduction
Voicebox is an innovative open-source, local-first voice synthesis studio designed as a privacy-focused alternative to commercial solutions like ElevenLabs. It empowers users to clone voices from short audio samples, generate speech across 23 languages using five distinct TTS engines, and apply a wide array of post-processing effects. With its multi-track timeline editor, users can compose complex audio projects like conversations and podcasts. Built with Tauri for native performance, Voicebox ensures all models and voice data remain securely on your machine, offering unparalleled privacy and control for voice-powered applications and creative endeavors.