Desktop Application
19.2k 2026-04-17

jamiepine/voicebox

The open-source, local-first voice synthesis studio for voice cloning, speech generation, and audio effects.

Core Features

Complete privacy with all data and models processed locally.
Supports 5 TTS engines and 23 languages for diverse speech generation.
Offers advanced post-processing effects like pitch shift, reverb, and compression.
Features a multi-track timeline editor for composing complex audio projects.
Provides an API-first design for seamless integration into other applications.

Quick Start

docker compose up

Detailed Introduction

Voicebox is an innovative open-source, local-first voice synthesis studio designed as a privacy-focused alternative to commercial solutions like ElevenLabs. It empowers users to clone voices from short audio samples, generate speech across 23 languages using five distinct TTS engines, and apply a wide array of post-processing effects. With its multi-track timeline editor, users can compose complex audio projects like conversations and podcasts. Built with Tauri for native performance, Voicebox ensures all models and voice data remain securely on your machine, offering unparalleled privacy and control for voice-powered applications and creative endeavors.

OSS Alternative

Explore the best open source alternatives to commercial software.

© 2026 OSS Alternative. hotgithub.com - All rights reserved.