Tags: #speech-synthesis - OSS Alternative - Discover Top Open Source Alternatives to Popular Software

AI Voice Studio Application

24.7k

jamiepine/voicebox

An open-source, local-first AI voice studio offering voice cloning, speech generation, and dictation with complete privacy.

ai voice, voice cloning, speech synthesis

Replaces:

ElevenLabs, WisprFlow

AI Desktop Application Toolkit

10.7k

Baiyuetribe/paper2gui

Paper2GUI converts complex AI research papers into user-friendly, install-free desktop applications, making advanced AI accessible to everyone.

ai tools, desktop app, gui

Replaces:

Midjourney, Topaz Video AI...

Speech Synthesis Library

5.9k

snakers4/silero-models

A collection of pre-trained, end-to-end text-to-speech models designed for simplicity, speed, and natural-sounding speech across multiple languages.

text-to-speech tts ai

Replaces:

Google Cloud Text-to-Speech, Amazon Polly...

Text-to-Speech (TTS) Model

8.7k

fishaudio/Bert-VITS2

An open-source text-to-speech model that combines the VITS2 backbone with multilingual BERT for high-quality, multi-language speech synthesis.

tts speech-synthesis bert

Generative AI Text-to-Speech Model

39.2k

2noise/ChatTTS

A generative speech model optimized for natural, expressive dialogue in LLM assistants, featuring fine-grained prosodic control.

tts generative-ai dialogue

Deep Learning Framework

59.7k

CorentinJ/Real-Time-Voice-Cloning

A deep learning framework for real-time voice cloning and text-to-speech synthesis from short audio samples.

voice-cloning speech-synthesis deep-learning

AI Voice Cloning Toolkit

36.9k

babysor/MockingBird

A real-time voice cloning toolkit that allows users to replicate a voice in 5 seconds and generate arbitrary speech.

voice-cloning text-to-speech real-time

Replaces:

ElevenLabs, Google Cloud Text-to-Speech...

AI Speech and Sound Generation Framework

1.5k

OpenMOSS/MOSS-TTS

An open-source AI model family for high-fidelity, expressive speech and sound generation across diverse real-world applications.

speech synthesis, sound generation, text-to-speech

Replaces:

Google Cloud Text-to-Speech, Amazon Polly...

Desktop Application

6.1k

LokerL/tts-vue

A cross-platform desktop application for Microsoft Edge text-to-speech synthesis, built with Electron and Vue.

tts electron vue

Replaces:

Commercial Text-to-Speech Software

AI-powered Text-to-Speech System

6.1k

canopyai/Orpheus-TTS

Orpheus TTS is a state-of-the-art open-source text-to-speech system built on a Llama-3b backbone, aiming to generate human-sounding, emotionally rich speech with low latency.

text-to-speech speech-synthesis ai

Speech Synthesis System

10.9k

rhasspy/piper

A fast, local neural text-to-speech system for generating high-quality speech offline.

text-to-speech tts neural-network

Replaces:

Google Cloud Text-to-Speech, Amazon Polly...

AI/ML Model & Speech Synthesis Library

6.2k

yl4579/StyleTTS2

StyleTTS 2 is a cutting-edge text-to-speech model achieving human-level speech synthesis through style diffusion and adversarial training with large speech language models.

text-to-speech tts ai

Text-to-Speech (TTS) Foundational Model

4.2k

metavoiceio/metavoice-src

MetaVoice-1B is an open-source 1.2B parameter foundational model for highly expressive, human-like text-to-speech synthesis with advanced voice cloning capabilities.

tts, voice cloning, deep learning

Replaces:

Google Cloud Text-to-Speech, Amazon Polly...

Speech Synthesis Library

4.0k

TensorSpeech/TensorFlowTTS

TensorFlowTTS is a real-time, state-of-the-art speech synthesis library built on TensorFlow 2, supporting multiple languages and optimized for efficient deployment.

speech-synthesis text-to-speech tensorflow

Replaces:

Commercial Text-to-Speech APIs

Speech Synthesis Library

7.8k

jaywalnut310/vits

VITS is an end-to-end text-to-speech model that generates highly natural-sounding audio with diverse rhythms, outperforming traditional two-stage TTS systems.

text-to-speech tts ai

Deep Learning Library

10.1k

mozilla/TTS

A deep learning library for advanced, high-quality, and efficient Text-to-Speech (TTS) synthesis, supporting multiple languages and models.

text-to-speech deep-learning speech-synthesis