Best Open Source Alternatives to Google Cloud Text-to-Speech

deep-learning text-to-speech voice-synthesis

45.2k

coqui-ai/TTS

A deep learning toolkit for advanced, multi-language Text-to-Speech generation and voice cloning, suitable for research and production.

AI Voice Cloning Toolkit

voice-cloning text-to-speech real-time

36.9k

babysor/MockingBird

A real-time voice cloning toolkit that allows users to replicate a voice in 5 seconds and generate arbitrary speech.

Replaces:

ElevenLabs, Google Cloud Text-to-Speech...

AI Text-to-Speech System

20.8k

FunAudioLLM/CosyVoice

CosyVoice is an advanced multi-lingual large language model-based text-to-speech system offering state-of-the-art voice generation, cloning, and full-stack deployment capabilities.

text-to-speech tts llm

Text-to-Speech (TTS) System

20.3k

index-tts/index-tts

An industrial-level, zero-shot text-to-speech system offering precise duration control and disentangled emotional expression for highly natural and controllable speech synthesis.

text-to-speech tts ai

text-to-speech tts neural-network

Speech Synthesis System

10.9k

rhasspy/piper

A fast, local neural text-to-speech system for generating high-quality speech offline.

AI Desktop Application Toolkit

10.7k

Baiyuetribe/paper2gui

Paper2GUI converts complex AI research papers into user-friendly, install-free desktop applications, making advanced AI accessible to everyone.

ai tools, desktop app, gui

Replaces:

Midjourney, Topaz Video AI...

AI Text-to-Speech Engine

Docker

8.5k

netease-youdao/EmotiVoice

EmotiVoice is an open-source, multi-voice, and prompt-controlled text-to-speech engine supporting English and Chinese with emotional synthesis capabilities.

text-to-speech tts ai

text-to-speech tts multi-lingual

Text-to-Speech Library

Python

7.4k

myshell-ai/MeloTTS

A high-quality, multi-lingual text-to-speech library supporting various languages and accents, optimized for real-time CPU inference.

Speech Synthesis Library

PyTorch

5.9k

snakers4/silero-models

A collection of pre-trained, end-to-end text-to-speech models designed for simplicity, speed, and natural-sounding speech across multiple languages.

text-to-speech tts ai

tts, voice cloning, deep learning

Text-to-Speech (TTS) Foundational Model

Docker

4.2k

metavoiceio/metavoice-src

MetaVoice-1B is an open-source, 1.2B-parameter foundation model for highly expressive, human-like text-to-speech synthesis with advanced voice cloning capabilities.

text-to-speech audio-generation voice-conversion

AI/ML Audio Platform

Python

3.1k

rsxdalv/TTS-WebUI

A unified Gradio and React web interface integrating a vast collection of open-source Text-to-Speech, audio generation, and voice conversion AI models.

Replaces:

ElevenLabs, Google Cloud Text-to-Speech...

speech synthesis, sound generation, text-to-speech

AI Speech and Sound Generation Framework

llama.cpp

1.5k

OpenMOSS/MOSS-TTS

An open-source AI model family for high-fidelity, expressive speech and sound generation across diverse real-world applications.

AI/ML Library & SDK

text-to-speech ai-inference python-library

1.4k

edwko/OuteTTS

A versatile interface for OuteTTS models, providing flexible text-to-speech generation capabilities across various AI inference backends and hardware platforms.
