Best Open Source Alternatives to Google Cloud Text-to-Speech
Looking for open source alternatives to Google Cloud Text-to-Speech? Here are our top picks. These projects offer similar speech synthesis features, with publicly available source code and community support.
Projects (13)
coqui-ai/TTS
A deep learning toolkit for advanced, multi-language Text-to-Speech generation and voice cloning, suitable for research and production.
babysor/MockingBird
A real-time voice cloning toolkit that allows users to replicate a voice in 5 seconds and generate arbitrary speech.
FunAudioLLM/CosyVoice
CosyVoice is a multilingual text-to-speech system built on a large language model, offering state-of-the-art voice generation, cloning, and full-stack deployment capabilities.
index-tts/index-tts
An industrial-level, zero-shot text-to-speech system offering precise duration control and disentangled emotional expression for highly natural and controllable speech synthesis.
rhasspy/piper
A fast, local neural text-to-speech system for generating high-quality speech offline.
Baiyuetribe/paper2gui
Paper2GUI converts complex AI research papers into user-friendly, install-free desktop applications, making advanced AI accessible to everyone.
netease-youdao/EmotiVoice
EmotiVoice is an open-source, multi-voice, and prompt-controlled text-to-speech engine supporting English and Chinese with emotional synthesis capabilities.
myshell-ai/MeloTTS
A high-quality, multi-lingual text-to-speech library supporting various languages and accents, optimized for real-time CPU inference.
snakers4/silero-models
A collection of pre-trained, end-to-end text-to-speech models designed for simplicity, speed, and natural-sounding speech across multiple languages.
metavoiceio/metavoice-src
MetaVoice-1B is an open-source, 1.2B-parameter foundation model for highly expressive, human-like text-to-speech synthesis with advanced voice cloning capabilities.
rsxdalv/TTS-WebUI
A unified Gradio and React web interface integrating a vast collection of open-source Text-to-Speech, audio generation, and voice conversion AI models.
OpenMOSS/MOSS-TTS
An open-source AI model family for high-fidelity, expressive speech and sound generation across diverse real-world applications.
edwko/OuteTTS
A versatile interface for OuteTTS models, providing flexible text-to-speech generation capabilities across various AI inference backends and hardware platforms.
FAQ
Why look for open source alternatives to Google Cloud Text-to-Speech?
Open source alternatives offer transparency, avoid vendor lock-in, and can usually be self-hosted without licensing fees.
Are these projects secure?
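The source code is public, so anyone can audit it. Widely used projects with active communities are more likely to have issues found and fixed quickly, but popularity alone does not guarantee security, so check each project's maintenance activity and open issues before deploying it.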