Tags: #multilingual

AI Text-to-Speech System

20.8k

FunAudioLLM/CosyVoice

CosyVoice is an advanced multi-lingual large language model-based text-to-speech system offering state-of-the-art voice generation, cloning, and full-stack deployment capabilities.

text-to-speech tts llm

Replaces:

Google Cloud Text-to-Speech Amazon Polly...

Details

AI Prompt Library

11.7k

YouMind-OpenLab/awesome-nano-banana-pro-prompts

A vast, multilingual, and open-source library of over 10,000 curated prompts for Google's Nano Banana Pro AI image generation, complete with preview images.

ai prompts prompt engineering ai image generation

Details

Text-to-Speech (TTS) Model

python

8.7k

fishaudio/Bert-VITS2

An open-source text-to-speech model that combines the VITS2 backbone with multilingual BERT for high-quality, multi-language speech synthesis.

tts speech-synthesis bert

Details

AI-powered Text-to-Speech System

Docker

30.0k

fishaudio/fish-speech

A state-of-the-art open-source multilingual text-to-speech system offering exceptionally natural, realistic, and emotionally rich voice generation.

tts ai multilingual

Details

AI Speech Synthesis System

4.6k

WhisperSpeech/WhisperSpeech

An open-source, high-performance text-to-speech (TTS) system built by inverting OpenAI Whisper, aiming to be the Stable Diffusion for speech.

text-to-speech ai open-source

Details

AI Text-to-Speech Engine

Docker

8.5k

netease-youdao/EmotiVoice

EmotiVoice is an open-source, multi-voice, and prompt-controlled text-to-speech engine supporting English and Chinese with emotional synthesis capabilities.

text-to-speech tts ai

Replaces:

Google Cloud Text-to-Speech Amazon Polly...

Details

AI Model Implementation / Text-to-Speech System

Python

7.9k

Plachtaa/VALL-E-X

An open-source implementation of Microsoft's VALL-E X, enabling zero-shot multilingual text-to-speech synthesis and voice cloning with emotion control.

tts voice-cloning multilingual

Replaces:

ElevenLabs Commercial Text-to-Speech (TTS) services

Details

On-Device Multilingual Text-to-Speech System

ONNX Runtime

7.1k