Tags: #multilingual
FunAudioLLM/CosyVoice
CosyVoice is an advanced multilingual text-to-speech system based on a large language model, offering state-of-the-art voice generation, voice cloning, and full-stack deployment capabilities.
DrewThomasson/ebook2audiobook
A powerful tool to convert e-books into audiobooks with advanced text-to-speech, voice cloning, and extensive language support.
OpenBMB/VoxCPM
A tokenizer-free, multilingual Text-to-Speech system offering advanced voice design, controllable cloning, and high-quality audio output.
fishaudio/fish-speech
A state-of-the-art open-source multilingual text-to-speech system offering natural, expressive, and emotionally rich voice generation.
remsky/Kokoro-FastAPI
A Dockerized FastAPI wrapper for the Kokoro-82M text-to-speech model, providing a high-performance, multilingual API that runs on CPU or GPU and is compatible with OpenAI's speech endpoint.
WhisperSpeech/WhisperSpeech
An open-source, high-performance text-to-speech system built by inverting Whisper, aiming to be a hackable and commercially safe alternative for speech generation.
Plachtaa/VALL-E-X
An open-source implementation of Microsoft's VALL-E X, enabling zero-shot multilingual text-to-speech synthesis and voice cloning with emotion control.
AIGODLIKE/AIGODLIKE-ComfyUI-Translation
A ComfyUI plugin that translates the ComfyUI user interface and nodes into multiple languages, improving accessibility for users worldwide.