Tags: #voice-cloning
jamiepine/voicebox
The open-source, local-first voice synthesis studio for voice cloning, speech generation, and audio effects.
FunAudioLLM/CosyVoice
CosyVoice is an advanced multi-lingual large language model-based text-to-speech system offering state-of-the-art voice generation, cloning, and full-stack deployment capabilities.
DrewThomasson/ebook2audiobook
A powerful tool to convert e-books into audiobooks with advanced text-to-speech, voice cloning, and extensive language support.
OpenBMB/VoxCPM
A tokenizer-free, multilingual Text-to-Speech system offering advanced voice design, controllable cloning, and high-quality audio output.
CorentinJ/Real-Time-Voice-Cloning
Clone a voice in 5 seconds to generate arbitrary speech in real-time using a three-stage deep learning framework.
babysor/MockingBird
A powerful open-source toolkit for real-time voice cloning and arbitrary speech generation from text.
RVC-Boss/GPT-SoVITS
A powerful open-source web UI for few-shot voice conversion and text-to-speech, enabling high-quality voice cloning with minimal audio data.
WhisperSpeech/WhisperSpeech
An open-source, high-performance text-to-speech system built on Whisper, aiming to be a hackable and commercially safe alternative for speech generation.
OpenMOSS/MOSS-TTS
An open-source AI model family for high-fidelity, expressive speech and sound generation across diverse real-world applications.
canopyai/Orpheus-TTS
A state-of-the-art open-source text-to-speech system leveraging LLMs to generate human-like, emotional, and low-latency speech with zero-shot voice cloning capabilities.
abus-aikorea/voice-pro
A powerful AI-powered web application for comprehensive multimedia content creation, offering advanced speech recognition, voice cloning, multilingual TTS, and YouTube video processing.
jianchang512/clone-voice
A user-friendly web-based tool for voice cloning, text-to-speech, and speech-to-speech conversion, leveraging the Coqui XTTS_v2 model with multi-language support.
myshell-ai/OpenVoice
An AI voice synthesis library offering instant, accurate, and flexible voice cloning with multi-lingual support.
coqui-ai/TTS
A deep learning toolkit for Text-to-Speech, offering pretrained models, training tools, and dataset utilities.
metavoiceio/metavoice-src
MetaVoice-1B is an open-source, 1.2B parameter foundational model for highly expressive, human-like text-to-speech synthesis and zero-shot voice cloning.
Plachtaa/VALL-E-X
An open-source implementation of Microsoft's VALL-E X, enabling zero-shot multilingual text-to-speech synthesis and voice cloning with emotion control.