Tags: #voice-cloning
jamiepine/voicebox
An open-source, local-first AI voice studio offering voice cloning, speech generation, and dictation with complete privacy.
FunAudioLLM/CosyVoice
CosyVoice is an advanced multi-lingual large language model-based text-to-speech system offering state-of-the-art voice generation, cloning, and full-stack deployment capabilities.
DrewThomasson/ebook2audiobook
Generate audiobooks from e-books with advanced text-to-speech, voice cloning, and extensive language support.
OpenBMB/VoxCPM
VoxCPM2 is a tokenizer-free, 2B-parameter Text-to-Speech system supporting 30 languages, creative voice design, and controllable voice cloning with 48kHz studio-quality audio output.
CorentinJ/Real-Time-Voice-Cloning
A deep learning framework for real-time voice cloning and text-to-speech synthesis from short audio samples.
babysor/MockingBird
A real-time voice cloning toolkit that allows users to replicate a voice in 5 seconds and generate arbitrary speech.
RVC-Boss/GPT-SoVITS
A powerful web-based tool for few-shot voice cloning and text-to-speech, enabling high-quality voice generation from minimal audio data.
WhisperSpeech/WhisperSpeech
An open-source, high-performance text-to-speech (TTS) system built by inverting OpenAI Whisper, aiming to be the Stable Diffusion for speech.
OpenMOSS/MOSS-TTS
An open-source AI model family for high-fidelity, expressive speech and sound generation across diverse real-world applications.
canopyai/Orpheus-TTS
Orpheus TTS is a state-of-the-art open-source text-to-speech system built on a Llama-3b backbone, aiming to generate human-sounding, emotionally rich speech with low latency.
abus-aikorea/voice-pro
An AI-powered web application for comprehensive multimedia content creation, offering speech recognition, voice cloning, text-to-speech, and multilingual translation.
jianchang512/clone-voice
A user-friendly, open-source tool that clones any human voice to generate speech from text or convert existing audio, featuring a web interface and multi-language support.
myshell-ai/OpenVoice
An open-source AI model for instant, accurate, and flexible voice cloning, supporting cross-lingual synthesis and granular style control.
coqui-ai/TTS
A deep learning toolkit for advanced, multi-language Text-to-Speech generation and voice cloning, suitable for research and production.
metavoiceio/metavoice-src
MetaVoice-1B is an open-source 1.2B parameter foundational model for highly expressive, human-like text-to-speech synthesis with advanced voice cloning capabilities.
Plachtaa/VALL-E-X
An open-source implementation of Microsoft's VALL-E X, enabling zero-shot multilingual text-to-speech synthesis and voice cloning with emotion control.