Ecosystem & Stack: ffmpeg
hacksider/Deep-Live-Cam
Deep-Live-Cam enables real-time face swapping and one-click video deepfake generation using just a single image.
SamurAIGPT/AI-Youtube-Shorts-Generator
An AI-powered Python tool that automatically generates engaging YouTube Shorts from long-form videos by identifying viral-worthy moments and vertically cropping them.
CorentinJ/Real-Time-Voice-Cloning
A deep learning framework for real-time voice cloning and text-to-speech synthesis from short audio samples.
babysor/MockingBird
A real-time voice cloning toolkit that allows users to replicate a voice in 5 seconds and generate arbitrary speech.
santinic/audiblez
A Python-based tool to convert e-books (EPUB) into high-quality M4B audiobooks using advanced text-to-speech models.
emcf/thepipe
A Python library for extracting clean markdown, multimodal media, and structured data from complex documents using vision-language models.
jianchang512/ChatTTS-ui
Provides a local web interface and API for ChatTTS to synthesize text into speech, supporting mixed Chinese, English, and numbers.
jianchang512/clone-voice
A user-friendly, open-source tool that clones any human voice to generate speech from text or convert existing audio, featuring a web interface and multi-language support.
metavoiceio/metavoice-src
MetaVoice-1B is an open-source 1.2B parameter foundational model for highly expressive, human-like text-to-speech synthesis with advanced voice cloning capabilities.
Plachtaa/VALL-E-X
An open-source implementation of Microsoft's VALL-E X, enabling zero-shot multilingual text-to-speech synthesis and voice cloning with emotion control.
numz/ComfyUI-SeedVR2_VideoUpscaler
Official SeedVR2 Video Upscaler for ComfyUI, enabling high-quality video and image upscaling, also runnable as a standalone CLI.
mayeaux/nodetube
An open-source, self-hostable alternative to YouTube, offering video, audio, and image uploads, livestreaming, and built-in monetization features.
3b1b/manim
A Python-based animation engine for creating precise, explanatory mathematical videos.
riffusion/riffusion-hobby
A library for real-time music and audio generation leveraging stable diffusion, offering CLI, interactive app, and API capabilities.