Ecosystem & Stack: ffmpeg
hacksider/Deep-Live-Cam
Deep-Live-Cam performs real-time face swapping and one-click video deepfakes from a single image, aimed at creative AI media generation.
SamurAIGPT/AI-Youtube-Shorts-Generator
Automates YouTube Shorts generation from long videos using AI for highlights, subtitles, and vertical cropping.
CorentinJ/Real-Time-Voice-Cloning
Clone a voice in 5 seconds to generate arbitrary speech in real time using a three-stage deep learning framework.
babysor/MockingBird
An open-source toolkit for real-time voice cloning and arbitrary speech generation from text.
santinic/audiblez
Generate high-quality audiobooks in .m4b format from .epub e-books using advanced text-to-speech technology, with both command-line and graphical interfaces.
emcf/thepipe
A Python library for extracting clean markdown, multimodal media, and structured data from complex documents using vision-language models.
jianchang512/ChatTTS-ui
Provides a local web interface and API for the ChatTTS model, enabling text-to-speech synthesis with support for mixed languages and numbers.
jianchang512/clone-voice
A user-friendly web-based tool for voice cloning, text-to-speech, and speech-to-speech conversion, leveraging the Coqui XTTS_v2 model with multi-language support.
metavoiceio/metavoice-src
MetaVoice-1B is an open-source, 1.2B-parameter foundation model for highly expressive, human-like text-to-speech synthesis and zero-shot voice cloning.
Plachtaa/VALL-E-X
An open-source implementation of Microsoft's VALL-E X, enabling zero-shot multilingual text-to-speech synthesis and voice cloning with emotion control.
numz/ComfyUI-SeedVR2_VideoUpscaler
Official SeedVR2 video upscaler for ComfyUI, providing high-quality video and image upscaling; it can also run as a standalone CLI.
mayeaux/nodetube
An open-source, self-hostable alternative to YouTube, offering video, audio, and image uploads, livestreaming, and built-in monetization features.
3b1b/manim
A Python-based animation engine for creating precise, explanatory mathematical videos.
heygen-com/hyperframes
An open-source framework for creating and rendering HTML-based video compositions, optimized for AI agent-driven workflows.