Ecosystem & Stack: ffmpeg

AI-powered Deepfake and Face Swap Software
python
87.0k

hacksider/Deep-Live-Cam

Deep-Live-Cam enables real-time face swapping and one-click video deepfakes using just a single image, designed for creative AI media generation.

CLI Tool
python
3.2k

SamurAIGPT/AI-Youtube-Shorts-Generator

Automates YouTube Shorts generation from long videos using AI for highlights, subtitles, and vertical cropping.

Deep Learning Framework / AI Tool
Python
59.6k

CorentinJ/Real-Time-Voice-Cloning

Clone a voice in 5 seconds to generate arbitrary speech in real-time using a three-stage deep learning framework.

Voice Cloning and Speech Synthesis Toolkit
Python
36.9k

babysor/MockingBird

A powerful open-source toolkit for real-time voice cloning and arbitrary speech generation from text.

CLI Tool & Desktop Application
Python
6.0k

santinic/audiblez

Generate high-quality audiobooks in .m4b format from .epub e-books using advanced text-to-speech technology, with both command-line and graphical interfaces.

AI-powered Multimodal Data Extraction Library
python
1.5k

emcf/thepipe

A Python library for extracting clean markdown, multimodal media, and structured data from complex documents using vision-language models.

Local Web Interface for Text-to-Speech
Python
7.5k

jianchang512/ChatTTS-ui

Provides a local web interface and API for the ChatTTS model, enabling text-to-speech synthesis with support for mixed languages and numbers.

AI Voice Cloning and Synthesis Tool
python
9.0k

jianchang512/clone-voice

A user-friendly web-based tool for voice cloning, text-to-speech, and speech-to-speech conversion, leveraging the Coqui XTTS_v2 model with multi-language support.

AI Text-to-Speech (TTS) Model
Docker
4.2k

metavoiceio/metavoice-src

MetaVoice-1B is an open-source, 1.2B parameter foundational model for highly expressive, human-like text-to-speech synthesis and zero-shot voice cloning.

AI/ML Model Implementation
Python
8.0k

Plachtaa/VALL-E-X

An open-source implementation of Microsoft's VALL-E X, enabling zero-shot multilingual text-to-speech synthesis and voice cloning with emotion control.

AI Video Upscaling Tool
ComfyUI
2.4k

numz/ComfyUI-SeedVR2_VideoUpscaler

Official SeedVR2 Video Upscaler for ComfyUI, enabling high-quality video and image upscaling, also runnable as a standalone CLI.

Media Hosting & Streaming Platform
Node.js
2.4k

mayeaux/nodetube

An open-source, self-hostable alternative to YouTube, offering video, audio, and image uploads, livestreaming, and built-in monetization features.

Replaces:
Details
Programmatic Animation Engine
Python
86.1k

3b1b/manim

A Python-based animation engine for creating precise, explanatory mathematical videos.

Video Rendering Framework
Node.js
5.2k

heygen-com/hyperframes

An open-source framework for creating and rendering HTML-based video compositions, optimized for AI agent-driven workflows.

OSS Alternative

Explore the best open source alternatives to commercial software.

© 2026 OSS Alternative. hotgithub.com - All rights reserved.