Ecosystem & Stack: cuda

LLM Serving Platform
Python
5.1k

kvcache-ai/Mooncake

A KVCache-centric disaggregated architecture for high-performance LLM serving, powering leading AI services.

Deep Learning Adaptation Framework
Python
1.5k

tianrun-chen/SAM-Adapter-PyTorch

A PyTorch-based framework to adapt Meta AI's Segment Anything Model (SAM) for improved performance on challenging downstream computer vision tasks using adapters and prompts.

Deep Learning Toolkit / Multimodal LLM Framework
linux
1.0k

X-LANCE/SLAM-LLM

A deep learning toolkit for training custom multimodal large language models focused on speech, language, audio, and music processing.

AI/ML Finetuning UI
python
2.1k

lxe/simple-llm-finetuner

A beginner-friendly UI for fine-tuning language models using LoRA on commodity NVIDIA GPUs, though the project is no longer actively maintained.

AI/ML Model Finetuning Framework
python
3.8k

mymusise/ChatGLM-Tuning

A cost-effective solution for fine-tuning ChatGLM-6B using LoRA, enabling personalized large language models.

Replaces:
Details
LLM Inference Server
Docker
3.8k

predibase/lorax

A multi-LoRA inference server designed to efficiently serve thousands of fine-tuned Large Language Models on a single GPU, drastically cutting serving costs while maintaining high throughput and low latency.

Machine Learning Library
pytorch
33.4k

huggingface/diffusers

A modular PyTorch library for state-of-the-art diffusion models, enabling easy generation of images, audio, and more.

ComfyUI Custom Nodes / AI Video Generation Plugin
comfyui
3.5k

Lightricks/ComfyUI-LTXVideo

Extends ComfyUI with advanced custom nodes for the LTX-2 video generation model, enabling powerful text-to-video and image-to-video workflows.

CLI Tool
python
3.2k

SamurAIGPT/AI-Youtube-Shorts-Generator

Automates YouTube Shorts generation from long videos using AI for highlights, subtitles, and vertical cropping.

Content Creation Tool
Python
4.3k

denizsafak/abogen

Generate high-quality audiobooks and voiceovers from various text formats with synchronized captions.

AI Voice Synthesis Web Application
Python
56.8k

RVC-Boss/GPT-SoVITS

A powerful open-source web UI for few-shot voice conversion and text-to-speech, enabling high-quality voice cloning with minimal audio data.

Multimodal AI Inference and Serving Framework
python
4.4k

vllm-project/vllm-omni

vLLM-Omni is an efficient, flexible, and easy-to-use framework extending vLLM to serve omni-modality models (text, image, video, audio) with high throughput and an OpenAI-compatible API.

LLM Fine-tuning Framework
Python
2.8k

liucongg/ChatGLM-Finetuning

A comprehensive toolkit for fine-tuning ChatGLM-6B, ChatGLM2-6B, and ChatGLM3-6B models using various methods like Freeze, Lora, P-tuning, and full parameter fine-tuning.

AI Multimedia Processing Web Application
CUDA
6.6k

abus-aikorea/voice-pro

A powerful AI-powered web application for comprehensive multimedia content creation, offering advanced speech recognition, voice cloning, multilingual TTS, and YouTube video processing.

Local Web Interface for Text-to-Speech
Python
7.5k

jianchang512/ChatTTS-ui

Provides a local web interface and API for the ChatTTS model, enabling text-to-speech synthesis with support for mixed languages and numbers.

AI Voice Cloning and Synthesis Tool
python
9.0k

jianchang512/clone-voice

A user-friendly web-based tool for voice cloning, text-to-speech, and speech-to-speech conversion, leveraging the Coqui XTTS_v2 model with multi-language support.

Audio Synthesis Framework
Python
4.8k

MoonInTheRiver/DiffSinger

DiffSinger is an official PyTorch implementation of a singing voice synthesis (SVS) and text-to-speech (TTS) system, leveraging a shallow diffusion mechanism for high-quality audio generation.

AI/ML Model Implementation
Python
8.0k

Plachtaa/VALL-E-X

An open-source implementation of Microsoft's VALL-E X, enabling zero-shot multilingual text-to-speech synthesis and voice cloning with emotion control.

AI/ML Library
.net
3.6k

SciSharp/LLamaSharp

A cross-platform C#/.NET library for efficient local inference of large language models (LLMs) like LLaMA and LLAVA.

AI Generative WebUI
stable diffusion
7.1k

vladmandic/sdnext

An all-in-one open-source WebUI for AI generative image and video creation, captioning, and processing, built on Stable Diffusion.

Generative AI Educational Resource Hub
google colab
2.7k

FurkanGozukara/Stable-Diffusion

A comprehensive repository offering expert-level tutorials, guides, and courses on various Generative AI technologies, primarily focusing on Stable Diffusion and its ecosystem.

AI Video Generation Tool
Python
4.7k

nateraw/stable-diffusion-videos

Create dynamic videos by smoothly transitioning between text prompts using Stable Diffusion's latent space exploration.

AI-powered 3D Generation Node Suite
ComfyUI
3.7k

MrForExample/ComfyUI-3D-Pack

An extensive node suite that integrates cutting-edge 3D generation algorithms and models into ComfyUI, enabling seamless processing of 3D inputs like meshes and UV textures.

OSS Alternative

Explore the best open source alternatives to commercial software.

© 2026 OSS Alternative. hotgithub.com - All rights reserved.