Tags: #diffusion-models

Visual AI Workflow Engine
108.5k

Comfy-Org/ComfyUI

A powerful and modular visual interface for designing and executing advanced Stable Diffusion and other generative AI model pipelines.

AI/ML Training Platform
DeepSpeed
2.8k

bghira/SimpleTuner

A user-friendly, versatile fine-tuning kit for image, video, and audio diffusion models, emphasizing simplicity and cutting-edge features.

AI/ML Model Fine-tuning Tool
conda
2.0k

adobe-research/custom-diffusion

Enables fast and efficient multi-concept customization of text-to-image diffusion models like Stable Diffusion using a few images.

AI Inference Engine & Optimization Library
comfyui
3.8k

nunchaku-ai/nunchaku

Nunchaku is a high-performance inference engine that optimizes 4-bit neural networks, especially diffusion models, for speed and efficiency.

AI/ML Model Fine-tuning Tool
pytorch
7.5k

cloneofsimo/lora

A tool for fast and efficient fine-tuning of diffusion models using Low-rank Adaptation (LoRA), producing small, shareable models.

AI/ML Library
python
1.7k

zai-org/ImageReward

A human preference reward model for evaluating and improving text-to-image generation models.

Machine Learning Library
pytorch
33.4k

huggingface/diffusers

A modular PyTorch library for state-of-the-art diffusion models, enabling easy generation of images, audio, and more.

3D Content Generation Framework
Python
8.8k

ashawkey/stable-dreamfusion

A PyTorch implementation of Dreamfusion, enabling text-to-3D and image-to-3D content generation using NeRF and Stable Diffusion.

Research Repository
3.3k

YangLing0818/Diffusion-Models-Papers-Survey-Taxonomy

A comprehensive collection and categorization of diffusion model papers, accompanied by a detailed survey and taxonomy.

Audio Synthesis Framework
Python
4.8k

MoonInTheRiver/DiffSinger

DiffSinger is an official PyTorch implementation of a singing voice synthesis (SVS) and text-to-speech (TTS) system, leveraging a shallow diffusion mechanism for high-quality audio generation.

AI/ML Model, Speech Synthesis Library
python
6.2k

yl4579/StyleTTS2

StyleTTS 2 is a text-to-speech model that achieves human-level speech synthesis by leveraging style diffusion and adversarial training with large speech language models.

AI Inference Engine / Command-Line Tool
ggml
5.8k

leejet/stable-diffusion.cpp

A lightweight, pure C/C++ inference engine for various diffusion models, enabling efficient image and video generation across multiple platforms and hardware.

Curated Research Collection
2.2k

wangkai930418/awesome-diffusion-categorized

A meticulously categorized collection of research papers on diffusion models, organized by diverse subareas such as visual illusion, color in generation, image restoration, and text-guided editing.

OSS Alternative

Explore the best open source alternatives to commercial software.

© 2026 OSS Alternative. hotgithub.com - All rights reserved.