Tags: #diffusion-models
bghira/SimpleTuner
A user-friendly, versatile fine-tuning kit for image, video, and audio diffusion models, emphasizing simplicity and cutting-edge features.
Nerogar/OneTrainer
A comprehensive, one-stop solution for training various diffusion models with advanced features and a user-friendly interface.
adobe-research/custom-diffusion
Enables fast and efficient multi-concept customization of text-to-image diffusion models like Stable Diffusion from only a few images.
nunchaku-ai/nunchaku
A high-performance inference engine for 4-bit quantized neural networks, especially diffusion models, that makes execution faster and more memory-efficient.
cloneofsimo/lora
Enables rapid and efficient fine-tuning of diffusion models, particularly Stable Diffusion, using Low-Rank Adaptation (LoRA), producing high-quality custom images from significantly smaller fine-tuned model files.
zai-org/ImageReward
A human preference reward model for evaluating and improving text-to-image generation models.
huggingface/diffusers
A modular PyTorch library for state-of-the-art diffusion models, enabling easy inference and training for image, video, and audio generation.
ashawkey/stable-dreamfusion
A PyTorch implementation for generating 3D models from text or images, leveraging NeRF and diffusion models like Stable Diffusion.
YangLing0818/Diffusion-Models-Papers-Survey-Taxonomy
A comprehensive and continuously updated collection and categorization of research papers on diffusion models.
lucidrains/DALLE2-pytorch
A PyTorch implementation of OpenAI's DALL-E 2, a state-of-the-art neural network for text-to-image synthesis.
MoonInTheRiver/DiffSinger
The official PyTorch implementation of DiffSinger, a singing voice synthesis (SVS) and text-to-speech (TTS) system that uses a shallow diffusion mechanism for high-quality audio generation.
yl4579/StyleTTS2
StyleTTS 2 is a text-to-speech model that targets human-level speech synthesis through style diffusion and adversarial training with large speech language models.
leejet/stable-diffusion.cpp
A pure C/C++ implementation for efficient, cross-platform inference of various diffusion models, including Stable Diffusion, FLUX, Wan, and Qwen Image.
wangkai930418/awesome-diffusion-categorized
A meticulously categorized collection of research papers on diffusion models, spanning various subareas from visual illusions to image restoration and text-guided editing.