Tags: #diffusion-models

Nunchaku is a high-performance AI inference engine that optimizes 4-bit neural networks, especially diffusion models, for faster and more memory-efficient execution.

ai-inference quantization diffusion-models

Details

AI/ML Fine-tuning Tool

diffusers

7.5k

Enables rapid and efficient fine-tuning of diffusion models, particularly Stable Diffusion, using Low-rank Adaptation (LoRA) to generate high-quality, custom images with significantly smaller model sizes.

lora diffusion models fine-tuning

Details

AI/ML Library

python

1.7k

zai-org/ImageReward

A human preference reward model for evaluating and improving text-to-image generation models.

text-to-image reward model human preference

Details

Machine Learning Library

PyTorch

33.5k

huggingface/diffusers

A modular PyTorch library for state-of-the-art diffusion models, enabling easy inference and training for image, video, and audio generation.

diffusion models generative ai pytorch

Details

AI/ML 3D Content Generation Library

python

8.8k

ashawkey/stable-dreamfusion

A PyTorch implementation for generating 3D models from text or images, leveraging NeRF and diffusion models like Stable Diffusion.

3d generation text-to-3d image-to-3d

Details

Research Paper Collection & Taxonomy

3.3k

YangLing0818/Diffusion-Models-Papers-Survey-Taxonomy

A comprehensive and continuously updated collection and categorization of research papers on diffusion models.

diffusion models generative ai machine learning

Details

AI/ML Library

Pytorch

11.3k

lucidrains/DALLE2-pytorch

A PyTorch implementation of OpenAI's DALL-E 2, a state-of-the-art neural network for text-to-image synthesis.

dall-e-2 pytorch text-to-image

Replaces:

OpenAI DALL-E 2

Details

Audio Synthesis Framework

Python

4.8k

MoonInTheRiver/DiffSinger

DiffSinger is an official PyTorch implementation of a singing voice synthesis (SVS) and text-to-speech (TTS) system, leveraging a shallow diffusion mechanism for high-quality audio generation.

singing-voice-synthesis text-to-speech diffusion-models

Details

AI/ML Model & Speech Synthesis Library

Python

6.2k

yl4579/StyleTTS2

StyleTTS 2 is a cutting-edge text-to-speech model achieving human-level speech synthesis through style diffusion and adversarial training with large speech language models.

text-to-speech tts ai

Details

AI Inference Runtime

ggml

5.9k

leejet/stable-diffusion.cpp

A pure C/C++ implementation for efficient, cross-platform inference of various diffusion models, including Stable Diffusion, FLUX, Wan, and Qwen Image.

diffusion-models c-cpp ai-ml

Details

Curated Research List

2.2k

wangkai930418/awesome-diffusion-categorized

A meticulously categorized collection of research papers on diffusion models, spanning various subareas from visual illusions to image restoration and text-guided editing.

diffusion models research papers computer vision

Details

Tags: #diffusion-models

bghira/SimpleTuner

Nerogar/OneTrainer

adobe-research/custom-diffusion

nunchaku-ai/nunchaku

cloneofsimo/lora

zai-org/ImageReward

huggingface/diffusers

ashawkey/stable-dreamfusion

YangLing0818/Diffusion-Models-Papers-Survey-Taxonomy

lucidrains/DALLE2-pytorch

MoonInTheRiver/DiffSinger

yl4579/StyleTTS2

leejet/stable-diffusion.cpp

wangkai930418/awesome-diffusion-categorized