Ecosystem & Stack: pytorch

AI/ML Fine-tuning and Deployment Framework

13.9k

modelscope/ms-swift

A scalable and lightweight infrastructure for fine-tuning, inference, and deployment of over 1000 large language models (LLMs) and multimodal large language models (MLLMs) using advanced techniques.

llm mllm fine-tuning

llm-serving high-performance multimodal-ai

AI/ML Serving Framework

NVIDIA GPUs

26.4k

sgl-project/sglang

SGLang is a high-performance serving framework for large language models and multimodal models, optimizing inference throughput and latency.

Machine Learning Library

transformers

21.0k

huggingface/peft

A state-of-the-art library for Parameter-Efficient Fine-Tuning (PEFT) of large pretrained models, drastically reducing computational and storage costs.

peft fine-tuning llm

LLM Inference and Serving Engine

78.1k

vllm-project/vllm

vLLM is a high-throughput and memory-efficient open-source library designed for fast and easy serving of large language models.

llm inference serving

Deep Learning Framework

deep-learning pytorch ml-framework

31.1k

Lightning-AI/pytorch-lightning

Streamlines complex deep learning engineering, enabling scalable AI model training and finetuning across diverse hardware with minimal code changes.

AI/ML Training Engine

llm moe models training engine

5.1k

InternLM/xtuner

A next-generation training engine optimized for ultra-large Mixture-of-Experts (MoE) models, offering superior efficiency and scalability.

Educational Code Repository

llm pytorch deep-learning

91.8k

rasbt/LLMs-from-scratch

An educational project providing step-by-step code to build a ChatGPT-like Large Language Model (LLM) from scratch using PyTorch.

AI Model Utility

llm censorship-removal ai-safety

20.5k

p-e-w/heretic

Heretic is an AI model utility that automatically removes censorship and safety alignment from transformer-based language models without requiring expensive post-training.

AI Agent Learning and Career Guide

langchain

4.2k

adongwanai/AgentGuide

A comprehensive, job-oriented guide for AI Agent development, covering core technologies, practical projects, and interview preparation for LLM-related roles.

ai-agent llm rag

LLM Serving Platform

llm serving kvcache disaggregated architecture

5.2k

kvcache-ai/Mooncake

A KVCache-centric disaggregated architecture for high-performance LLM serving, powering leading AI services.

AI/ML Experiment Tracking and Visualization Platform

ai-training mlops experiment-tracking

3.9k

SwanHubX/SwanLab

SwanLab is an open-source, modern-design platform for tracking, visualizing, and analyzing AI/ML training experiments, supporting cloud and self-hosted deployments.

Replaces:

Weights & Biases Comet ML...

kubernetes ai-training llm-finetuning

Distributed AI Training Platform

Kubernetes

2.1k

kubeflow/trainer

A Kubernetes-native platform for scalable distributed AI model training and LLM fine-tuning across various frameworks.

Technical Guide & Knowledge Base

cloud computing

17.8k

stas00/ml-engineering

An open collection of methodologies, tools, and step-by-step instructions for successful training, fine-tuning, and inference of large language and multi-modal models.

ml-engineering llm vlm

AI Data Management Platform

ai machine-learning data-lake

9.1k

activeloopai/deeplake

Deep Lake is an AI data runtime and database optimized for deep learning, offering serverless multimodal data storage, scalable retrieval, and training capabilities.

mlops ai platform cloud native

AI/MLOps Platform

kubernetes

5.0k

tencentmusic/cube-studio

An open-source, cloud-native, all-in-one MLOps platform designed for the full lifecycle management of machine learning, deep learning, and large language model development and deployment.

Replaces:

AWS SageMaker Google Cloud AI Platform...

Deep Learning Library

8.2k

bitsandbytes-foundation/bitsandbytes

A PyTorch library enabling accessible large language models through k-bit quantization, significantly reducing memory consumption for both inference and training.

pytorch quantization llm

AI/ML Deep Learning Framework

low-code llm deep-learning

11.7k

ludwig-ai/ludwig

A low-code, declarative framework for building and deploying custom large language models (LLMs) and other deep neural networks with ease and efficiency.

AI Inference and Deployment Toolkit

ai inference deep learning model optimization

10.1k

openvinotoolkit/openvino

OpenVINO is an open-source toolkit designed to optimize and deploy deep learning models for efficient AI inference across a wide range of hardware platforms.

Educational Curriculum

ai machine-learning education

46.7k

microsoft/AI-For-Beginners

A 12-week, 24-lesson curriculum from Microsoft to learn Artificial Intelligence for beginners, including practical lessons, quizzes, and labs.

Machine Learning Data Library

ai machine learning datasets

21.5k

huggingface/datasets

A lightweight library providing one-line dataloaders and efficient pre-processing tools for a vast hub of AI datasets, supporting various ML frameworks.

Python Library for Multimodal AI Data

multimodal-data machine-learning data-structure

3.1k

docarray/docarray

A Python library for representing, transmitting, storing, and retrieving multimodal data, designed for AI applications.

LLM Fine-tuning Framework

llm-fine-tuning open-source-llms private-llms

2.7k

stochasticai/xTuring

xTuring simplifies the process of fine-tuning and deploying open-source Large Language Models (LLMs) on private data, ensuring privacy, efficiency, and scalability.

Deep Learning Adaptation Framework

segmentation computer-vision pytorch

1.5k

tianrun-chen/SAM-Adapter-PyTorch

A PyTorch-based framework to adapt Meta AI's Segment Anything Model (SAM) for improved performance on challenging downstream computer vision tasks using adapters and prompts.

llm quantization compression

LLM Optimization Toolkit

huggingface

1.1k

ModelCloud/GPTQModel

A toolkit for quantizing (compressing) Large Language Models (LLMs) with hardware acceleration across various GPUs and CPUs, integrating with popular inference frameworks.

multimodal-llm deep-learning speech-processing

Deep Learning Toolkit / Multimodal LLM Framework

linux

1.0k

X-LANCE/SLAM-LLM

A deep learning toolkit for training custom multimodal large language models focused on speech, language, audio, and music processing.

Educational Resource / Course Code Repository

huggingface transformers nlp

3.9k

zyds/transformers-code

A comprehensive code repository accompanying a hands-on course for mastering Huggingface Transformers, covering fundamental concepts to advanced fine-tuning and deployment techniques.

LLM Finetuning UI Tool

2.1k

lxe/simple-llm-finetuner

A beginner-friendly UI for fine-tuning large language models (LLMs) using the LoRA method on commodity NVIDIA GPUs.

llm-finetuning lora peft

Large Language Model Finetuning Solution

3.8k

mymusise/ChatGLM-Tuning

A cost-effective solution for finetuning ChatGLM-6B with LoRA, enabling personalized large language models.

llm finetuning lora

Replaces:

ChatGPT

Deep Learning Library Extension

nlp deep learning transfer learning

2.8k

adapter-hub/adapters

A unified library extending HuggingFace Transformers for parameter-efficient and modular transfer learning in NLP.

LLM Inference Optimization Library

llm inference gpu optimization memory efficiency

17.0k

lyogavin/airllm

Optimizes large language model inference to run 70B models on a single 4GB GPU without quantization, enabling efficient deployment on resource-constrained hardware.

Educational Resource / Deep Learning Implementations

deep-learning pytorch machine-learning

66.5k

labmlai/annotated_deep_learning_paper_implementations

A comprehensive collection of PyTorch implementations for over 60 deep learning papers, featuring side-by-side annotated notes for enhanced understanding.

stable-diffusion google-colab ai-art-generation

Cloud-based AI Art Generation Utility

Google Colab

15.9k

camenduru/stable-diffusion-webui-colab

Provides Google Colab notebooks for easily deploying and running Stable Diffusion WebUI, enabling AI-powered image generation and training without local hardware.

Replaces:

Midjourney DALL-E...

Machine Learning Library

13.5k

microsoft/LoRA

A Python library implementing LoRA (Low-Rank Adaptation) to efficiently fine-tune large language models by significantly reducing trainable parameters and storage requirements.

lora llm fine-tuning

llm chinese-nlp open-source

Open-source Large Language Model Framework

Hugging Face

8.3k

LianjiaTech/BELLE

BELLE is an open-source project dedicated to fostering the development of Chinese conversational large language models, aiming to make LLMs accessible to everyone.

Replaces:

ChatGPT

Large Language Model (LLM)

ChatGPT API Commercial LLM APIs

2.5k

wenge-research/YAYI

YaYi is an open-source Chinese Large Language Model, built on LLaMA 2 & BLOOM, designed for secure, reliable, and domain-specific applications through extensive instruction tuning.

llm chinese-nlp llama2

Replaces:

AI/ML Training Framework

multi-modal llm alignment

4.7k

PKU-Alignment/align-anything

A modular framework for aligning any-modality large models with human intentions and values using various fine-tuning and reinforcement learning methods.

AI/ML Library

text-to-image reward model human preference

1.7k

zai-org/ImageReward

A human preference reward model for evaluating and improving text-to-image generation models.

Machine Learning Research Toolkit

rlhf reward modeling large language models

1.5k

RLHFlow/RLHF-Reward-Modeling

A comprehensive collection of recipes and code for training various reward models crucial for Reinforcement Learning from Human Feedback (RLHF) in large language models.

LLM Alignment Framework

1.4k

An open-source framework providing code, models, and insights for stable Reinforcement Learning from Human Feedback (RLHF) training in Large Language Models, focusing on the PPO algorithm and reward modeling.

llm rlhf ppo

diffusion models generative ai pytorch

Machine Learning Library

PyTorch

33.5k

huggingface/diffusers

A modular PyTorch library for state-of-the-art diffusion models, enabling easy inference and training for image, video, and audio generation.

AI-Powered Image Editing Tool

ai image-editing inpainting

23.0k

Sanster/IOPaint

An open-source, AI-driven tool for advanced image inpainting, outpainting, object removal, and replacement using state-of-the-art models.

Replaces:

Photoshop

blender stable-diffusion ai-art-generation

AI-powered 3D Design Plugin

blender

8.2k

carson-katri/dream-textures

Integrates Stable Diffusion directly into Blender for seamless AI-powered texture generation, concept art creation, and image manipulation within 3D workflows.

AI/ML 3D Content Generation Library

3d generation text-to-3d image-to-3d

8.8k

ashawkey/stable-dreamfusion

A PyTorch implementation for generating 3D models from text or images, leveraging NeRF and diffusion models like Stable Diffusion.

Text-to-Image Generation Library

3.5k

kuprel/min-dalle

A fast, minimal PyTorch port of DALL·E Mini for efficient text-to-image generation.

pytorch text-to-image ai

imagen text-to-image pytorch

AI Model Implementation / Deep Learning Library

Pytorch

8.4k

lucidrains/imagen-pytorch

A PyTorch implementation of Google's Imagen, a state-of-the-art text-to-image neural network, enabling advanced generative AI capabilities.

Replaces:

DALL-E2

dall-e-2 pytorch text-to-image

AI/ML Library

Pytorch

11.3k

lucidrains/DALLE2-pytorch

A PyTorch implementation of OpenAI's DALL-E 2, a state-of-the-art neural network for text-to-image synthesis.

Replaces:

OpenAI DALL-E 2

AI Model Implementation

dall-e pytorch text-to-image

5.6k

lucidrains/DALLE-pytorch

An open-source PyTorch implementation and replication of OpenAI's DALL-E, a text-to-image transformer, including CLIP for generation ranking.

Replaces:

OpenAI DALL-E

AI Model Fine-tuning Tool

dreambooth stable-diffusion fine-tuning

7.7k

XavierXiao/Dreambooth-Stable-Diffusion

This project implements Google's Dreambooth technique on Stable Diffusion, enabling users to fine-tune a text-to-image model with a few custom examples for personalized image generation.

Speech AI Framework

generative ai llm speech ai

17.2k

NVIDIA-NeMo/NeMo

A scalable generative AI framework for researchers and developers focused on Large Language Models, Multimodal, and Speech AI (ASR, TTS).

Speech Synthesis Library

5.9k

snakers4/silero-models

A collection of pre-trained, end-to-end text-to-speech models designed for simplicity, speed, and natural-sounding speech across multiple languages.

text-to-speech tts ai

Replaces:

Text-to-Speech (TTS) Model

tts speech-synthesis bert

8.7k

fishaudio/Bert-VITS2

An open-source text-to-speech model that combines the VITS2 backbone with multilingual BERT for high-quality, multi-language speech synthesis.

Content Creation Tool

text-to-speech audiobook-generator epub-pdf-converter

4.3k

denizsafak/abogen

Generate high-quality audiobooks and voiceovers from various text formats with synchronized captions.

AI Voice Cloning Toolkit

voice-cloning text-to-speech real-time

36.9k

babysor/MockingBird

A real-time voice cloning toolkit that allows users to replicate a voice in 5 seconds and generate arbitrary speech.

Replaces:

ElevenLabs Google Cloud Text-to-Speech...

Audiobook Generation Tool

audiobook-generation text-to-speech epub-converter

6.4k

santinic/audiblez

A Python-based tool to convert e-books (EPUB) into high-quality M4B audiobooks using advanced text-to-speech models.

Replaces:

Audible Commercial Audiobook Services

AI Voice Synthesis WebUI

text-to-speech voice-cloning few-shot-learning

57.1k

RVC-Boss/GPT-SoVITS

A powerful web-based tool for few-shot voice cloning and text-to-speech, enabling high-quality voice generation from minimal audio data.

Replaces:

Commercial Text-to-Speech Services Voice Cloning Software

fastapi text-to-speech docker

Text-to-Speech API Server

Docker

4.8k

remsky/Kokoro-FastAPI

A Dockerized FastAPI wrapper for the Kokoro-82M text-to-speech model, offering multi-language support, CPU/GPU inference, and an OpenAI-compatible API.

Replaces:

OpenAI Speech API

Multimodal AI Data Platform

multimodal ai data infrastructure python

1.5k

pixeltable/pixeltable

A declarative, transactional Python library for building multimodal AI applications with incremental data storage, transformation, indexing, and orchestration.

AI Model Evaluation Framework

multimodal-ai llm-evaluation benchmarking

4.1k

EvolvingLMMs-Lab/lmms-eval

A unified, reproducible, and efficient multimodal evaluation toolkit for large language models across text, image, video, and audio tasks.

speech synthesis sound generation text-to-speech

AI Speech and Sound Generation Framework

llama.cpp

1.5k

OpenMOSS/MOSS-TTS

An open-source AI model family for high-fidelity, expressive speech and sound generation across diverse real-world applications.

Replaces:

Deep Learning Library

1.9k

kyegomez/BitNet

A PyTorch implementation of BitNet, enabling highly efficient 1-bit transformers for large language models.

pytorch llm quantization

AI/ML Research Framework

multimodal-ai vision-language pytorch

5.6k

facebookresearch/mmf

A modular and scalable PyTorch-based framework for state-of-the-art vision and language multimodal research from Facebook AI Research.

AI-powered Multimodal Data Extraction Library

data extraction document processing vlm

1.5k

emcf/thepipe

A Python library for extracting clean markdown, multimodal media, and structured data from complex documents using vision-language models.

face-recognition deep-learning paddlepaddle

Deep Learning Library / Computer Vision Library

PaddlePaddle

3.6k

ZhaoJ9014/face.evoLVe

A high-performance, comprehensive face recognition library built on PaddlePaddle and PyTorch.

AI/NLP Model Framework

3.1k

dbiir/UER-py

An open-source PyTorch-based framework for NLP pre-training and fine-tuning, offering modularity, reproducibility, and a comprehensive model zoo for various downstream tasks.

nlp pre-training pytorch

Large Language Model Finetuning Toolkit

DeepSpeed

2.8k

liucongg/ChatGLM-Finetuning

A toolkit for finetuning ChatGLM series models (ChatGLM-6B, ChatGLM2-6B, ChatGLM3-6B) using various methods like Freeze, Lora, P-tuning, and full parameter training for downstream NLP tasks.

chatglm finetuning llm

LLM Fine-tuning Platform

llm instruction-tuning parameter-efficient-tuning

2.8k

PhoebusSi/Alpaca-CoT

A unified platform simplifying instruction-tuning for Large Language Models by integrating diverse data, LLMs, and parameter-efficient methods.

AI Voice Cloning Framework

voice cloning text-to-speech ai

36.5k

myshell-ai/OpenVoice

An open-source AI model for instant, accurate, and flexible voice cloning, supporting cross-lingual synthesis and granular style control.

Audio Synthesis Framework

singing-voice-synthesis text-to-speech diffusion-models

4.8k

MoonInTheRiver/DiffSinger

DiffSinger is an official PyTorch implementation of a singing voice synthesis (SVS) and text-to-speech (TTS) system, leveraging a shallow diffusion mechanism for high-quality audio generation.

AI Text-to-Speech Engine

Docker

8.5k

netease-youdao/EmotiVoice

EmotiVoice is an open-source, multi-voice, and prompt-controlled text-to-speech engine supporting English and Chinese with emotional synthesis capabilities.

text-to-speech tts ai

Replaces:

AI/ML Model & Speech Synthesis Library

6.2k

yl4579/StyleTTS2

StyleTTS 2 is a cutting-edge text-to-speech model achieving human-level speech synthesis through style diffusion and adversarial training with large speech language models.

text-to-speech tts ai

tts voice cloning deep learning

Text-to-Speech (TTS) Foundational Model

Docker

4.2k

metavoiceio/metavoice-src

MetaVoice-1B is an open-source 1.2B parameter foundational model for highly expressive, human-like text-to-speech synthesis with advanced voice cloning capabilities.

Replaces:

AI Model Implementation / Text-to-Speech System

tts voice-cloning multilingual

7.9k

Plachtaa/VALL-E-X

An open-source implementation of Microsoft's VALL-E X, enabling zero-shot multilingual text-to-speech synthesis and voice cloning with emotion control.

Replaces:

ElevenLabs Commercial Text-to-Speech (TTS) services

Deep Learning Library

text-to-speech deep-learning speech-synthesis

10.1k

mozilla/TTS

A deep learning library for advanced, high-quality, and efficient Text-to-Speech (TTS) synthesis, supporting multiple languages and models.

AI Model Fine-tuning Library

stable-diffusion lora fine-tuning

2.5k

KohakuBlueleaf/LyCORIS

LyCORIS is a library implementing various parameter-efficient fine-tuning (PEFT) algorithms for Stable Diffusion, extending beyond conventional LoRA methods to enhance model adaptation.

AI Video Generation Tool

stable-diffusion ai-video-generation text-to-video

4.7k

nateraw/stable-diffusion-videos

Create dynamic and visually captivating videos by smoothly morphing between different text prompts using Stable Diffusion.

video-upscaling image-upscaling comfyui-node

AI Video Upscaling Tool

ComfyUI

2.4k

numz/ComfyUI-SeedVR2_VideoUpscaler

Official SeedVR2 Video Upscaler for ComfyUI, enabling high-quality video and image upscaling, also runnable as a standalone CLI.

llm architecture recurrent transformer mixture of experts

Deep Learning Library / LLM Architecture Implementation

PyTorch

12.1k

kyegomez/OpenMythos

An open-source, theoretical reconstruction of the Claude Mythos LLM architecture, featuring a Recurrent-Depth Transformer and sparse Mixture of Experts for advanced reasoning.

Deep Learning Project Template

pytorch-lightning hydra deep-learning

5.2k

ashleve/lightning-hydra-template

A user-friendly template integrating PyTorch Lightning and Hydra to streamline deep learning experimentation and development.

LLM Inference Engine

llm inference deep learning python

13.1k

GeeeekExplorer/nano-vllm

A lightweight and optimized Python library for fast offline large language model inference, offering comparable or better performance than vLLM with a more readable codebase.

LLM Application Development Library

4.1k

SylphAI-Inc/AdalFlow

AdalFlow is a PyTorch-like open-source library designed to build and automatically optimize large language model (LLM) applications, from chatbots and RAG systems to complex AI agents.

llm rag agents

Large Language Model (LLM) Development Toolkit

llm finetuning deep-learning

13.3k

Lightning-AI/litgpt

A high-performance, no-abstraction toolkit providing recipes for pretraining, finetuning, and deploying over 20 large language models at scale.

Machine Learning Framework

deep-learning pytorch computer-vision

3.8k

open-mmlab/mmpretrain

MMPreTrain is an OpenMMLab project providing a comprehensive, open-source PyTorch-based toolbox for pre-training and benchmarking various computer vision and multi-modal models.

Deep Learning Architecture Library

PyTorch

3.1k

microsoft/torchscale

A PyTorch library providing advanced foundation architectures to efficiently and effectively scale Transformers for large language models and general-purpose AI.

pytorch transformers llm

AI Art Generation Library/Framework

ai art generation text-to-image disco diffusion

3.8k

jina-ai/discoart

Create stunning Disco Diffusion artworks with a single line of Python code, offering a professional API and robust integration capabilities.

AI/ML Library & Generative Audio Tool

ai music-generation stable-diffusion

3.9k

riffusion/riffusion-hobby

A library for real-time music and audio generation leveraging stable diffusion, offering CLI, interactive app, and API capabilities.

text-to-speech tts multilingual

Text-to-Speech Model

onnx

3.0k

OpenMOSS/MOSS-TTS-Nano

MOSS-TTS-Nano is an open-source, multilingual, tiny speech generation model optimized for real-time CPU inference and lightweight integration.