Tags: #deep-learning

Deep Learning Library / Machine Learning Framework
159.3k

huggingface/transformers

A unified framework providing state-of-the-art machine learning models for text, vision, audio, and multimodal tasks, optimized for both inference and training.

AI Model Fine-tuning and Deployment Framework
python
13.7k

modelscope/ms-swift

A comprehensive framework from ModelScope for efficiently fine-tuning, evaluating, and deploying over 1000 large language models and multimodal large models using advanced techniques.

Machine Learning Library
pytorch
20.9k

huggingface/peft

PEFT is a state-of-the-art library for Parameter-Efficient Fine-Tuning, drastically reducing the computational and storage costs of adapting large pretrained models.

LLM Inference and Serving Engine
python
76.3k

vllm-project/vllm

A high-throughput and memory-efficient open-source engine designed for fast, easy, and cost-effective serving of large language models.

AI/ML Fine-tuning Framework
Python
70.2k

hiyouga/LlamaFactory

A unified and efficient framework for fine-tuning over 100 large language models (LLMs) and vision-language models (VLMs) with both CLI and Web UI.

Deep Learning Framework
python
31.0k

Lightning-AI/pytorch-lightning

A deep learning framework that simplifies PyTorch development by automating boilerplate engineering code, enabling scalable training from CPU to multi-node GPUs with minimal code changes.

Deep Learning Inference Engine
14.9k

alibaba/MNN

A blazing-fast, lightweight inference engine from Alibaba, powering high-performance on-device LLMs and Edge AI.

LLM Fine-tuning Framework
python
11.7k

axolotl-ai-cloud/axolotl

A free and open-source framework designed for efficient and flexible fine-tuning of large language models.

AI Training Engine
5.1k

InternLM/xtuner

XTuner V1 is a next-generation training engine specifically designed for ultra-large-scale Mixture-of-Experts (MoE) models, offering superior efficiency and scalability.

Educational Codebase & Deep Learning Tutorial
python
91.0k

rasbt/LLMs-from-scratch

A comprehensive, step-by-step guide and codebase for building a ChatGPT-like Large Language Model from scratch using PyTorch.

Machine Learning Model Viewer
python
32.7k

lutzroeder/netron

A universal viewer for neural network, deep learning, and machine learning models, supporting a wide array of formats.

Machine Learning Interoperability Standard
python
20.7k

onnx/onnx

An open standard and format for machine learning models, enabling interoperability across AI frameworks and streamlining the path from research to production.

Educational Platform
6.1k

luhengshiwo/LLMForEverybody

A comprehensive learning platform offering structured knowledge, interview questions, and paper analysis for Large Language Models (LLMs).

Curated Resource List
2.0k

alvinunreal/awesome-opensource-ai

A meticulously curated list of battle-tested, production-proven open-source AI models, libraries, infrastructure, and developer tools.

MLOps Platform
Kubernetes
3.7k

polyaxon/polyaxon

A comprehensive MLOps platform for managing, orchestrating, and scaling the entire machine learning and deep learning lifecycle.

AI Data Management Platform
python
9.1k

activeloopai/deeplake

Deep Lake is an AI data runtime and database optimized for deep learning, offering multimodal data storage, querying, vector search, and streaming for LLM and deep learning applications.

Cloud-Native MLOps Platform
Kubernetes
4.9k

tencentmusic/cube-studio

A comprehensive, cloud-native, one-stop platform for machine learning, deep learning, and large language model development, covering the entire MLOps lifecycle.

Deep Learning Optimization Library
Python
8.1k

bitsandbytes-foundation/bitsandbytes

A PyTorch library enabling accessible large language models by dramatically reducing memory consumption through k-bit quantization for both inference and training.

Low-Code AI/ML Framework
python
11.7k

ludwig-ai/ludwig

Ludwig is a low-code, declarative deep learning framework designed to simplify the building, training, and deployment of custom AI models, including LLMs and neural networks.

Curated Resource List
2.5k

alvinreal/awesome-opensource-ai

A meticulously curated list of battle-tested, production-proven open-source AI models, libraries, infrastructure, and developer tools.

Educational Resource Collection
jupyter-notebook
20.0k

AccumulateMore/CV

A comprehensive and curated collection of deep learning study notes, integrating content from leading educators like Andrew Ng, Li Mu, and TuDui, covering CV, NLP, and Large Language Models.

AI Inference Optimization Toolkit
python
10.1k

openvinotoolkit/openvino

OpenVINO is an open-source toolkit designed to optimize and deploy deep learning models for efficient AI inference across diverse hardware platforms, from edge to cloud.

Educational Curriculum
python
46.4k

microsoft/AI-For-Beginners

A comprehensive 12-week, 24-lesson curriculum designed to introduce beginners to the fundamentals of Artificial Intelligence.

Educational Resource & Code Examples
python
22.9k

spmallick/learnopencv

A comprehensive repository offering C++ and Python code examples for computer vision, deep learning, and AI research, complementing articles on LearnOpenCV.com.

Machine Learning Toolkit
Python
21.6k

recommenders-team/recommenders

A comprehensive toolkit providing best practices and implementations for building, evaluating, and operationalizing recommendation systems.

Deep Learning Framework
python
41.4k

hpcaitech/ColossalAI

Colossal-AI makes training and deploying large AI models cheaper, faster, and more accessible through advanced distributed training techniques.

LLM Training Framework
python
2.2k

AI-Hypercomputer/maxtext

A high-performance, scalable JAX-based open-source library for training large language models on Google Cloud TPUs and GPUs.

AI Development Platform
Docker
4.9k

h2oai/h2o-llmstudio

A no-code GUI and framework for easily fine-tuning state-of-the-art large language models (LLMs).

Deep Learning Adaptation Framework
Python
1.5k

tianrun-chen/SAM-Adapter-PyTorch

A PyTorch-based framework to adapt Meta AI's Segment Anything Model (SAM) for improved performance on challenging downstream computer vision tasks using adapters and prompts.

Educational Resource
4.4k

LLMBook-zh/LLMBook-zh.github.io

A comprehensive Chinese technical book and associated course materials providing a systematic framework and roadmap for understanding Large Language Models, authored by leading experts.

LLM Optimization Toolkit
huggingface
1.1k

ModelCloud/GPTQModel

A toolkit for quantizing (compressing) Large Language Models (LLMs) with hardware acceleration across various GPUs and CPUs, integrating with popular inference frameworks.

Deep Learning Toolkit / Multimodal LLM Framework
linux
1.0k

X-LANCE/SLAM-LLM

A deep learning toolkit for training custom multimodal large language models focused on speech, language, audio, and music processing.

Educational Resource / Course Code Repository
pytorch
3.9k

zyds/transformers-code

A comprehensive code repository accompanying a hands-on course for mastering Huggingface Transformers, covering fundamental concepts to advanced fine-tuning and deployment techniques.

LLM Inference Optimization Library
Python
16.4k

lyogavin/airllm

AirLLM optimizes large language model inference memory, enabling 70B LLMs on a single 4GB GPU without quantization, and 405B Llama3.1 on 8GB VRAM.

Deep Learning Research & Education Platform
pytorch
66.3k

labmlai/annotated_deep_learning_paper_implementations

A comprehensive collection of PyTorch implementations for over 60 deep learning papers, accompanied by detailed side-by-side notes for enhanced understanding.

Machine Learning Library
PyTorch
13.4k

microsoft/LoRA

A PyTorch library implementing LoRA (Low-Rank Adaptation) to efficiently fine-tune large language models by significantly reducing trainable parameters and storage requirements.

AI Research Project
huggingface
2.7k

JIA-Lab-research/LongLoRA

LongLoRA is an efficient fine-tuning method and associated models/datasets designed to extend the context window of Large Language Models (LLMs) for processing longer inputs.

Large Language Model Series
Hugging Face Transformers
7.2k

InternLM/InternLM

A series of high-performance, cost-efficient large language models (LLMs) designed for general-purpose usage and advanced reasoning.

Replaces:
Details
Machine Learning Research Toolkit
python
1.5k

RLHFlow/RLHF-Reward-Modeling

A comprehensive collection of recipes and code for training various reward models crucial for Reinforcement Learning from Human Feedback (RLHF) in large language models.

LLM Alignment Framework
Python
1.4k

OpenLMLab/MOSS-RLHF

An open-source framework providing code, models, and insights for stable Reinforcement Learning from Human Feedback (RLHF) training in Large Language Models, focusing on the PPO algorithm and reward modeling.

AI/ML Inference Library
Python
3.5k

kuprel/min-dalle

A fast, minimal PyTorch port of DALL·E Mini, optimized for efficient text-to-image generation inference.

Replaces:
Details
Deep Learning Model Implementation
pytorch
8.4k

lucidrains/imagen-pytorch

A PyTorch implementation of Google's Imagen, a state-of-the-art text-to-image neural network that surpasses DALL-E2 in synthesis quality.

Replaces:
Details
Deep Learning Library / Generative AI Model Implementation
pytorch
11.3k

lucidrains/DALLE2-pytorch

A PyTorch implementation of OpenAI's DALL-E 2, enabling advanced text-to-image synthesis through a diffusion-based neural network architecture.

Machine Learning Library
pytorch
5.6k

lucidrains/DALLE-pytorch

An open-source PyTorch implementation of OpenAI's DALL-E, a text-to-image transformer, including CLIP for generation ranking.

Command Line Interface Tool
python
4.3k

lucidrains/deep-daze

A simple command-line tool for generating artistic images from text descriptions using OpenAI's CLIP and Siren neural networks.

AI Voice Generation Platform
Docker
29.8k

fishaudio/fish-speech

A state-of-the-art open-source multilingual text-to-speech system offering natural, expressive, and emotionally rich voice generation.

Deep Learning Framework / AI Tool
Python
59.6k

CorentinJ/Real-Time-Voice-Cloning

Clone a voice in 5 seconds to generate arbitrary speech in real-time using a three-stage deep learning framework.

Voice Cloning and Speech Synthesis Toolkit
Python
36.9k

babysor/MockingBird

A powerful open-source toolkit for real-time voice cloning and arbitrary speech generation from text.

Multimodal AI Inference and Serving Framework
python
4.4k

vllm-project/vllm-omni

vLLM-Omni is an efficient, flexible, and easy-to-use framework extending vLLM to serve omni-modality models (text, image, video, audio) with high throughput and an OpenAI-compatible API.

AI/ML Evaluation Framework
python
4.0k

EvolvingLMMs-Lab/lmms-eval

A unified, reproducible, and efficient multimodal evaluation toolkit for large language models across text, image, video, and audio tasks.

Deep Learning Library
pytorch
1.9k

kyegomez/BitNet

A PyTorch implementation of BitNet, enabling highly efficient 1-bit transformers for large language models.

AI Model Fine-tuning Framework
HuggingFace
1.8k

2U1/Qwen-VL-Series-Finetune

An open-source implementation for efficiently fine-tuning Alibaba Cloud's Qwen-VL series of multimodal large language models using HuggingFace and Liger-Kernel.

Machine Learning Research Framework
pytorch
5.6k

facebookresearch/mmf

A modular PyTorch-based framework from Facebook AI Research for state-of-the-art vision and language multimodal AI research.

AI/ML Foundation Model
2.2k

OpenGVLab/InternVideo

A series of video foundation models and large-scale datasets designed for comprehensive multimodal video understanding and generation.

AI Optimization Library
python
1.0k

intel/auto-round

AutoRound is an advanced quantization toolkit for Large Language Models (LLMs) and Vision-Language Models (VLMs), enabling high-accuracy, ultra-low-bit inference across diverse hardware.

Deep Learning Library / Computer Vision Library
PaddlePaddle
3.6k

ZhaoJ9014/face.evoLVe

A high-performance, comprehensive face recognition library built on PaddlePaddle and PyTorch.

AI Voice Synthesis Library
36.2k

myshell-ai/OpenVoice

An AI voice synthesis library offering instant, accurate, and flexible voice cloning with multi-lingual support.

Deep Learning Library
python
45.1k

coqui-ai/TTS

A deep learning toolkit for Text-to-Speech, offering pretrained models, training tools, and dataset utilities.

AI-powered Text-to-Speech Engine
Docker
8.5k

netease-youdao/EmotiVoice

An open-source, multi-voice, and prompt-controlled text-to-speech engine capable of generating speech with diverse emotions in English and Chinese.

AI/ML Model, Speech Synthesis Library
python
6.2k

yl4579/StyleTTS2

StyleTTS 2 is a text-to-speech model that achieves human-level speech synthesis by leveraging style diffusion and adversarial training with large speech language models.

AI Text-to-Speech (TTS) Model
Docker
4.2k

metavoiceio/metavoice-src

MetaVoice-1B is an open-source, 1.2B parameter foundational model for highly expressive, human-like text-to-speech synthesis and zero-shot voice cloning.

AI/ML Speech Synthesis Framework
tensorflow
4.0k

TensorSpeech/TensorFlowTTS

TensorFlowTTS provides real-time, state-of-the-art speech synthesis architectures based on TensorFlow 2, supporting multiple languages and optimized for fast inference and deployment on various devices.

AI/ML Model Implementation
Python
8.0k

Plachtaa/VALL-E-X

An open-source implementation of Microsoft's VALL-E X, enabling zero-shot multilingual text-to-speech synthesis and voice cloning with emotion control.

Speech Synthesis Library
python
7.9k

jaywalnut310/vits

VITS is an end-to-end text-to-speech model that generates highly natural-sounding audio with diverse rhythms, outperforming traditional two-stage TTS systems.

Speech Synthesis Library
pytorch
10.1k

mozilla/TTS

A deep learning library for advanced Text-to-Speech generation, offering high-quality speech synthesis with pretrained models and multi-language support.

AI/ML Model
huggingface
2.4k

X-PLUG/mPLUG-DocOwl

A modularized multimodal large language model designed for OCR-free document understanding.

Multimodal Large Language Model
3.6k

NExT-GPT/NExT-GPT

The first end-to-end multimodal large language model (MM-LLM) that perceives input and generates output in arbitrary combinations (any-to-any) of text, image, video, and audio.

AI Model / Research Project
2.5k

X-PLUG/mPLUG-Owl

A family of powerful multi-modal large language models (MLLMs) designed to advance AI's understanding and generation capabilities across various data types.

AI Interview Preparation Guide
3.5k

WeThinkIn/AIGC-Interview-Book

A comprehensive interview preparation guide for AIGC and AI algorithm/development roles, covering a wide range of AI technologies and career insights.

Generative AI Educational Resource Hub
google colab
2.7k

FurkanGozukara/Stable-Diffusion

A comprehensive repository offering expert-level tutorials, guides, and courses on various Generative AI technologies, primarily focusing on Stable Diffusion and its ecosystem.

AI Inference Toolkit
mnn
4.4k

xlite-dev/lite.ai.toolkit

A lightweight C++ toolkit for deploying over 100 diverse AI models with multiple inference engines.

AI Video Generation Tool
Python
4.7k

nateraw/stable-diffusion-videos

Create dynamic videos by smoothly transitioning between text prompts using Stable Diffusion's latent space exploration.

AI Video Upscaling Tool
ComfyUI
2.4k

numz/ComfyUI-SeedVR2_VideoUpscaler

Official SeedVR2 Video Upscaler for ComfyUI, enabling high-quality video and image upscaling, also runnable as a standalone CLI.

AI-powered Face Swapping Software
55.2k

deepfakes/faceswap

An open-source deep learning tool that enables users to recognize and swap faces in images and videos.

AI Image Translation Model
python
2.4k

GaParmar/img2img-turbo

A one-step image-to-image translation framework leveraging Stable Diffusion Turbo for rapid generation across various tasks like sketch-to-image and day-to-night transformations.

AI-powered Image Restoration and Upscaling Tool
python
5.5k

Fanghua-Yu/SUPIR

SUPIR is an AI-driven project focused on developing practical algorithms for photo-realistic image restoration and upscaling in real-world scenarios.

OSS Alternative

Explore the best open source alternatives to commercial software.

© 2026 OSS Alternative. hotgithub.com - All rights reserved.