Ecosystem & Stack: huggingface
vllm-project/vllm
A high-throughput and memory-efficient open-source engine designed for fast, easy, and cost-effective serving of large language models.
meta-llama/llama-cookbook
A comprehensive guide and collection of recipes for building with the Llama model family, covering inference, fine-tuning, RAG, and end-to-end use cases.
weaviate/weaviate
An open-source, cloud-native vector database enabling semantic search at scale by combining vector similarity with structured filtering and integrated AI models.
stas00/ml-engineering
An open collection of methodologies, tools, and step-by-step instructions for training, fine-tuning, and running inference with large language and multi-modal models.
PacktPublishing/LLM-Engineers-Handbook
A comprehensive practical guide and accompanying codebase for building, deploying, and monitoring advanced LLM and RAG applications on AWS, emphasizing LLMOps best practices.
ludwig-ai/ludwig
Ludwig is a low-code, declarative deep learning framework designed to simplify the building, training, and deployment of custom AI models, including LLMs and neural networks.
openvinotoolkit/openvino
OpenVINO is an open-source toolkit designed to optimize and deploy deep learning models for efficient AI inference across diverse hardware platforms, from edge to cloud.
decodingai-magazine/llm-twin-course
A free, hands-on course teaching how to build and deploy production-ready LLM and RAG systems, including a personalized 'LLM Twin', using LLMOps best practices.
bespokelabsai/curator
A Python library for generating and curating high-quality synthetic data for AI model training and structured data extraction.
ashishpatel26/LLM-Finetuning
Provides a collection of Colab notebooks for efficiently fine-tuning large language models using PEFT methods like LoRA and Hugging Face transformers.
ModelCloud/GPTQModel
A toolkit for quantizing (compressing) Large Language Models (LLMs) with hardware acceleration across various GPUs and CPUs, integrating with popular inference frameworks.
hiyouga/ChatGLM-Efficient-Tuning
An efficient toolkit for fine-tuning ChatGLM-6B models using PEFT methods, enabling customization and deployment of large language models.
nunchaku-ai/nunchaku
Nunchaku is a high-performance inference engine that optimizes 4-bit neural networks, especially diffusion models, for speed and efficiency.
JIA-Lab-research/LongLoRA
LongLoRA is an efficient fine-tuning method and associated models/datasets designed to extend the context window of Large Language Models (LLMs) for processing longer inputs.
cloneofsimo/lora
A tool for fast and efficient fine-tuning of diffusion models using Low-Rank Adaptation (LoRA), producing small, shareable models.
PKU-Alignment/align-anything
A modular framework for aligning any-modality large models with human intentions and values using diverse fine-tuning and reinforcement learning methods.
zai-org/ImageReward
A human preference reward model for evaluating and improving text-to-image generation models.
AI4Finance-Foundation/FinGPT
FinGPT is an open-source initiative providing cost-effective and rapidly adaptable large language models specifically designed for the dynamic financial sector.
Hunyuan-PromptEnhancer/PromptEnhancer
A prompt rewriting tool that refines user prompts into clearer, structured versions to enhance the quality of text-to-image generation and image-to-image editing.
XavierXiao/Dreambooth-Stable-Diffusion
An implementation of Google's DreamBooth technique on Stable Diffusion, enabling personalized text-to-image model fine-tuning with limited examples.
EvolvingLMMs-Lab/lmms-eval
A unified, reproducible, and efficient evaluation toolkit for large multimodal models across text, image, video, and audio tasks.
2U1/Qwen-VL-Series-Finetune
An open-source implementation for efficiently fine-tuning Alibaba Cloud's Qwen-VL series of multimodal large language models using Hugging Face and Liger-Kernel.
Marker-Inc-Korea/AutoRAG
An open-source framework that automates the evaluation and optimization of Retrieval-Augmented Generation (RAG) pipelines using AutoML-style techniques for specific datasets.
sentient-agi/OML-1.0-Fingerprinting
A framework for embedding secret cryptographic fingerprints into Large Language Models (LLMs) via fine-tuning to verify ownership and prevent unauthorized use.
PhoebusSi/Alpaca-CoT
A unified platform that brings instruction-tuning datasets, parameter-efficient fine-tuning methods, and multiple large language models behind a single interface for researchers and developers.
canopyai/Orpheus-TTS
A state-of-the-art open-source text-to-speech system leveraging LLMs to generate human-like, emotional, and low-latency speech with zero-shot voice cloning capabilities.
TensorSpeech/TensorFlowTTS
TensorFlowTTS provides real-time, state-of-the-art speech synthesis architectures based on TensorFlow 2, supporting multiple languages and optimized for fast inference and deployment on various devices.
X-PLUG/mPLUG-DocOwl
A modularized multimodal large language model designed for OCR-free document understanding.
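
Several entries in this list (ashishpatel26/LLM-Finetuning, JIA-Lab-research/LongLoRA, cloneofsimo/lora) build on LoRA. As a rough illustration of why LoRA adapters are described as small and shareable, the sketch below counts the trainable parameters in a dense weight matrix and in the two low-rank factors that LoRA trains in its place. The layer dimensions and rank are hypothetical values chosen for illustration, and the helper names are not taken from any of the listed repositories.

```python
def full_params(d_out: int, d_in: int) -> int:
    """Number of entries in a dense d_out x d_in weight matrix."""
    return d_out * d_in


def lora_params(d_out: int, d_in: int, rank: int) -> int:
    """Entries in the low-rank factors B (d_out x rank) and A (rank x d_in)."""
    return d_out * rank + rank * d_in


# Hypothetical layer size and rank, chosen only for illustration.
d_out, d_in, rank = 4096, 4096, 8

full = full_params(d_out, d_in)
lora = lora_params(d_out, d_in, rank)

print(f"dense weights:  {full}")
print(f"LoRA factors:   {lora}")
print(f"reduction:      {full // lora}x")
```

Under these assumed sizes the adapter holds a small fraction of the parameters of the layer it modifies, which is why only the factors need to be saved and distributed while the base model stays unchanged.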