Ecosystem & Stack: deepspeed
modelscope/ms-swift
A comprehensive framework from ModelScope for efficiently fine-tuning, evaluating, and deploying over 1000 large language models and multimodal large models using advanced techniques.
OpenRLHF/OpenRLHF
An easy-to-use, scalable, and high-performance open-source framework for Reinforcement Learning from Human Feedback (RLHF) based on Ray and vLLM.
oumi-ai/oumi
An end-to-end platform for fine-tuning, evaluating, and deploying open-source Large Language Models (LLMs) and Vision Language Models (VLMs).
bghira/SimpleTuner
A user-friendly, versatile fine-tuning kit for image, video, and audio diffusion models, emphasizing simplicity and cutting-edge features.
X-LANCE/SLAM-LLM
A deep learning toolkit for training custom multimodal large language models focused on speech, language, audio, and music processing.
LianjiaTech/BELLE
BELLE is an open-source project dedicated to fostering the development of Chinese conversational large language models, aiming to make LLMs accessible to everyone.
wenge-research/YAYI
YaYi is an open-source Chinese large language model series, built on LLaMA 2 & BLOOM, designed to provide secure, reliable, and domain-specific AI capabilities for enterprise customers through extensive multi-domain instruction tuning.
huggingface/alignment-handbook
Provides robust recipes and training code to align language models with human and AI preferences, enhancing helpfulness and safety.
OpenLMLab/MOSS-RLHF
An open-source framework providing code, models, and insights for stable Reinforcement Learning from Human Feedback (RLHF) training in Large Language Models, focusing on the PPO algorithm and reward modeling.
lucidrains/DALLE-pytorch
An open-source PyTorch implementation of OpenAI's DALL-E, a text-to-image transformer, including CLIP for generation ranking.
sentient-agi/OML-1.0-Fingerprinting
A framework for embedding secret cryptographic fingerprints into Large Language Models (LLMs) via fine-tuning to verify ownership and prevent unauthorized use.
liucongg/ChatGLM-Finetuning
A comprehensive toolkit for fine-tuning ChatGLM-6B, ChatGLM2-6B, and ChatGLM3-6B models using various methods like Freeze, Lora, P-tuning, and full parameter fine-tuning.
X-PLUG/mPLUG-DocOwl
A modularized multimodal large language model designed for OCR-free document understanding.