Tags: #rlhf
OpenRLHF/OpenRLHF
An easy-to-use, scalable, and high-performance open-source framework for Reinforcement Learning from Human Feedback (RLHF), leveraging Ray and vLLM for distributed training of LLMs and VLMs.
hiyouga/ChatGLM-Efficient-Tuning
An efficient framework for fine-tuning ChatGLM-6B and ChatGLM2-6B models using PEFT methods, including LoRA, QLoRA, and RLHF, with a Web UI.
natolambert/rlhf-book
A comprehensive open-source textbook and code repository dedicated to Reinforcement Learning from Human Feedback (RLHF) and post-training language models.
huggingface/alignment-handbook
Provides robust training recipes and scripts to align large language models with human and AI preferences, enhancing helpfulness and safety.
opendilab/awesome-RLHF
A continually updated, curated list of essential resources for Reinforcement Learning with Human Feedback (RLHF), encompassing research papers, codebases, and datasets.
PKU-Alignment/align-anything
A modular framework for aligning any-modality large models with human intentions and values using various fine-tuning and reinforcement learning methods.
PKU-Alignment/safe-rlhf
A modular open-source framework for training constrained value-aligned Large Language Models (LLMs) using Safe Reinforcement Learning from Human Feedback (RLHF).
zai-org/ImageReward
A human preference reward model for evaluating and improving text-to-image generation models.
RLHFlow/RLHF-Reward-Modeling
A comprehensive collection of recipes and code for training various reward models crucial for Reinforcement Learning from Human Feedback (RLHF) in large language models.