Tags: #rlhf

Reinforcement Learning Framework

9.4k

OpenRLHF/OpenRLHF

An easy-to-use, scalable, and high-performance open-source framework for Reinforcement Learning from Human Feedback (RLHF), leveraging Ray and vLLM for distributed training of LLMs and VLMs.

rlhf llm vlm

LLM Fine-tuning Framework

3.7k

hiyouga/ChatGLM-Efficient-Tuning

An efficient framework for fine-tuning ChatGLM-6B and ChatGLM2-6B models using PEFT methods, including LoRA, QLoRA, and RLHF, with a Web UI.

llm fine-tuning peft

Educational Resource / Technical Textbook

Python

1.8k

natolambert/rlhf-book

A comprehensive open-source textbook and code repository dedicated to Reinforcement Learning from Human Feedback (RLHF) and post-training language models.

rlhf machine-learning llm

LLM Alignment Toolkit

DeepSpeed

5.6k

huggingface/alignment-handbook

Provides robust training recipes and scripts to align large language models with human and AI preferences, enhancing helpfulness and safety.

llm alignment fine-tuning rlhf

Awesome List / Research Resource Collection

4.4k

opendilab/awesome-RLHF

A continually updated, curated list of essential resources for Reinforcement Learning with Human Feedback (RLHF), encompassing research papers, codebases, and datasets.

rlhf awesome-list machine-learning

AI/ML Training Framework

Python

4.7k

PKU-Alignment/align-anything

A modular framework for aligning any-modality large models with human intentions and values using various fine-tuning and reinforcement learning methods.

multi-modal llm alignment

AI/ML Research Framework

Hugging Face

1.6k

PKU-Alignment/safe-rlhf

A modular open-source framework for training constrained value-aligned Large Language Models (LLMs) using Safe Reinforcement Learning from Human Feedback (RLHF).

llm rlhf safety

AI/ML Library

Python

1.7k

zai-org/ImageReward

A human preference reward model for evaluating and improving text-to-image generation models.

text-to-image reward model human preference

Machine Learning Research Toolkit

Python

1.5k

RLHFlow/RLHF-Reward-Modeling

A comprehensive collection of recipes and code for training various reward models crucial for Reinforcement Learning from Human Feedback (RLHF) in large language models.

rlhf reward modeling large language models

LLM Alignment Framework

Python

1.4k

OpenLMLab/MOSS-RLHF

An open-source framework providing code, models, and insights for stable Reinforcement Learning from Human Feedback (RLHF) training in Large Language Models, focusing on the PPO algorithm and reward modeling.

llm rlhf ppo