An open-source framework providing code, models, and insights for stable Reinforcement Learning from Human Feedback (RLHF) training of Large Language Models, with a focus on the PPO algorithm and reward modeling.
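Since the description centers on PPO, a minimal sketch of the clipped surrogate objective at PPO's core may help orient readers. The function below is purely illustrative (a hypothetical helper, not this framework's API), using per-token log-probabilities and advantages:

```python
import math

def ppo_clip_loss(logprobs, old_logprobs, advantages, clip_eps=0.2):
    """Illustrative PPO clipped surrogate loss (not the framework's actual API).

    logprobs / old_logprobs: log-probs of sampled tokens under the current
    and behavior policies; advantages: per-token advantage estimates.
    """
    losses = []
    for lp, old_lp, adv in zip(logprobs, old_logprobs, advantages):
        # Importance ratio between current and old policy.
        ratio = math.exp(lp - old_lp)
        # Clip the ratio to [1 - eps, 1 + eps] to limit the policy update.
        clipped_ratio = max(min(ratio, 1.0 + clip_eps), 1.0 - clip_eps)
        # Pessimistic (min) bound, negated so we can minimize it.
        losses.append(-min(ratio * adv, clipped_ratio * adv))
    return sum(losses) / len(losses)
```

In RLHF, the advantages are typically derived from a learned reward model's scores (plus a KL penalty against a reference policy), which is where the reward-modeling component of such a framework comes in.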