Tags: #textbook
Educational Resource / Technical Textbook
python
1.8k
natolambert/rlhf-book
A comprehensive open-source textbook and code repository dedicated to Reinforcement Learning from Human Feedback (RLHF) and post-training language models.
A comprehensive open-source textbook and code repository dedicated to Reinforcement Learning from Human Feedback (RLHF) and post-training language models.