Ecosystem & Stack: weights-and-biases
Reinforcement Learning Framework
python
9.2k
OpenPipe/ART
An open-source framework for training multi-step LLM agents with reinforcement learning (GRPO) so that they learn from experience. It also offers a serverless RL training service.