OpenPipe/ART
An open-source framework for training multi-step LLM agents using reinforcement learning (GRPO) to learn from experience, offering a serverless RL training service.
Core Features
Quick Start
pip install openpipe-artDetailed Introduction
ART (Agent Reinforcement Trainer) is an open-source framework designed to enhance the reliability of large language model (LLM) agents by enabling them to learn from experience through reinforcement learning. It integrates GRPO (Generalized Reinforcement Policy Optimization) to facilitate the training of multi-step agents for complex, real-world tasks. Complementing this, ART offers "W&B Training," a serverless RL service that automates infrastructure management, significantly reducing training costs, accelerating feedback cycles, and eliminating DevOps complexities, making advanced RL accessible and efficient for developers.