Tags: #reinforcement-learning

Reinforcement Learning Infrastructure

5.1k

inclusionAI/AReaL

A scalable asynchronous reinforcement learning infrastructure designed to bridge foundation model training with modern LLM-based agent applications.

reinforcement-learning llm-agents asynchronous-training

Details

AI/ML Training Framework

Python

9.2k

An open-source framework for training multi-step LLM agents using reinforcement learning (GRPO) to improve their reliability and performance on real-world tasks, with an optional serverless training service.

reinforcement learning llm agents ai training

Details

Reinforcement Learning Framework

ray

9.4k

OpenRLHF/OpenRLHF

An easy-to-use, scalable, and high-performance open-source framework for Reinforcement Learning from Human Feedback (RLHF), leveraging Ray and vLLM for distributed training of LLMs and VLMs.

rlhf llm vlm

Details

AI Agent Training Framework

python

17.0k

microsoft/agent-lightning

A versatile framework designed to train and optimize AI agents from any framework with minimal code changes, leveraging advanced algorithms like Reinforcement Learning.

ai-agents reinforcement-learning agent-training

Details

Reinforcement Learning Library for LLMs

Ray

3.1k

alibaba/ROLL

An efficient and user-friendly scaling library designed to optimize Reinforcement Learning with Large Language Models, enhancing performance in complex AI tasks.

reinforcement-learning large-language-models distributed-training

Details

Reinforcement Learning Framework for AI Agents

gpu

5.2k

Gen-Verse/OpenClaw-RL

An asynchronous reinforcement learning framework enabling personalized AI agent training through natural language conversations and scalable real-world deployments.

reinforcement-learning ai-agents llm

Details

AI Model Framework

6.0k

om-ai-lab/VLM-R1

VLM-R1 is a stable and generalizable R1-style Large Vision-Language Model that leverages reinforcement learning to significantly improve visual understanding tasks.

large vision-language model reinforcement learning fine-tuning

Details

Reinforcement Learning Infrastructure

5.2k

areal-project/AReaL

A scalable asynchronous reinforcement learning infrastructure designed to bridge foundation model training with modern LLM-based agent applications.

reinforcement-learning llm-agents asynchronous-training

Details

Tags: #reinforcement-learning

inclusionAI/AReaL

OpenPipe/ART

OpenRLHF/OpenRLHF

microsoft/agent-lightning

alibaba/ROLL

Gen-Verse/OpenClaw-RL

om-ai-lab/VLM-R1

areal-project/AReaL