Accelerating Open Source Rl And

Accelerating Open-Source RL and Agentic Inference with vLLM - Michael Goin, Red Hat | vLLM

Accelerating Open-Source RL and

Introduction to coax: A Modular RL Package

This is a short presentation introducing the

Self Learning AI: Accelerate w/ new RL

The frontier of LLM research has shifted decisively toward Post-Training and System 2 reasoning. We all know the recipe for ...

Environments Hub: A Community Hub To Scale RL To Open AGI

RL

WarpDrive: Orders of Magnitude Faster Multi-Agent Deep RL on a GPU

Speakers: Stephan Zheng, Lead Research Scientist, Salesforce Stephan Zheng (www.stephanzheng.com) leads the AI ...

Reinforcement Learning, Agents & OpenEnv

Thanks for joining the "Mini

Survive-RL 🫏: An open source environment for multi-agent reinforcement learning.

Mandred Tech is proud to release its first

Revolutionizing Reinforcement Learning Framework for DLMs (Sep 2025)

Title: Revolutionizing Reinforcement Learning Framework for Diffusion Large Language Models (Sep 2025) Link: ...

Optimizing Large-Scale RL with SGLang | Chenyang Zhao | AER Labs

This talk addresses the Training-Inference Mismatch problem commonly encountered in large-scale reinforcement learning (

Open Reasoner Zero - Large-Scale Reasoning-Oriented RL - Install Locally

This video locally installs Open-Reasoner-Zero, which is first

GoLongRL: Multitask RL for Long-Context LLMs

In this AI Research Roundup episode, Alex discusses the paper: 'GoLongRL: Capability-Oriented Long Context Reinforcement ...

Let LLMs Wander: Engineering RL Environments — Stefano Fiorucci

Reasoning models like DeepSeek R1 have demonstrated that learning from interaction is just as critical as learning from ...

Contextual + Ray: Boosting SFT, RL & Inference at Scale | Ray Summit 2025

At Ray Summit 2025, Fanhai Lu from Contextual AI shares how the company builds enterprise-grade AI agents and applications ...

AReaL: Async RL for Language Reasoning

In this episode of the AI Research Roundup, host Alex explores a cutting-edge paper on large-scale reinforcement learning ...

How to Train Your Agent: Building Reliable Agents with RL — Kyle Corbitt, OpenPipe

Have you ever launched an awesome agentic demo, only to realize no amount of prompting will make it reliable enough to deploy ...

SWE-RL: Reinforcement Learning for LLMs on Software Evolution

This paper introduces SWE-

Trending GitHub Projects Part-1 : Open Source AI, Automation, RL, 3D & Developer Tools

Trending GitHub Projects Part-1 :

Accelerating RL Post-Training Rollouts via System-Integrated Speculative Decoding

This paper addresses rollout generation as a major bottleneck in