NeMo RL Evolution: PR Tracking
This issue is a single index of the open and in-flight PRs across the NeMo RL evolution workstreams, so that status can be tracked in one place. PRs are grouped by workstream, and cross-workstream PRs are noted where relevant.
Data Plane
Owner: Zhiyu Li
Router Replay (R3)
Owner: Zeyu Zhou
FT, Distillation
Owner: Pranav Prashant Thombre
Refit
Owner: Songlin Jiang
Core refactor series (blocks the delta and RDMA refit work):
Delta Weight Transfer
P2P RDMA-based refit
NCCL hierarchical API
Owner: Youngeun Kwon
AsyncRL
Owners: Akash Mehra, Yuki Huang
Train pump (3-PR async-GRPO split-API stack):
Rollout and per-prompt streaming:
Single Controller:
Gym
Owners: Ananth Subramaniam, Hemil Desai
Generation
Reference links (full URLs)