
NeMo RL Software Architecture Update: PR Tracking #2905

@anwithk

NeMo RL Evolution: PR Tracking

This issue consolidates the open and in-flight PRs across the NeMo RL evolution workstreams into a single index for tracking status. PRs are grouped by workstream, with cross-workstream PRs noted where relevant.

Data Plane

Owner: Zhiyu Li

Router Replay (R3)

Owner: Zeyu Zhou

FT, Distillation

Owner: Pranav Prashant Thombre

Refit

Owner: Songlin Jiang

Core refactor series (blocks the delta and RDMA refit work):

Delta Weight Transfer

P2P RDMA based refit

NCCL hierarchical API

Owner: Youngeun Kwon

AsyncRL

Owners: Akash Mehra, Yuki Huang

Train pump (3-PR async-GRPO split-API stack):

Rollout and per-prompt streaming:

Single Controller:

Gym

Owners: Ananth Subramaniam, Hemil Desai

Generation

Reference links (full URLs)
