Context
NeMo Gym currently ships two mini-swe-agent integrations:
mini_swe_agent - mini-swe-agent v1 (nv-mini-swe-agent fork), swegym_runner, Docker/Singularity, SWE-Gym train + SWE-bench Verified val, RL token/logprob passthrough
mini_swe_agent_2 - mini-swe-agent v2.1.0, Gym sandbox API (OpenSandbox), SWE-bench Verified eval, richer aggregate metrics
They overlap in purpose but serve different deployment/training paths today.
Goal
Decide whether to deprecate mini_swe_agent in favor of mini_swe_agent_2, or keep both with clearer docs on when to use each.
Before deprecating v1, confirm v2 covers:
Acceptance criteria
Context
NeMo Gym currently ships two mini-swe-agent integrations:
mini_swe_agent- mini-swe-agent v1 (nv-mini-swe-agentfork),swegym_runner, Docker/Singularity, SWE-Gym train + SWE-bench Verified val, RL token/logprob passthroughmini_swe_agent_2- mini-swe-agent v2.1.0, Gym sandbox API (OpenSandbox), SWE-bench Verified eval, richer aggregate metricsThey overlap in purpose but serve different deployment/training paths today.
Goal
Decide whether to deprecate
mini_swe_agentin favor ofmini_swe_agent_2, or keep both with clearer docs on when to use each.Before deprecating v1, confirm v2 covers:
mini_swe_agent_2not yet in root README table)Acceptance criteria