docs: MSC cloud checkpointing + expose multi-storage-client under [s3]#2517

Open
edjson wants to merge 12 commits into
NVIDIA-NeMo:mainfrom
edjson:s3-extra-and-cloud-docs

@edjson edjson commented Jun 11, 2026

Contributor

What does this PR do?

Documents MSC cloud (S3) checkpointing and fixes the [s3] extra so pip install nemo_automodel[s3] installs multi-storage-client.

Changelog

  • Add "Cloud Checkpointing with MSC (S3)" section to docs/guides/checkpointing.md.
  • Add multi-storage-client>=0.13 to the [s3] extra.
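
One way to check the new version floor locally is to compare the installed distribution's version against 0.13. This is a minimal sketch, assuming the distribution is named multi-storage-client as written in the changelog. The helper names release_tuple and meets_floor are hypothetical, and the parser only looks at the leading numeric release components, so pre-release suffixes are ignored.

```python
import re
from importlib import metadata
from typing import Optional, Tuple


def release_tuple(version: str) -> Tuple[int, ...]:
    """Return the leading numeric components, e.g. '0.13.1' -> (0, 13, 1)."""
    match = re.match(r"\d+(?:\.\d+)*", version)
    if match is None:
        raise ValueError(f"unrecognized version string: {version!r}")
    return tuple(int(part) for part in match.group(0).split("."))


def meets_floor(dist_name: str, floor: str) -> Optional[bool]:
    """True/False if the installed distribution meets the floor; None if it is not installed."""
    try:
        installed = metadata.version(dist_name)
    except metadata.PackageNotFoundError:
        return None
    return release_tuple(installed) >= release_tuple(floor)


if __name__ == "__main__":
    # Distribution name and floor taken from the changelog entry above.
    print(meets_floor("multi-storage-client", "0.13"))
```

A result of None means the extra was not installed in the current environment, which is distinct from an installed version that is too old.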

Before your PR is "Ready for review"

Pre checks:

If you haven't finished some of the above items, you can still open a "Draft" PR.

Additional Information

Follow-up to #1709. This closes the two remaining gaps raised by @krishnakalyan3: exposing the dependency under [s3] and documenting MSC configuration. Verified that after pip install nemo_automodel[s3], both boto3 and multistorageclient import successfully.
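
The import verification described above can be reproduced with a short script. This is a sketch rather than the check the author ran: missing_modules is a hypothetical helper, and find_spec only confirms that a module can be located, not that importing it succeeds end to end.

```python
from importlib.util import find_spec


def missing_modules(names):
    """Return the top-level module names that cannot be located in this environment."""
    return [name for name in names if find_spec(name) is None]


if __name__ == "__main__":
    # Module names as given in the PR description.
    missing = missing_modules(["boto3", "multistorageclient"])
    if missing:
        print("missing:", ", ".join(missing))
    else:
        print("both modules are importable")
```

Run it in the same environment where pip install nemo_automodel[s3] was executed; an empty result from missing_modules indicates both dependencies were pulled in by the extra.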

@edjson edjson requested review from a team and jgerh as code owners June 11, 2026 07:46
@copy-pr-bot

copy-pr-bot Bot commented Jun 11, 2026

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

@jgerh jgerh left a comment

Contributor

Completed tech pubs review of docs/guides/checkpointing.md and provided a few copyedits.

Comment threads on docs/guides/checkpointing.md: 10 (9 marked Outdated)
@akoumpa akoumpa added the docs-only With great power comes great responsibility. label Jun 11, 2026
edjson and others added 11 commits June 11, 2026 11:43
…odel[s3] installs MSC

Signed-off-by: edjson <edisonggacc@gmail.com>
Co-authored-by: jgerh <163925524+jgerh@users.noreply.github.com>
@edjson edjson force-pushed the s3-extra-and-cloud-docs branch from b22542f to e36d131 Compare June 11, 2026 18:43
@svcnvidia-nemo-ci svcnvidia-nemo-ci added the waiting-on-customer Waiting on the original author to respond label Jun 11, 2026
@krishnakalyan3

Contributor

@edjson the PR looks good. Can you please resolve the merge conflicts?

Signed-off-by: edjson <edisonggacc@gmail.com>
@svcnvidia-nemo-ci svcnvidia-nemo-ci added waiting-on-customer Waiting on the original author to respond and removed waiting-on-customer Waiting on the original author to respond labels Jun 14, 2026
Labels

community-request docs-only With great power comes great responsibility. waiting-on-customer Waiting on the original author to respond

5 participants