Skip to content

Sync Gersbach Hep cleanser config compute-resource fixes to lineage-atlas snapshot#30

Open
adamklie wants to merge 1 commit into
mainfrom
fix/gersbach-hep-config-compute-resources
Open

Sync Gersbach Hep cleanser config compute-resource fixes to lineage-atlas snapshot#30
adamklie wants to merge 1 commit into
mainfrom
fix/gersbach-hep-config-compute-resources

Conversation

@adamklie

Copy link
Copy Markdown
Collaborator

What

Commits and propagates the three compute-resource fixes that got the Gersbach WTC11 hepatocyte cleanser_initial run (2026_06_10) past its failure points. These were live in the as-run setup/configs file but uncommitted and not mirrored into the CRISPRi_lineage_atlas/config snapshot — they'd be lost on a clean checkout.

Fixes captured

Stage Change Why
anndata_concat boot disk 100 → 500 GB 47 sub-pools wrote temp_processed/*.h5ad to the task disk; overflowed 100 GB, died at file 44/47 (No space left on device)
shared withName block n2-highmem-16 → n2-highmem-32 retry-scaled memory (50GB×attempt, up to 200 GB) overflowed the -16's 128 GB; GCP rejected as machine_type cannot satisfy compute_resource
PreprocessAnnData (new override) n2-highmem-64 / 320GB×attempt cap 480 loads the ~110 GiB concatenated AnnData; OOM-killed (exit 137) at every retry up to the 200 GB ceiling on Hep's 47 pools

Files

  • datasets/Gersbach…/setup/configs/…_cleanser_initial.config — commits the previously-uncommitted as-run config
  • CRISPRi_lineage_atlas/config/v0.0.1/Gersbach…config — verbatim copy so the lineage snapshot matches what ran

Hon CM was already in sync (its lineage copy already carried the analogous PreprocessAnnData → n2-highmem-32 fix). Guide assignment itself completed (47/47); the run's actual wall is downstream at inference — not addressed here.

🤖 Generated with Claude Code

The as-run config for the 2026_06_10/cleanser_initial run had three
compute-resource fixes that were never committed and never mirrored
into the CRISPRi_lineage_atlas/config snapshot:

- google profile boot disk 100 GB -> 500 GB (anndata_concat overflowed
  the 100 GB task disk writing 47 temp_processed/*.h5ad, died at 44/47)
- shared withName block n2-highmem-16 -> n2-highmem-32 (retry-scaled
  memory up to 200 GB overflowed the -16's 128 GB)
- dedicated PreprocessAnnData override -> n2-highmem-64 / 320 GB*attempt
  (OOM-killed at every retry up to the 200 GB ceiling on Hep's 47 pools)

Commits the modified setup/configs file and copies it verbatim into
CRISPRi_lineage_atlas/config/v0.0.1/ so the lineage-atlas snapshot
matches what actually produced the run. Hon was already in sync.

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant