Fix #474: fix: pass all arguments through gradient checkpoint in BasicTransforme #488

Draft
Mr-Neutr0n wants to merge 1 commit into Stability-AI:main from Mr-Neutr0n:agent/issue-474-fix-pass-all-arguments

@Mr-Neutr0n

Fixes #474

Pass `additional_tokens` and `n_times_crossframe_attn_in_self` through `checkpoint()` in `BasicTransformerBlock.forward()` so that gradient checkpointing no longer silently drops these arguments.
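
For reviewers, the failure mode can be sketched without the real module. The snippet below is a minimal illustration, not the repository's code: `checkpoint` here is a hypothetical pure-Python stand-in that only forwards the positional inputs it is given, and `BlockSketch` is an invented class whose `_forward` simply reports the arguments it received. Only the argument names `additional_tokens` and `n_times_crossframe_attn_in_self` come from this PR.

```python
def checkpoint(func, inputs, params, flag):
    """Hypothetical stand-in for a checkpoint helper.

    It forwards only the positional `inputs` to `func`; `params` and `flag`
    are accepted but ignored in this sketch.
    """
    return func(*inputs)


class BlockSketch:
    """Invented stand-in for a transformer block (assumption, not the real class)."""

    def _forward(self, x, context=None, additional_tokens=None,
                 n_times_crossframe_attn_in_self=0):
        # Report exactly what reached the inner function.
        return {
            "x": x,
            "context": context,
            "additional_tokens": additional_tokens,
            "n_times_crossframe_attn_in_self": n_times_crossframe_attn_in_self,
        }

    def forward_dropping(self, x, context=None, additional_tokens=None,
                         n_times_crossframe_attn_in_self=0):
        # Only (x, context) is forwarded, so the other two arguments
        # silently fall back to the defaults of _forward.
        return checkpoint(self._forward, (x, context), (), True)

    def forward_passing(self, x, context=None, additional_tokens=None,
                        n_times_crossframe_attn_in_self=0):
        # All four arguments are forwarded through the checkpoint call.
        inputs = (x, context, additional_tokens, n_times_crossframe_attn_in_self)
        return checkpoint(self._forward, inputs, (), True)


block = BlockSketch()
dropped = block.forward_dropping(
    "x", "ctx", additional_tokens="tok", n_times_crossframe_attn_in_self=2
)
passed = block.forward_passing(
    "x", "ctx", additional_tokens="tok", n_times_crossframe_attn_in_self=2
)
```

In this sketch `dropped` carries the default values for the two extra arguments, while `passed` carries the values the caller supplied. No error is raised in either case, which is why the drop goes unnoticed.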

Local test infrastructure was unavailable in the CI sandbox.


This change was prepared with AI assistance under human direction and review.

…oint in Basic

Signed-off-by: Mr-Neutr0n <64578610+Mr-Neutr0n@users.noreply.github.com>