Wan LoRA: cache the whole stack (VAE + UMT5 + transformer) across jobs
Extend the cross-job cache from just the transformer expert(s) to the full Wan
stack: the VAE, tokenizer and text encoder are kept on CPU between jobs (moved to
GPU only while encoding), and the experts stay on GPU. A back-to-back training run
against the same base now reloads nothing from disk; previously the small VAE and
text encoder were still reloaded for each job. The releaser and the error path
clear all cached components.
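
The caching behaviour described above can be illustrated with a minimal sketch. It is not the code from this change: StackCache, FakeModule, load_stack and run_job are hypothetical names, the modules are device-tracking stand-ins rather than real models, and the tokenizer is left out because it has no device placement.

```python
from contextlib import contextmanager


class FakeModule:
    """Stand-in for a model component; it only tracks which device it is on."""

    def __init__(self, name, device="cpu"):
        self.name = name
        self.device = device

    def to(self, device):
        self.device = device
        return self


class StackCache:
    """Hypothetical cross-job cache holding the whole stack for one base model."""

    def __init__(self, loader):
        self._loader = loader
        self._key = None
        self._parts = {}
        self.disk_loads = 0  # counts simulated loads from disk

    def get(self, base):
        # Reload only when the requested base differs from the cached one.
        if self._key != base:
            self.release()
            self._parts = self._loader(base)
            self._key = base
            self.disk_loads += 1
        return self._parts

    @contextmanager
    def on_gpu(self, name):
        # Move a CPU-resident component to the GPU only for the duration of use.
        part = self._parts[name]
        part.to("cuda")
        try:
            yield part
        finally:
            part.to("cpu")

    def release(self):
        # Drop every cached component, not just the transformer.
        self._parts = {}
        self._key = None


def load_stack(base):
    # Assumed placement: VAE and text encoder on CPU, transformer on GPU.
    return {
        "vae": FakeModule("vae", "cpu"),
        "text_encoder": FakeModule("text_encoder", "cpu"),
        "transformer": FakeModule("transformer", "cuda"),
    }


def run_job(cache, base, fail=False):
    try:
        parts = cache.get(base)
        with cache.on_gpu("text_encoder") as encoder:
            assert encoder.device == "cuda"  # encoding would happen here
        if fail:
            raise RuntimeError("simulated training failure")
        return parts
    except Exception:
        # Error path: clear all cached components before re-raising.
        cache.release()
        raise
```

In this sketch, two consecutive run_job calls against the same base trigger a single simulated disk load, while a failed job empties the cache so the next job starts from a clean state.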
Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>