• Stefy Lanza (nextime / spora )'s avatar
    LoRA trainer: in-process Wan video-DiT (QLoRA) training path · 9071e839
    Stefy Lanza (nextime / spora ) authored
    Adds a target="video" path that trains a LoRA directly against the configured
    video model so it loads on the video pipeline (image LoRAs can't apply to a Wan
    DiT). _train_wan: encodes stills as 1-frame latents via the Wan 3D VAE
    (latents_mean/std normalized), encodes the prompt via UMT5, loads the
    transformer expert(s) in 4-bit (QLoRA) with gradient checkpointing, adds PEFT
    LoRA to the attention projections, and trains a rectified-flow loss. Handles
    Wan2.2's dual experts (transformer + transformer_2) via boundary_ratio routing,
    and saves both expert LoRA layers (falls back to high-noise only on older
    diffusers). Reuses the queue, eviction, thermal checkpoints and progress.
    
    LoraTrainRequest gains target/quantize_4bit/num_frames; base-path resolution
    gains a "video" category so it resolves the video model entry.
    Co-Authored-By: 's avatarClaude Opus 4.8 <noreply@anthropic.com>
    9071e839
Name
Last commit
Last update
codai Loading commit data...
docs/superpowers Loading commit data...
samples Loading commit data...
tests Loading commit data...
tools Loading commit data...
.gitignore Loading commit data...
CoderAI.gif Loading commit data...
LICENSE.md Loading commit data...
MULTIMODAL_CAPABILITIES.md Loading commit data...
MULTIMODAL_UI_EXAMPLES.md Loading commit data...
README.md Loading commit data...
build.ps1 Loading commit data...
build.sh Loading commit data...
coderai Loading commit data...
coderai-broker-implementation-reference.md Loading commit data...
coderai-integration.md Loading commit data...
osxbuild.sh Loading commit data...
requirements-nvidia.txt Loading commit data...
requirements-vulkan.txt Loading commit data...
requirements.txt Loading commit data...
todo.md Loading commit data...