Files · 8e8c0a4570b6a3e61044a2abccf2cab2f3f995d4 · nexlab / coderai

Wan LoRA trainer: fix fp32/bf16 dtype mismatch in the train step · 8e8c0a45

Stefy Lanza (nextime / spora ) authored Jun 09, 2026

torch.rand defaults to fp32, so the rectified-flow interpolation promoted x_t to
fp32 while the patch-embedding Conv3d stays bf16 (bitsandbytes 4-bit quantizes
only Linear layers), raising "Input type (float) and bias type (BFloat16) should
be the same". Compute the interpolation in fp32 then cast x_t/target back to the
model compute dtype, and pass timestep as fp32 (Wan casts it internally).
Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>

8e8c0a45

Name	Last commit	Last update
codai		Loading commit data...
docs/superpowers		Loading commit data...
samples		Loading commit data...
tests		Loading commit data...
tools		Loading commit data...
.gitignore		Loading commit data...
CoderAI.gif		Loading commit data...
LICENSE.md		Loading commit data...
MULTIMODAL_CAPABILITIES.md		Loading commit data...
MULTIMODAL_UI_EXAMPLES.md		Loading commit data...
README.md		Loading commit data...
build.ps1		Loading commit data...
build.sh		Loading commit data...
coderai		Loading commit data...
coderai-broker-implementation-reference.md		Loading commit data...
coderai-integration.md		Loading commit data...
osxbuild.sh		Loading commit data...
requirements-nvidia.txt		Loading commit data...
requirements-vulkan.txt		Loading commit data...
requirements.txt		Loading commit data...
todo.md		Loading commit data...

README.md