Fix OOM in I2V mode: sequential model loading

- Defer loading the I2V model when running in I2V mode without a provided image
- Generate the image first with the T2I model
- Unload the T2I model completely (del, empty_cache, gc.collect)
- Then load the I2V model and generate the video
- This ensures only one model is in memory at a time
- Fixes the Linux OOM killer issue triggered by loading multiple models at once
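
The load order described above can be sketched roughly as follows. This is a minimal illustration, not the code from this commit: `StubModel`, `release_cached_memory`, `run_i2v`, and the `load_t2i` / `load_i2v` callables are hypothetical names, and the sketch assumes that `empty_cache` refers to `torch.cuda.empty_cache()`, which is called only if PyTorch and CUDA are present.

```python
import gc


class StubModel:
    """Hypothetical stand-in for a real T2I or I2V pipeline."""

    live = 0  # number of StubModel instances currently alive

    def __init__(self, name):
        self.name = name
        StubModel.live += 1

    def __del__(self):
        StubModel.live -= 1

    def __call__(self, inputs):
        return f"{self.name}({inputs})"


def release_cached_memory():
    """Collect garbage, then release cached GPU memory if PyTorch/CUDA exist."""
    # Order is this sketch's choice: collect unreachable objects first so
    # their memory can be returned when the cache is emptied.
    gc.collect()
    try:
        import torch  # assumption: the real project uses PyTorch
    except ImportError:
        return
    if torch.cuda.is_available():
        torch.cuda.empty_cache()


def run_i2v(prompt, load_t2i, load_i2v, image=None):
    """Generate a video, keeping at most one model in memory at a time."""
    if image is None:
        # No image was provided: load T2I, generate the image, then unload it.
        t2i = load_t2i()
        image = t2i(prompt)
        del t2i
        release_cached_memory()
    # The I2V model is loaded only after the T2I model has been released.
    i2v = load_i2v()
    return i2v(image)
```

Passing the loaders as callables, rather than already-loaded models, is what makes the deferral possible: nothing is loaded until `run_i2v` decides it is needed.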