- 27 Feb, 2026 19 commits
-
-
Stefy Lanza (nextime / spora ) authored
-
Stefy Lanza (nextime / spora ) authored
- Add model ID check for I2V detection (not just tags)
- Add -Diffusers suffix when extracting base model from tags
- Add runtime validation for Wan LoRA base model against tags
-
Stefy Lanza (nextime / spora ) authored
-
Stefy Lanza (nextime / spora ) authored
For Wan LoRA adapters, skip reading base_model from stored config (which may have incorrect values like T2V instead of I2V). Instead, always infer from LoRA ID by checking for 'i2v' in the name. This ensures wan2_2_i2v_general_nsfw_lora uses the correct Wan-AI/Wan2.2-I2V-A14B-Diffusers base model instead of the incorrect Wan-AI/Wan2.2-T2V-A14B-Diffusers.
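A minimal sketch of the inference described above. The model IDs come from the commit message; the helper name is hypothetical, not the project's actual function:

```python
def infer_wan22_base_model(lora_id: str) -> str:
    """Infer the Wan 2.2 base model from the LoRA ID itself, rather than
    trusting a stored config that may record T2V for an I2V LoRA."""
    if "i2v" in lora_id.lower():
        return "Wan-AI/Wan2.2-I2V-A14B-Diffusers"
    return "Wan-AI/Wan2.2-T2V-A14B-Diffusers"

print(infer_wan22_base_model("wan2_2_i2v_general_nsfw_lora"))
```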
-
Stefy Lanza (nextime / spora ) authored
-
Stefy Lanza (nextime / spora ) authored
The check at line 9414 only checked the stored supports_i2v flag from the model configuration, but didn't check the model ID string for 'i2v' like the detect_model_type() function does. Now the I2V validation also detects I2V capability from the model ID, making it consistent with detect_model_type() and properly detecting I2V capability for LoRA adapters like lopi999/Wan2.2-I2V_General-NSFW-LoRA.
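The combined check could look like this sketch (function and config shape are assumptions; only the OR-of-flag-and-ID logic is from the commit):

```python
def supports_i2v(model_id: str, config: dict) -> bool:
    # Consistent with detect_model_type(): either the stored flag
    # or the model ID string itself can indicate I2V capability.
    return bool(config.get("supports_i2v")) or "i2v" in model_id.lower()

print(supports_i2v("lopi999/Wan2.2-I2V_General-NSFW-LoRA", {}))
```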
-
Stefy Lanza (nextime / spora ) authored
-
Stefy Lanza (nextime / spora ) authored
- Always infer base model from LoRA name, not stored database value
- Fix supports_i2v detection for LoRAs based on LoRA name
- Ensures Wan 2.2 I2V LoRAs use correct I2V base model
-
Stefy Lanza (nextime / spora ) authored
-
Stefy Lanza (nextime / spora ) authored
-
Stefy Lanza (nextime / spora ) authored
-
Stefy Lanza (nextime / spora ) authored
- hpcai-tech/Open-Sora-1.2 -> hpcai-tech/Open-Sora-2
- genmo/mochi -> genmo/mochi-1-preview
-
Stefy Lanza (nextime / spora ) authored
When selecting a Wan 2.1 I2V model that is a LoRA or tensor weight, the base model now uses Wan-AI/Wan2.1-I2V-14B-720P-Diffusers instead of the generic Wan-AI/Wan2.1-I2V-14B-Diffusers.
-
Stefy Lanza (nextime / spora ) authored
Fixed ValueError: too many values to unpack (expected 5)
- Line 3929: Added missing 6th element to unpacking
- Line 3936: Added missing 6th element to unpacking
The results tuple has 6 elements: (name, info, caps, is_disabled, fail_count, orig_idx)
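The bug reduces to unpacking a 6-tuple into 5 targets; a minimal reproduction (the sample values are illustrative):

```python
# Each entry mirrors the commit's tuple shape:
# (name, info, caps, is_disabled, fail_count, orig_idx)
results = [("wan2_2", {"vram": 18}, {"i2v": True}, False, 0, 7)]

# Before the fix, only 5 targets were given for a 6-element tuple,
# raising "ValueError: too many values to unpack (expected 5)":
# name, info, caps, is_disabled, fail_count = results[0]

# After the fix, all 6 elements are unpacked, including orig_idx:
name, info, caps, is_disabled, fail_count, orig_idx = results[0]
print(orig_idx)
```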
-
Stefy Lanza (nextime / spora ) authored
When selecting a Wan 2.1 I2V model that is a LoRA or tensor weight, the base model now uses Wan-AI/Wan2.1-I2V-14B-720P-Diffusers instead of the generic Wan-AI/Wan2.1-I2V-14B-Diffusers.
-
Stefy Lanza (nextime / spora ) authored
Fixed ValueError: too many values to unpack (expected 5)
- Line 3929: Added missing 6th element to unpacking
- Line 3936: Added missing 6th element to unpacking
The results tuple has 6 elements: (name, info, caps, is_disabled, fail_count, orig_idx)
-
Stefy Lanza (nextime / spora ) authored
Fix I2V pipeline auto-detection

See merge request !1
-
Stefy Lanza (nextime / spora ) authored
- Add detect_model_family() to identify model family (wan, sdxl, sd, ltx, etc.)
- Add get_pipeline_for_model_family() for proper pipeline selection based on family + task
- Enhance detect_generation_type() to check --image FIRST for I2V detection
- Add support for --image_model, --prompt_image, --prompt_animation as I2V indicators
- Add support for audio/subtitle options as T2V+V2V chaining indicators

This fixes the issue where SDXL models were incorrectly using WanPipeline for I2V tasks, causing type mismatch errors (expected UMT5EncoderModel, got CLIPTextModel). Now SDXL models correctly use DiffusionPipeline or StableDiffusionXLPipeline.
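A sketch of how family detection plus a (family, task) lookup could route models to pipelines. The function names are from the commit; their bodies, the lookup table, and the sample model IDs are assumptions:

```python
def detect_model_family(model_id: str) -> str:
    """Hypothetical sketch of detect_model_family(): substring match
    against known families, most specific first."""
    mid = model_id.lower()
    for family in ("wan", "sdxl", "ltx", "flux", "sd"):
        if family in mid:
            return family
    return "unknown"

def get_pipeline_for_model_family(family: str, task: str) -> str:
    # SDXL must never fall through to WanPipeline: its text encoder is
    # CLIPTextModel, while WanPipeline expects UMT5EncoderModel.
    table = {
        ("wan", "t2v"): "WanPipeline",
        ("wan", "i2v"): "WanImageToVideoPipeline",
        ("sdxl", "i2v"): "StableDiffusionXLPipeline",
    }
    return table.get((family, task), "DiffusionPipeline")

print(get_pipeline_for_model_family(detect_model_family("stabilityai/sdxl-turbo"), "i2v"))
```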
-
Stefy Lanza (nextime / spora ) authored
-
- 26 Feb, 2026 21 commits
-
-
Stefy Lanza (nextime / spora ) authored
-
Stefy Lanza (nextime / spora ) authored
- Add --model-list-batch to EXAMPLES.md for batch model listing
- Add new CLI options to MCP server (--output-dir, --yes, --audio-chunk)
- Add new CLI options to webapp build_command function
- Update README.md with Output Options section
-
Stefy Lanza (nextime / spora ) authored
Added --audio-chunk argument with 3 modes:
- overlap (default): overlapping chunks like [0-60], [58-118]
- word-boundary: uses Whisper timestamps to split at word boundaries
- vad: uses Voice Activity Detection to skip silence

Also added --audio-chunk-overlap to control overlap duration.

New functions added:
- process_video_with_vad(): VAD-based chunking
- process_video_word_boundary(): Word-boundary chunking using Whisper

Modified:
- transcribe_video_audio(): accepts audio_chunk_type and audio_chunk_overlap params
- _transcribe_chunked(): accepts chunk_type and overlap params
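The default overlap mode can be sketched as follows; this reproduces the [0-60], [58-118] boundaries from the commit message, but the function and parameter names are illustrative, not the CLI's internals:

```python
def overlap_chunks(duration_s: float, chunk_s: float = 60.0, overlap_s: float = 2.0):
    """Split an audio track into fixed-size chunks where each chunk
    overlaps the previous one by overlap_s seconds."""
    chunks, start = [], 0.0
    step = chunk_s - overlap_s  # advance less than a full chunk
    while start < duration_s:
        chunks.append((start, min(start + chunk_s, duration_s)))
        start += step
    return chunks

print(overlap_chunks(120.0))
```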
-
Stefy Lanza (nextime / spora ) authored
Models like Faber8/AbyssOrangeMix2 are SD1.5 models, not Flux. Now detected as StableDiffusionPipeline instead of FluxPipeline.
-
Stefy Lanza (nextime / spora ) authored
When a model has a VAE configured but the VAE files don't exist in the repository, try loading with the default VAE instead.
-
Stefy Lanza (nextime / spora ) authored
Allow using full HuggingFace model ID (e.g., Faber8/AbyssOrangeMix2_nsfw) as --model argument by looking up both short name and full ID in MODELS.
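The dual lookup could work like this sketch (the MODELS shape and helper name are assumptions; the example model ID is from the commit):

```python
MODELS = {"abyssorangemix2_nsfw": {"id": "Faber8/AbyssOrangeMix2_nsfw"}}

def lookup_model(name: str, models: dict):
    """Resolve --model by short name first, then by full HuggingFace ID."""
    if name in models:
        return models[name]
    for info in models.values():
        if info.get("id") == name:
            return info
    return None

print(lookup_model("Faber8/AbyssOrangeMix2_nsfw", MODELS) is not None)
```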
-
Stefy Lanza (nextime / spora ) authored
The repos attribute from scan_cache_dir() is a frozenset, not a list, so .pop() doesn't work. Fixed by using next(iter()) instead.
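The fix in miniature (the repo name is a placeholder):

```python
repos = frozenset({"repo-a"})  # scan_cache_dir().repos is a frozenset

# repos.pop() raises AttributeError: frozensets are immutable and
# have no pop() method. next(iter(...)) grabs an arbitrary element
# from any non-empty iterable instead.
first = next(iter(repos))
print(first)
```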
-
Stefy Lanza (nextime / spora ) authored
The message is now suppressed when using --model-list-batch to make the output cleaner for scripts.
-
Stefy Lanza (nextime / spora ) authored
Added --yes / -y argument that automatically answers 'yes' to confirmation prompts when deleting cached models or clearing the entire cache.

Usage:
videogen --remove-cached-model MODEL_ID --yes
videogen --clear-cache --yes
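A minimal argparse sketch of the flag; the option names are from the commit, everything else is illustrative:

```python
import argparse

parser = argparse.ArgumentParser(prog="videogen")
parser.add_argument("--yes", "-y", action="store_true",
                    help="automatically answer 'yes' to confirmation prompts")
parser.add_argument("--remove-cached-model", metavar="MODEL_ID")

args = parser.parse_args(["--remove-cached-model", "some/model", "-y"])
print(args.yes)
```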
-
Stefy Lanza (nextime / spora ) authored
The --model-list-batch option was added but wasn't being handled properly. Now it correctly exits after printing the batch output, and is also added to the list of options that don't require --prompt.
-
Stefy Lanza (nextime / spora ) authored
- Fixed model ID consistency: numeric IDs now remain the same when using filters like --nsfw-friendly, --t2v-only, --i2v-only, etc. Previously, filtered lists would renumber models, making --show-model by numeric ID unreliable.
- Added --model-list-batch option for script-friendly output: outputs 'NUMERIC_ID:FULL_MODEL_NAME' format for easy parsing
- Added --output-dir option to specify the directory where output files will be saved
- Fixed syntax error in argparse epilog string that was causing 'SyntaxError: invalid decimal literal'
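The ID-stability fix amounts to numbering before filtering. A sketch with a made-up two-model catalog:

```python
MODELS = {"sdxl": {"nsfw": False}, "wan2_2_i2v": {"nsfw": True}}

# Assign numeric IDs over the FULL catalog first, so a filter such as
# --nsfw-friendly cannot renumber the survivors.
numbered = list(enumerate(MODELS.items(), start=1))
nsfw_only = [(i, name) for i, (name, info) in numbered if info["nsfw"]]
print(nsfw_only)  # the wan model keeps its original ID after filtering
```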
-
Stefy Lanza (nextime / spora ) authored
-
Stefy Lanza (nextime / spora ) authored
- Fixed model ID normalization to handle hyphens (in addition to underscores)
- Fixed dictionary key ordering in base_model_fallbacks so more specific keys (wan2.2.i2v) are checked before generic keys (wan2.2)
- Fixed Wan 2.1 I2V base model mapping (was incorrectly pointing to T2V)
- Fixed base model detection in earlier code sections to check model ID directly instead of relying on m_info.get('supports_i2v')
- Fixed typo: Wan 2.2 generic fallback now correctly uses Wan2.2-T2V

Now Wan 2.2 I2V models like Wan-AI/Wan2.2-I2V-A14B will correctly use Wan-AI/Wan2.2-I2V-14B-Diffusers as the base model instead of the incorrect Wan-AI/Wan2.2-T2V-14B-Diffusers.
-
Stefy Lanza (nextime / spora ) authored
- Fixed mapping table to use correct I2V base model (Wan-AI/Wan2.2-I2V-14B-Diffusers)
- Fixed Diffuser -> Diffusers typo in model IDs
- Updated all Wan 2.2 I2V references to use correct model ID
-
Stefy Lanza (nextime / spora ) authored
-
Stefy Lanza (nextime / spora ) authored
- Add correct base model for Wan 2.2 I2V: Wan-AI/Wan2.2-I2V-A14B-Diffuser
- Add specific VRAM estimate for Wan 2.2 I2V MoE models (~14GB)
- Apply more conservative VRAM calculation for models with weights/LoRAs
- Fix indentation error in add_model_from_hf function
-
Stefy Lanza (nextime / spora ) authored
Calculate: base_vram + 2GB + 50%

This ensures a 14B model estimated at 18GB will require ~29GB instead of 22.5GB.
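The commit's shorthand is ambiguous about ordering, but the worked example (18 GB -> ~29 GB) pins it down as 50% headroom on the base estimate plus a fixed 2 GB. A sketch under that reading (the function name is hypothetical):

```python
def estimate_vram_with_weights(base_vram_gb: float) -> float:
    """Conservative estimate for a base model that will also load
    fine-tuned weights or LoRA adapters: base * 1.5 + 2 GB.
    An 18 GB base therefore requires ~29 GB."""
    return base_vram_gb * 1.5 + 2

print(estimate_vram_with_weights(18))
```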
-
Stefy Lanza (nextime / spora ) authored
Instead of adding a fixed 2GB overhead, now calculates 25% more VRAM for base models that will have fine-tuned weights/tensors or LoRA adapters loaded on top.
-
Stefy Lanza (nextime / spora ) authored
The user confirmed that Wan2.2-I2V models should use Wan2.2-T2V as the base model, not the I2V variant.
-
Stefy Lanza (nextime / spora ) authored
The issue was that model IDs from HuggingFace use dots (wan2.2-i2v-a14b) while user config names use underscores (wan2_2_i2v_a14b). Now we normalize the model ID by replacing underscores with dots before matching against the base_model_fallbacks dictionary.
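The normalization step in miniature (the helper name is illustrative; the underscore-to-dot replacement and the sample name are from the commit):

```python
def normalize_model_id(model_id: str) -> str:
    """Map config-style names (underscores) onto the dotted form used
    by HuggingFace model IDs before matching base_model_fallbacks."""
    return model_id.lower().replace("_", ".")

print(normalize_model_id("wan2_2_i2v_a14b"))
```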
-
Stefy Lanza (nextime / spora ) authored
The issue was that model IDs like 'wan2_2_i2v_a14b' would match 'wan2_2' (T2V) before 'wan2_2_i2v' (I2V) because 'wan2_2' comes first in the dictionary. Now the dictionary is ordered with more specific keys first:
- wan2_2_i2v_a14b (most specific)
- wan2.2_i2v_a14b
- wan2_2_i2v
- wan2.2_i2v
- wan2_2
- wan2.2
etc.

This ensures longer/more specific keys are checked before shorter ones.
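First-hit substring matching makes key order load-bearing; sorting keys by length descending gives the same most-specific-first guarantee without relying on insertion order. A sketch (the fallback values are the Diffusers IDs from the surrounding commits; the matcher itself is an assumption):

```python
base_model_fallbacks = {
    "wan2_2_i2v_a14b": "Wan-AI/Wan2.2-I2V-A14B-Diffusers",
    "wan2_2_i2v": "Wan-AI/Wan2.2-I2V-A14B-Diffusers",
    "wan2_2": "Wan-AI/Wan2.2-T2V-A14B-Diffusers",
}

def match_base_model(model_id: str):
    # Longest keys first, so 'wan2_2_i2v' wins over 'wan2_2'.
    for key in sorted(base_model_fallbacks, key=len, reverse=True):
        if key in model_id:
            return base_model_fallbacks[key]
    return None

print(match_base_model("wan2_2_i2v_a14b"))
```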
-