- 27 Feb, 2026 7 commits
-
Stefy Lanza (nextime / spora ) authored
- Always infer base model from LoRA name, not stored database value
- Fix supports_i2v detection for LoRAs based on LoRA name
- Ensures Wan 2.2 I2V LoRAs use correct I2V base model
-
Stefy Lanza (nextime / spora ) authored
-
Stefy Lanza (nextime / spora ) authored
- hpcai-tech/Open-Sora-1.2 -> hpcai-tech/Open-Sora-2
- genmo/mochi -> genmo/mochi-1-preview
-
Stefy Lanza (nextime / spora ) authored
When selecting a Wan 2.1 I2V model that is a LoRA or tensor weight, the base model now uses Wan-AI/Wan2.1-I2V-14B-720P-Diffusers instead of the generic Wan-AI/Wan2.1-I2V-14B-Diffusers.
-
Stefy Lanza (nextime / spora ) authored
Fixed ValueError: too many values to unpack (expected 5)
- Line 3929: Added missing 6th element to unpacking
- Line 3936: Added missing 6th element to unpacking
The results tuple has 6 elements: (name, info, caps, is_disabled, fail_count, orig_idx)
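The shape of the fix can be sketched in a few lines (the entry values here are invented for illustration; only the 6-tuple layout comes from the commit message):

```python
# Each result entry carries 6 elements, per the commit message:
# (name, info, caps, is_disabled, fail_count, orig_idx)
results = [
    ("wan2_2_i2v_a14b", {"repo": "Wan-AI/Wan2.2-I2V-A14B"}, ["i2v"], False, 0, 0),
]

# Before the fix only 5 names appeared on the left-hand side, raising
# "ValueError: too many values to unpack (expected 5)".
for name, info, caps, is_disabled, fail_count, orig_idx in results:
    print(f"{orig_idx}: {name} disabled={is_disabled}")
```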
-
Stefy Lanza (nextime / spora ) authored
- Add detect_model_family() to identify model family (wan, sdxl, sd, ltx, etc.)
- Add get_pipeline_for_model_family() for proper pipeline selection based on family + task
- Enhance detect_generation_type() to check --image FIRST for I2V detection
- Add support for --image_model, --prompt_image, --prompt_animation as I2V indicators
- Add support for audio/subtitle options as T2V+V2V chaining indicators

This fixes the issue where SDXL models were incorrectly using WanPipeline for I2V tasks, causing type mismatch errors (expected UMT5EncoderModel, got CLIPTextModel). Now SDXL models correctly use DiffusionPipeline or StableDiffusionXLPipeline.
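A minimal sketch of family + task based selection, assuming simplified mappings (the function names come from the commit message; the lookup tables here are illustrative, not the project's actual ones):

```python
def detect_model_family(model_id: str) -> str:
    """Guess the model family from substrings of the model ID (sketch)."""
    mid = model_id.lower()
    for family in ("sdxl", "wan", "ltx"):
        if family in mid:
            return family
    return "sd"

def get_pipeline_for_model_family(family: str, task: str) -> str:
    # SDXL must never fall through to WanPipeline: its text encoder is a
    # CLIPTextModel, while WanPipeline expects a UMT5EncoderModel.
    table = {
        ("wan", "t2v"): "WanPipeline",
        ("wan", "i2v"): "WanImageToVideoPipeline",
        ("sdxl", "t2i"): "StableDiffusionXLPipeline",
    }
    return table.get((family, task), "DiffusionPipeline")
```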
-
Stefy Lanza (nextime / spora ) authored
-
- 26 Feb, 2026 22 commits
-
-
Stefy Lanza (nextime / spora ) authored
-
Stefy Lanza (nextime / spora ) authored
- Add --model-list-batch to EXAMPLES.md for batch model listing
- Add new CLI options to MCP server (--output-dir, --yes, --audio-chunk)
- Add new CLI options to webapp build_command function
- Update README.md with Output Options section
-
Stefy Lanza (nextime / spora ) authored
Added --audio-chunk argument with 3 modes:
- overlap (default): overlapping chunks like [0-60], [58-118]
- word-boundary: uses Whisper timestamps to split at word boundaries
- vad: uses Voice Activity Detection to skip silence

Also added --audio-chunk-overlap to control overlap duration.

New functions added:
- process_video_with_vad(): VAD-based chunking
- process_video_word_boundary(): Word-boundary chunking using Whisper

Modified:
- transcribe_video_audio(): accepts audio_chunk_type and audio_chunk_overlap params
- _transcribe_chunked(): accepts chunk_type and overlap params
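The default overlap mode can be sketched as a simple chunker (the function name and the 2-second default are assumptions; only the [0-60], [58-118] pattern comes from the commit message):

```python
def overlap_chunks(duration, chunk_len=60.0, overlap=2.0):
    """Return (start, end) windows like (0, 60), (58, 118), ... (sketch)."""
    chunks = []
    start = 0.0
    while start < duration:
        end = min(start + chunk_len, duration)
        chunks.append((start, end))
        if end >= duration:
            break
        # Each chunk re-reads the tail of the previous one so words cut at
        # a chunk boundary are transcribed whole at least once.
        start = end - overlap
    return chunks
```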
-
Stefy Lanza (nextime / spora ) authored
Models like Faber8/AbyssOrangeMix2 are SD1.5 models, not Flux. Now detected as StableDiffusionPipeline instead of FluxPipeline.
-
Stefy Lanza (nextime / spora ) authored
When a model has a VAE configured but the VAE files don't exist in the repository, try loading with the default VAE instead.
-
Stefy Lanza (nextime / spora ) authored
Allow using full HuggingFace model ID (e.g., Faber8/AbyssOrangeMix2_nsfw) as --model argument by looking up both short name and full ID in MODELS.
-
Stefy Lanza (nextime / spora ) authored
The repos attribute from scan_cache_dir() is a frozenset, not a list, so .pop() doesn't work (frozensets are immutable). Fixed by using next(iter()) instead.
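The behaviour the commit describes, reduced to a standalone illustration (the repo name is invented; in the real code the frozenset comes from huggingface_hub's scan_cache_dir()):

```python
# A frozenset, like scan_cache_dir().repos, is immutable and has no .pop(),
# so the old code raised AttributeError.
repos = frozenset({"Faber8/AbyssOrangeMix2"})
assert not hasattr(repos, "pop")

# The fix: take an arbitrary element without mutating the collection.
first_repo = next(iter(repos))
```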
-
Stefy Lanza (nextime / spora ) authored
The message is now suppressed when using --model-list-batch to make the output cleaner for scripts.
-
Stefy Lanza (nextime / spora ) authored
Added --yes / -y argument that automatically answers 'yes' to confirmation prompts when deleting cached models or clearing the entire cache.

Usage:
  videogen --remove-cached-model MODEL_ID --yes
  videogen --clear-cache --yes
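A minimal argparse sketch of the flag (the confirm helper around it is hypothetical, not the project's actual code):

```python
import argparse

parser = argparse.ArgumentParser(prog="videogen")
parser.add_argument("--yes", "-y", action="store_true",
                    help="automatically answer 'yes' to confirmation prompts")

def confirm(question: str, assume_yes: bool) -> bool:
    """Return True when the user (or --yes) approves a destructive action."""
    if assume_yes:
        return True
    return input(f"{question} [y/N] ").strip().lower() == "y"

args = parser.parse_args(["-y"])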
-
Stefy Lanza (nextime / spora ) authored
The --model-list-batch option was added but wasn't being handled properly. Now it correctly exits after printing the batch output, and is also added to the list of options that don't require --prompt.
-
Stefy Lanza (nextime / spora ) authored
- Fixed model ID consistency: numeric IDs now remain the same when using filters like --nsfw-friendly, --t2v-only, --i2v-only, etc. Previously, filtered lists would renumber models, making --show-model by numeric ID unreliable.
- Added --model-list-batch option for script-friendly output: outputs 'NUMERIC_ID:FULL_MODEL_NAME' format for easy parsing
- Added --output-dir option to specify the directory where output files will be saved
- Fixed syntax error in argparse epilog string that was causing 'SyntaxError: invalid decimal literal'
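The 'NUMERIC_ID:FULL_MODEL_NAME' format is straightforward to consume from a script; a hypothetical parsing snippet (the sample lines are invented):

```python
# Sample --model-list-batch output. Split only on the first ':' so that
# any punctuation inside the full model name survives intact.
sample = "3:Wan-AI/Wan2.2-I2V-A14B\n12:genmo/mochi-1-preview"

models = {}
for line in sample.splitlines():
    numeric_id, full_name = line.split(":", 1)
    models[int(numeric_id)] = full_name
```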
-
Stefy Lanza (nextime / spora ) authored
-
Stefy Lanza (nextime / spora ) authored
- Fixed model ID normalization to handle hyphens (in addition to underscores)
- Fixed dictionary key ordering in base_model_fallbacks so more specific keys (wan2.2.i2v) are checked before generic keys (wan2.2)
- Fixed Wan 2.1 I2V base model mapping (was incorrectly pointing to T2V)
- Fixed base model detection in earlier code sections to check model ID directly instead of relying on m_info.get('supports_i2v')
- Fixed typo: Wan 2.2 generic fallback now correctly uses Wan2.2-T2V

Now Wan 2.2 I2V models like Wan-AI/Wan2.2-I2V-A14B will correctly use Wan-AI/Wan2.2-I2V-14B-Diffusers as the base model instead of the incorrect Wan-AI/Wan2.2-T2V-14B-Diffusers.
-
Stefy Lanza (nextime / spora ) authored
- Fixed mapping table to use correct I2V base model (Wan-AI/Wan2.2-I2V-14B-Diffusers)
- Fixed Diffuser -> Diffusers typo in model IDs
- Updated all Wan 2.2 I2V references to use correct model ID
-
Stefy Lanza (nextime / spora ) authored
-
Stefy Lanza (nextime / spora ) authored
- Add correct base model for Wan 2.2 I2V: Wan-AI/Wan2.2-I2V-A14B-Diffuser
- Add specific VRAM estimate for Wan 2.2 I2V MoE models (~14GB)
- Apply more conservative VRAM calculation for models with weights/LoRAs
- Fix indentation error in add_model_from_hf function
-
Stefy Lanza (nextime / spora ) authored
Calculate: base_vram + 2GB + 50%
This ensures a 14B model estimated at 18GB will require ~29GB instead of 22.5GB.
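The arithmetic as a one-liner (the function name is assumed; the formula is taken from the commit message):

```python
def estimate_vram_gb(base_vram_gb: float) -> float:
    # base + 2 GB fixed overhead + 50% headroom for weights/LoRAs on top
    return base_vram_gb + 2.0 + 0.5 * base_vram_gb

# An 18 GB base estimate becomes 18 + 2 + 9 = 29 GB.
```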
-
Stefy Lanza (nextime / spora ) authored
Instead of adding a fixed 2GB overhead, now calculates 25% more VRAM for base models that will have fine-tuned weights/tensors or LoRA adapters loaded on top.
-
Stefy Lanza (nextime / spora ) authored
The user confirmed that Wan2.2-I2V models should use Wan2.2-T2V as the base model, not the I2V variant.
-
Stefy Lanza (nextime / spora ) authored
The issue was that model IDs from HuggingFace use dots (wan2.2-i2v-a14b) while user config names use underscores (wan2_2_i2v_a14b). Now we normalize the model ID by replacing underscores with dots before matching against the base_model_fallbacks dictionary.
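A sketch of the normalization step (the function name is an assumption; only the underscore-to-dot rule comes from the commit message):

```python
def normalize_model_id(model_id: str) -> str:
    # User config names like 'wan2_2_i2v_a14b' become 'wan2.2.i2v.a14b',
    # matching the dot style used by base_model_fallbacks keys.
    return model_id.lower().replace("_", ".")
```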
-
Stefy Lanza (nextime / spora ) authored
The issue was that model IDs like 'wan2_2_i2v_a14b' would match 'wan2_2' (T2V) before 'wan2_2_i2v' (I2V) because 'wan2_2' comes first in the dictionary. Now the dictionary is ordered with more specific keys first:
- wan2_2_i2v_a14b (most specific)
- wan2.2_i2v_a14b
- wan2_2_i2v
- wan2.2_i2v
- wan2_2
- wan2.2
etc.
This ensures longer/more specific keys are checked before shorter ones.
-
Stefy Lanza (nextime / spora ) authored
Before this fix, using Wan 2.2 I2V models like 'wan2_2_i2v_a14b' would incorrectly try to load 'Wan-AI/Wan2.1-T2V-14B-Diffusers' as the base model because the fallback logic only matched 'wan' generically. Now the base_model_fallbacks dictionary includes specific entries for:
- Wan 2.2 I2V models: wan2_2_i2v, wan2.2_i2v
- Wan 2.2 T2V models: wan2_2, wan2.2
- Wan 2.1 I2V models: wan2_1_i2v, wan2.1_i2v
- Wan 2.1 T2V models: wan2_1, wan2.1
- Generic Wan fallback: wan
The more specific keys are checked first, so model IDs like 'wan2_2_i2v_a14b' will correctly match 'wan2_2_i2v' and use 'Wan-AI/Wan2.2-I2V-14B-Diffusers' as the base model.
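A minimal sketch of the specific-first matching (the keys and base-model IDs follow the commit messages; the lookup loop itself is an assumption):

```python
# Most specific keys first; Python dicts preserve insertion order, so the
# first prefix hit is the most specific one.
base_model_fallbacks = {
    "wan2_2_i2v": "Wan-AI/Wan2.2-I2V-14B-Diffusers",
    "wan2.2_i2v": "Wan-AI/Wan2.2-I2V-14B-Diffusers",
    "wan2_2": "Wan-AI/Wan2.2-T2V-14B-Diffusers",
    "wan2.2": "Wan-AI/Wan2.2-T2V-14B-Diffusers",
    "wan2_1_i2v": "Wan-AI/Wan2.1-I2V-14B-720P-Diffusers",
    "wan2_1": "Wan-AI/Wan2.1-T2V-14B-Diffusers",
    "wan": "Wan-AI/Wan2.1-T2V-14B-Diffusers",
}

def resolve_base_model(model_id: str):
    mid = model_id.lower()
    for key, base in base_model_fallbacks.items():
        if mid.startswith(key):
            return base  # first (most specific) prefix wins
    return None
```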
-
- 25 Feb, 2026 11 commits
-
Stefy Lanza (nextime / spora ) authored
-
Stefy Lanza (nextime / spora ) authored
-
Stefy Lanza (nextime / spora ) authored
- Add get_pipeline_for_task() function that determines pipeline based on model ID AND task type (t2v, i2v, t2i, i2i, v2v)
- Pipeline class is now ALWAYS detected at runtime, not from config
- Remove old dynamic switching code that's now redundant
- Update check_model.py to show runtime detection instead of fixing config
- Update check_pipelines.py to show V2V pipelines
-
Stefy Lanza (nextime / spora ) authored
-
Stefy Lanza (nextime / spora ) authored
- Add WanImageToVideoPipeline and WanVideoToVideoPipeline to PIPELINE_CLASS_MAP
- Add CogVideoXImageToVideoPipeline and CogVideoXVideoToVideoPipeline
- Add AnimateDiffVideoToVideoPipeline
- Add StableDiffusionImg2ImgPipeline for SD 1.5
- Add dynamic pipeline switching logic for Wan, LTX, CogVideoX, AnimateDiff
- The pipeline class is now selected at runtime based on task mode
- Fix detect_pipeline_class to correctly identify Wan models
- Remove duplicate LTX handling code
-
Stefy Lanza (nextime / spora ) authored
-
Stefy Lanza (nextime / spora ) authored
-
Stefy Lanza (nextime / spora ) authored
-
Stefy Lanza (nextime / spora ) authored
-
Stefy Lanza (nextime / spora ) authored
-
Stefy Lanza (nextime / spora ) authored
-