- 26 Feb, 2026 22 commits
-
Stefy Lanza (nextime / spora ) authored
-
Stefy Lanza (nextime / spora ) authored
- Add --model-list-batch to EXAMPLES.md for batch model listing
- Add new CLI options to MCP server (--output-dir, --yes, --audio-chunk)
- Add new CLI options to webapp build_command function
- Update README.md with Output Options section
-
Stefy Lanza (nextime / spora ) authored
Added --audio-chunk argument with 3 modes:
- overlap (default): overlapping chunks like [0-60], [58-118]
- word-boundary: uses Whisper timestamps to split at word boundaries
- vad: uses Voice Activity Detection to skip silence

Also added --audio-chunk-overlap to control overlap duration.

New functions added:
- process_video_with_vad(): VAD-based chunking
- process_video_word_boundary(): word-boundary chunking using Whisper

Modified:
- transcribe_video_audio(): accepts audio_chunk_type and audio_chunk_overlap params
- _transcribe_chunked(): accepts chunk_type and overlap params
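The default overlap mode can be sketched as follows (a minimal illustration; the function name and the 60-second/2-second defaults are hypothetical, chosen to reproduce the [0-60], [58-118] example above):

```python
# Sketch of "overlap" chunking: fixed-length chunks where each chunk
# starts `overlap` seconds before the previous one ended.
def overlap_chunks(duration, chunk_len=60.0, overlap=2.0):
    chunks, start = [], 0.0
    while start < duration:
        end = min(start + chunk_len, duration)
        chunks.append((start, end))
        if end >= duration:  # last chunk reached the end of the audio
            break
        start = end - overlap  # back up to create the overlap window
    return chunks
```

For a 120-second file this yields [0-60], [58-118], [116-120], so each boundary is covered twice and no words are lost at chunk edges.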
-
Stefy Lanza (nextime / spora ) authored
Models like Faber8/AbyssOrangeMix2 are SD1.5 models, not Flux. Now detected as StableDiffusionPipeline instead of FluxPipeline.
-
Stefy Lanza (nextime / spora ) authored
When a model has a VAE configured but the VAE files don't exist in the repository, try loading with the default VAE instead.
-
Stefy Lanza (nextime / spora ) authored
Allow using full HuggingFace model ID (e.g., Faber8/AbyssOrangeMix2_nsfw) as --model argument by looking up both short name and full ID in MODELS.
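A minimal sketch of the dual lookup (the MODELS structure and the resolve_model helper are hypothetical stand-ins, not the actual implementation):

```python
# Hypothetical sketch: MODELS maps short names to entries carrying the
# full HuggingFace ID; --model now accepts either form.
MODELS = {
    "abyssorangemix2_nsfw": {"hf_id": "Faber8/AbyssOrangeMix2_nsfw"},
}

def resolve_model(arg):
    if arg in MODELS:                 # match by short name
        return MODELS[arg]
    for entry in MODELS.values():     # fall back to full HuggingFace ID
        if entry["hf_id"] == arg:
            return entry
    return None
```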
-
Stefy Lanza (nextime / spora ) authored
The repos attribute from scan_cache_dir() is a frozenset, not a list, so .pop() doesn't work. Fixed by using next(iter()) instead.
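The fix can be illustrated with plain Python (the frozenset literal below is a stand-in for the real cache_info.repos value returned by huggingface_hub's scan_cache_dir()):

```python
# Unlike set, frozenset is immutable and has no .pop() at all, so the
# old code raised AttributeError. next(iter(...)) takes one element
# without mutating anything.
repos = frozenset({"Faber8/AbyssOrangeMix2_nsfw"})  # stand-in for cache_info.repos
assert not hasattr(repos, "pop")  # why the original .pop() call failed
first_repo = next(iter(repos))
```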
-
Stefy Lanza (nextime / spora ) authored
The message is now suppressed when using --model-list-batch to make the output cleaner for scripts.
-
Stefy Lanza (nextime / spora ) authored
Added --yes / -y argument that automatically answers 'yes' to confirmation prompts when deleting cached models or clearing the entire cache.

Usage:
  videogen --remove-cached-model MODEL_ID --yes
  videogen --clear-cache --yes
-
Stefy Lanza (nextime / spora ) authored
The --model-list-batch option was added but wasn't being handled properly. Now it correctly exits after printing the batch output, and is also added to the list of options that don't require --prompt.
-
Stefy Lanza (nextime / spora ) authored
- Fixed model ID consistency: numeric IDs now remain the same when using filters like --nsfw-friendly, --t2v-only, --i2v-only, etc. Previously, filtered lists would renumber models, making --show-model by numeric ID unreliable.
- Added --model-list-batch option for script-friendly output: prints 'NUMERIC_ID:FULL_MODEL_NAME' for easy parsing.
- Added --output-dir option to set the directory where output files will be saved.
- Fixed a syntax error in the argparse epilog string that was causing 'SyntaxError: invalid decimal literal'.
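The ID-stability fix can be sketched like this (a minimal illustration with hypothetical model names and filter set; the real code filters model metadata, not strings):

```python
# Numeric IDs are assigned over the FULL model list first, then filters
# are applied, so an ID like 3 always points at the same model no matter
# which filter is active.
all_models = ["model_a", "model_b", "model_c"]   # stand-in model names
numbered = list(enumerate(all_models, start=1))  # stable numeric IDs

nsfw_friendly = {"model_c"}                      # stand-in filter
filtered = [(i, m) for i, m in numbered if m in nsfw_friendly]

# --model-list-batch output format: NUMERIC_ID:FULL_MODEL_NAME
batch_lines = [f"{i}:{m}" for i, m in filtered]
# batch_lines == ["3:model_c"]  (ID 3 survives the filter)
```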
-
Stefy Lanza (nextime / spora ) authored
-
Stefy Lanza (nextime / spora ) authored
- Fixed model ID normalization to handle hyphens (in addition to underscores)
- Fixed dictionary key ordering in base_model_fallbacks so more specific keys (wan2.2.i2v) are checked before generic keys (wan2.2)
- Fixed Wan 2.1 I2V base model mapping (was incorrectly pointing to T2V)
- Fixed base model detection in earlier code sections to check the model ID directly instead of relying on m_info.get('supports_i2v')
- Fixed typo: the Wan 2.2 generic fallback now correctly uses Wan2.2-T2V

Now Wan 2.2 I2V models like Wan-AI/Wan2.2-I2V-A14B will correctly use Wan-AI/Wan2.2-I2V-14B-Diffusers as the base model instead of the incorrect Wan-AI/Wan2.2-T2V-14B-Diffusers.
-
Stefy Lanza (nextime / spora ) authored
- Fixed mapping table to use the correct I2V base model (Wan-AI/Wan2.2-I2V-14B-Diffusers)
- Fixed Diffuser -> Diffusers typo in model IDs
- Updated all Wan 2.2 I2V references to use the correct model ID
-
Stefy Lanza (nextime / spora ) authored
-
Stefy Lanza (nextime / spora ) authored
- Add correct base model for Wan 2.2 I2V: Wan-AI/Wan2.2-I2V-A14B-Diffuser
- Add specific VRAM estimate for Wan 2.2 I2V MoE models (~14GB)
- Apply more conservative VRAM calculation for models with weights/LoRAs
- Fix indentation error in add_model_from_hf function
-
Stefy Lanza (nextime / spora ) authored
Calculate: base_vram + 2GB fixed overhead + 50% of base_vram. This ensures a 14B model estimated at 18GB will require ~29GB instead of 22.5GB.
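The calculation, as a sketch (the function name is hypothetical; the formula is the one stated in this commit):

```python
# base VRAM + 2GB fixed overhead + 50% of base for fine-tuned
# weights/tensors or LoRA adapters loaded on top.
def estimate_required_vram(base_vram_gb):
    return base_vram_gb + 2.0 + 0.5 * base_vram_gb
```

For an 18GB base estimate this yields 18 + 2 + 9 = 29GB, versus 18 * 1.25 = 22.5GB under the previous 25% rule.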
-
Stefy Lanza (nextime / spora ) authored
Instead of adding a fixed 2GB overhead, now calculates 25% more VRAM for base models that will have fine-tuned weights/tensors or LoRA adapters loaded on top.
-
Stefy Lanza (nextime / spora ) authored
The user confirmed that Wan2.2-I2V models should use Wan2.2-T2V as the base model, not the I2V variant.
-
Stefy Lanza (nextime / spora ) authored
The issue was that model IDs from HuggingFace use dots (wan2.2-i2v-a14b) while user config names use underscores (wan2_2_i2v_a14b). Now we normalize the model ID by replacing underscores with dots before matching against the base_model_fallbacks dictionary.
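A sketch of the normalization step (the helper name is hypothetical):

```python
# Config names like wan2_2_i2v_a14b become wan2.2.i2v.a14b before being
# matched against the base_model_fallbacks keys.
def normalize_model_id(model_id):
    return model_id.lower().replace("_", ".")
```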
-
Stefy Lanza (nextime / spora ) authored
The issue was that model IDs like 'wan2_2_i2v_a14b' would match 'wan2_2' (T2V) before 'wan2_2_i2v' (I2V) because 'wan2_2' comes first in the dictionary. Now the dictionary is ordered with more specific keys first:
- wan2_2_i2v_a14b (most specific)
- wan2.2_i2v_a14b
- wan2_2_i2v
- wan2.2_i2v
- wan2_2
- wan2.2
etc.

This ensures longer/more specific keys are checked before shorter ones.
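The ordering logic can be sketched like this (keys and base-model values are illustrative, drawn from these commit messages; the matching helper is hypothetical). Python dicts preserve insertion order, so the first key contained in the normalized model ID wins:

```python
# Most-specific keys first; a generic key like "wan2.2" can no longer
# shadow "wan2.2.i2v".
base_model_fallbacks = {
    "wan2.2.i2v": "Wan-AI/Wan2.2-I2V-14B-Diffusers",
    "wan2.2": "Wan-AI/Wan2.2-T2V-14B-Diffusers",
    "wan2.1.i2v": "Wan-AI/Wan2.1-I2V-14B-Diffusers",  # illustrative value
    "wan2.1": "Wan-AI/Wan2.1-T2V-14B-Diffusers",
    "wan": "Wan-AI/Wan2.1-T2V-14B-Diffusers",
}

def find_base_model(model_id):
    norm = model_id.lower().replace("_", ".")       # normalize separators
    for key, base in base_model_fallbacks.items():  # insertion order
        if key in norm:
            return base
    return None
```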
-
Stefy Lanza (nextime / spora ) authored
Before this fix, using Wan 2.2 I2V models like 'wan2_2_i2v_a14b' would incorrectly try to load 'Wan-AI/Wan2.1-T2V-14B-Diffusers' as the base model because the fallback logic only matched 'wan' generically.

Now the base_model_fallbacks dictionary includes specific entries for:
- Wan 2.2 I2V models: wan2_2_i2v, wan2.2_i2v
- Wan 2.2 T2V models: wan2_2, wan2.2
- Wan 2.1 I2V models: wan2_1_i2v, wan2.1_i2v
- Wan 2.1 T2V models: wan2_1, wan2.1
- Generic Wan fallback: wan

The more specific keys are checked first, so model IDs like 'wan2_2_i2v_a14b' will correctly match 'wan2_2_i2v' and use 'Wan-AI/Wan2.2-I2V-14B-Diffusers' as the base model.
-
- 25 Feb, 2026 18 commits
-
Stefy Lanza (nextime / spora ) authored
-
Stefy Lanza (nextime / spora ) authored
-
Stefy Lanza (nextime / spora ) authored
- Add get_pipeline_for_task() function that determines the pipeline based on model ID AND task type (t2v, i2v, t2i, i2i, v2v)
- Pipeline class is now ALWAYS detected at runtime, not from config
- Remove old dynamic switching code that's now redundant
- Update check_model.py to show runtime detection instead of fixing config
- Update check_pipelines.py to show V2V pipelines
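A simplified, hypothetical sketch of this kind of runtime selection (the dispatch tables and string-matching are illustrative, not the actual implementation; the pipeline class names are those listed in these commit messages, returned as strings to keep the sketch free of a diffusers dependency):

```python
# Pick a pipeline class name from the model family plus the requested task.
def get_pipeline_for_task(model_id, task):
    mid = model_id.lower()
    if "wan" in mid:
        table = {"t2v": "WanPipeline",
                 "i2v": "WanImageToVideoPipeline",
                 "v2v": "WanVideoToVideoPipeline"}
    elif "cogvideox" in mid:
        table = {"t2v": "CogVideoXPipeline",
                 "i2v": "CogVideoXImageToVideoPipeline",
                 "v2v": "CogVideoXVideoToVideoPipeline"}
    else:  # image models such as SD 1.5
        table = {"t2i": "StableDiffusionPipeline",
                 "i2i": "StableDiffusionImg2ImgPipeline"}
    return table.get(task)
```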
-
Stefy Lanza (nextime / spora ) authored
-
Stefy Lanza (nextime / spora ) authored
- Add WanImageToVideoPipeline and WanVideoToVideoPipeline to PIPELINE_CLASS_MAP
- Add CogVideoXImageToVideoPipeline and CogVideoXVideoToVideoPipeline
- Add AnimateDiffVideoToVideoPipeline
- Add StableDiffusionImg2ImgPipeline for SD 1.5
- Add dynamic pipeline switching logic for Wan, LTX, CogVideoX, AnimateDiff
- The pipeline class is now selected at runtime based on task mode
- Fix detect_pipeline_class to correctly identify Wan models
- Remove duplicate LTX handling code
-
Stefy Lanza (nextime / spora ) authored
-
Stefy Lanza (nextime / spora ) authored
-
Stefy Lanza (nextime / spora ) authored
-
Stefy Lanza (nextime / spora ) authored
-
Stefy Lanza (nextime / spora ) authored
-
Stefy Lanza (nextime / spora ) authored
-
Stefy Lanza (nextime / spora ) authored
-
Stefy Lanza (nextime / spora ) authored
-
Stefy Lanza (nextime / spora ) authored
Adjust VRAM checking to allow models that need up to 10% less than the available VRAM (i.e., up to 90% of it), or the full available VRAM with an offload strategy
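One possible reading of this check, as a sketch (the helper name and the exact interpretation of the 10% margin are assumptions):

```python
# Without offload, require 10% headroom: the model may use at most 90%
# of available VRAM. With an offload strategy, allow the full amount.
def vram_check_passes(model_vram_gb, available_gb, use_offload):
    limit = available_gb if use_offload else available_gb * 0.9
    return model_vram_gb <= limit
```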
-
Stefy Lanza (nextime / spora ) authored
-
Stefy Lanza (nextime / spora ) authored
-
Stefy Lanza (nextime / spora ) authored
-
Stefy Lanza (nextime / spora ) authored
-