Commits · acfb9c78a5a0bc63c013d1dab163d1f985328e5c · nexlab / videogen

28 Feb, 2026 2 commits
- Merge branch 'experimental' · acfb9c78
  Stefy Lanza (nextime / spora ) authored Feb 28, 2026
  
  acfb9c78
- Auto-detect and fix inverted colors · e7a0c182
  Stefy Lanza (nextime / spora ) authored Feb 28, 2026
```
- Detect inverted colors by checking mean brightness
- Automatically invert if mean > 0.7 (indicating mostly white/bright)
- Print warning when color correction is applied
```
  e7a0c182
27 Feb, 2026 25 commits

Merge branch 'experimental' · dd47f79a
Stefy Lanza (nextime / spora ) authored Feb 27, 2026

dd47f79a

Fix color inversion in video export · 1e1ea201

Stefy Lanza (nextime / spora ) authored Feb 27, 2026

- Handle [-1, 1] range from diffusion models
- Properly normalize to [0, 1] before converting to uint8
- Add clipping to ensure valid color ranges

1e1ea201

Merge branch 'experimental' · 87ed1011
Stefy Lanza (nextime / spora ) authored Feb 27, 2026

87ed1011

Fix frame export for WanImageToVideoPipeline · 40787d0a

Stefy Lanza (nextime / spora ) authored Feb 27, 2026

- Add proper tensor/numpy conversion before export_to_video
- Handle multiple channel formats (NCHW to NHWC)
- Ensure RGB format and uint8 range for video export

40787d0a

Merge branch 'experimental' · 258c584b
Stefy Lanza (nextime / spora ) authored Feb 27, 2026

258c584b
Fix WanImageToVideoPipeline I2V support · b937667b
Stefy Lanza (nextime / spora ) authored Feb 27, 2026
```
Add WanImageToVideoPipeline to i2v_pipelines list so image argument is passed correctly
```
b937667b
Merge branch 'experimental' · c8cf5eac
Stefy Lanza (nextime / spora ) authored Feb 27, 2026

c8cf5eac

Fix Wan2.2 I2V LoRA adapter detection and base model suffix · 9f29b0c1

Stefy Lanza (nextime / spora ) authored Feb 27, 2026

- Add model ID check for I2V detection (not just tags)
- Add -Diffusers suffix when extracting base model from tags
- Add runtime validation for Wan LoRA base model against tags

9f29b0c1

Merge branch 'experimental' · 186980e3
Stefy Lanza (nextime / spora ) authored Feb 27, 2026

186980e3

Fix base model detection for Wan LoRA adapters · 410f97e9

Stefy Lanza (nextime / spora ) authored Feb 27, 2026

For Wan LoRA adapters, skip reading base_model from stored config
(which may have incorrect values like T2V instead of I2V).
Instead, always infer from LoRA ID by checking for 'i2v' in the name.

This ensures wan2_2_i2v_general_nsfw_lora uses the correct
Wan-AI/Wan2.2-I2V-A14B-Diffusers base model instead of the
incorrect Wan-AI/Wan2.2-T2V-A14B-Diffusers.

410f97e9

Merge branch 'experimental' · 4254eddc
Stefy Lanza (nextime / spora ) authored Feb 27, 2026

4254eddc

Fix I2V detection for LoRA adapters (e.g., wan2_2_i2v_general_nsfw_lora) · e668b610

Stefy Lanza (nextime / spora ) authored Feb 27, 2026

The check at line 9414 only checked the stored supports_i2v flag
from the model configuration, but didn't check the model ID string
for 'i2v' like the detect_model_type() function does.

Now the I2V validation also detects I2V capability from the model ID,
making it consistent with detect_model_type() and properly detecting
I2V capability for LoRA adapters like lopi999/Wan2.2-I2V_General-NSFW-LoRA.

e668b610

Merge branch 'experimental' · e6901870
Stefy Lanza (nextime / spora ) authored Feb 27, 2026

e6901870

Fix LoRA base model and I2V detection · 8eca258c

Stefy Lanza (nextime / spora ) authored Feb 27, 2026

- Always infer base model from LoRA name, not stored database value
- Fix supports_i2v detection for LoRAs based on LoRA name
- Ensures Wan 2.2 I2V LoRAs use correct I2V base model

8eca258c

Merge branch 'experimental' · d0b89e57
Stefy Lanza (nextime / spora ) authored Feb 27, 2026

d0b89e57
Fix Open-Sora model name to Open-Sora-v2 · 4a3889e4
Stefy Lanza (nextime / spora ) authored Feb 27, 2026

4a3889e4
Merge branch 'experimental' · af8063a2
Stefy Lanza (nextime / spora ) authored Feb 27, 2026

af8063a2
Update model references: Open-Sora-2 and Mochi-1-Preview · 76df0069
Stefy Lanza (nextime / spora ) authored Feb 27, 2026
```
- hpcai-tech/Open-Sora-1.2 -> hpcai-tech/Open-Sora-2
- genmo/mochi -> genmo/mochi-1-preview
```
76df0069

Update Wan 2.1 I2V base model to use 720P resolution variant · 4866982f

Stefy Lanza (nextime / spora ) authored Feb 27, 2026

When selecting a Wan 2.1 I2V model that is a LoRA or tensor weight,
the base model now uses Wan-AI/Wan2.1-I2V-14B-720P-Diffusers instead
of the generic Wan-AI/Wan2.1-I2V-14B-Diffusers.

4866982f

Fix tuple unpacking error in print_model_list · 1456df8d

Stefy Lanza (nextime / spora ) authored Feb 27, 2026

Fixed ValueError: too many values to unpack (expected 5)
- Line 3929: Added missing 6th element to unpacking
- Line 3936: Added missing 6th element to unpacking

The results tuple has 6 elements:
(name, info, caps, is_disabled, fail_count, orig_idx)

1456df8d

Update Wan 2.1 I2V base model to use 720P resolution variant · b6d75d21

Stefy Lanza (nextime / spora ) authored Feb 27, 2026

When selecting a Wan 2.1 I2V model that is a LoRA or tensor weight,
the base model now uses Wan-AI/Wan2.1-I2V-14B-720P-Diffusers instead
of the generic Wan-AI/Wan2.1-I2V-14B-Diffusers.

b6d75d21

Fix tuple unpacking error in print_model_list · 8459a4d7

Stefy Lanza (nextime / spora ) authored Feb 27, 2026

Fixed ValueError: too many values to unpack (expected 5)
- Line 3929: Added missing 6th element to unpacking
- Line 3936: Added missing 6th element to unpacking

The results tuple has 6 elements:
(name, info, caps, is_disabled, fail_count, orig_idx)

8459a4d7

Merge branch 'experimental' into 'master' · 6db57c26
Stefy Lanza (nextime / spora ) authored Feb 27, 2026
```
Fix I2V pipeline auto-detection

See merge request !1
```
6db57c26

Fix I2V pipeline auto-detection · 1ef3d1b8

Stefy Lanza (nextime / spora ) authored Feb 27, 2026

- Add detect_model_family() to identify model family (wan, sdxl, sd, ltx, etc.)
- Add get_pipeline_for_model_family() for proper pipeline selection based on family + task
- Enhance detect_generation_type() to check --image FIRST for I2V detection
- Add support for --image_model, --prompt_image, --prompt_animation as I2V indicators
- Add support for audio/subtitle options as T2V+V2V chaining indicators

This fixes the issue where SDXL models were incorrectly using WanPipeline
for I2V tasks, causing type mismatch errors (expected UMT5EncoderModel,
got CLIPTextModel). Now SDXL models correctly use DiffusionPipeline
or StableDiffusionXLPipeline.

1ef3d1b8

Update SKILL.md with new CLI options and MCP documentation · 803b1763
Stefy Lanza (nextime / spora ) authored Feb 27, 2026

803b1763

26 Feb, 2026 13 commits

Reduce VRAM estimation overhead from 50% to 25% · c2ef12d3
Stefy Lanza (nextime / spora ) authored Feb 26, 2026

c2ef12d3

Update documentation and integrations with new CLI options · 30b322a6

Stefy Lanza (nextime / spora ) authored Feb 26, 2026

- Add --model-list-batch to EXAMPLES.md for batch model listing
- Add new CLI options to MCP server (--output-dir, --yes, --audio-chunk)
- Add new CLI options to webapp build_command function
- Update README.md with Output Options section

30b322a6

Add --audio-chunk option for audio/video chunking strategies · 20db65c1

Stefy Lanza (nextime / spora ) authored Feb 26, 2026

Added --audio-chunk argument with 3 modes:
- overlap (default): overlapping chunks like [0-60], [58-118]
- word-boundary: uses Whisper timestamps to split at word boundaries
- vad: uses Voice Activity Detection to skip silence

Also added --audio-chunk-overlap to control overlap duration.

New functions added:
- process_video_with_vad(): VAD-based chunking
- process_video_word_boundary(): Word-boundary chunking using Whisper

Modified:
- transcribe_video_audio(): accepts audio_chunk_type and audio_chunk_overlap params
- _transcribe_chunked(): accepts chunk_type and overlap params

20db65c1

Add detection for SD1.5 models (AbyssOrangeMix, NAI, etc.) · caf3c707

Stefy Lanza (nextime / spora ) authored Feb 26, 2026

Models like Faber8/AbyssOrangeMix2 are SD1.5 models, not Flux.
Now detected as StableDiffusionPipeline instead of FluxPipeline.

caf3c707

Add handling for missing VAE files · 8082b88e

Stefy Lanza (nextime / spora ) authored Feb 26, 2026

When a model has a VAE configured but the VAE files don't exist in
the repository, try loading with the default VAE instead.

8082b88e

Fix model lookup by HuggingFace ID · 8b05676e

Stefy Lanza (nextime / spora ) authored Feb 26, 2026

Allow using full HuggingFace model ID (e.g., Faber8/AbyssOrangeMix2_nsfw)
as --model argument by looking up both short name and full ID in MODELS.

8b05676e

Fix frozenset error in clear_cache · 96adc8b3

Stefy Lanza (nextime / spora ) authored Feb 26, 2026

The repos attribute from scan_cache_dir() is a frozenset, not a list,
so .pop() doesn't work. Fixed by using next(iter()) instead.

96adc8b3

Suppress 'Loaded N models' message in batch mode · 2c0ed5db
Stefy Lanza (nextime / spora ) authored Feb 26, 2026
```
The message is now suppressed when using --model-list-batch to make
the output cleaner for scripts.
```
2c0ed5db

Add --yes flag for auto-confirmation of cache deletion · 2e86db90

Stefy Lanza (nextime / spora ) authored Feb 26, 2026

Added --yes / -y argument that automatically answers 'yes' to confirmation
prompts when deleting cached models or clearing the entire cache.

Usage:
  videogen --remove-cached-model MODEL_ID --yes
  videogen --clear-cache --yes

2e86db90

Fix --model-list-batch to actually work · 506faa85

Stefy Lanza (nextime / spora ) authored Feb 26, 2026

The --model-list-batch option was added but wasn't being handled properly.
Now it correctly exits after printing the batch output, and is also
added to the list of options that don't require --prompt.

506faa85

Fix model ID consistency with filters and add new CLI options · 4c421b4f

Stefy Lanza (nextime / spora ) authored Feb 26, 2026

- Fixed model ID consistency: numeric IDs now remain the same when using
  filters like --nsfw-friendly, --t2v-only, --i2v-only, etc.
  Previously, filtered lists would renumber models making --show-model
  by numeric ID unreliable.

- Added --model-list-batch option for script-friendly output:
  Outputs 'NUMERIC_ID:FULL_MODEL_NAME' format for easy parsing

- Added --output-dir option to specify output directory:
  Sets the directory where output files will be saved

- Fixed syntax error in argparse epilog string that was causing
  'SyntaxError: invalid decimal literal'

4c421b4f

Fix Wan 2.2 base model IDs to use A14B suffix · 1f33b9e5
Stefy Lanza (nextime / spora ) authored Feb 26, 2026

1f33b9e5

Fix Wan 2.2 I2V base model detection · 6a982a8b

Stefy Lanza (nextime / spora ) authored Feb 26, 2026

- Fixed model ID normalization to handle hyphens (in addition to underscores)
- Fixed dictionary key ordering in base_model_fallbacks so more specific keys (wan2.2.i2v) are checked before generic keys (wan2.2)
- Fixed Wan 2.1 I2V base model mapping (was incorrectly pointing to T2V)
- Fixed base model detection in earlier code sections to check model ID directly instead of relying on m_info.get('supports_i2v')
- Fixed typo: Wan 2.2 generic fallback now correctly uses Wan2.2-T2V

Now Wan 2.2 I2V models like Wan-AI/Wan2.2-I2V-A14B will correctly use Wan-AI/Wan2.2-I2V-14B-Diffusers as the base model instead of the incorrect Wan-AI/Wan2.2-T2V-14B-Diffusers.

6a982a8b