1. 25 Feb, 2026 11 commits
    • Add 404 fallback to deferred I2V model loading · 344cd12a
      Stefy Lanza (nextime / spora ) authored
      - Apply same 404 fallback strategy to deferred I2V model loading
      - Try DiffusionPipeline as fallback when model_index.json not found
      - Ensures all model loading paths have consistent error handling
    • Fix model loading 404 errors and improve time estimation · c2c62b60
      Stefy Lanza (nextime / spora ) authored
      Model Loading Fixes:
      - Add fallback loading when model_index.json returns 404
      - Try alternative paths (diffusers/, diffusion_model/, pipeline/)
      - Try generic DiffusionPipeline as fallback
      - Check HuggingFace API for actual file structure
      - Load from subdirectories if model_index.json found there
      - Apply same fallback to I2V image model loading
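
The fallback order described above can be sketched roughly as follows. This is a minimal illustration, not the project's code: `load_with_fallback`, `ModelIndexNotFound`, and the loader callables are hypothetical names, and the order of attempts (all subfolders with the primary loader, then all with the generic one) is an assumption. In practice the callables would wrap the model's own pipeline class and the generic `DiffusionPipeline`.

```python
from typing import Callable, Optional, Sequence

# Subfolders named in the commit message; the empty string means the repo root.
CANDIDATE_SUBFOLDERS: Sequence[str] = ("", "diffusers", "diffusion_model", "pipeline")


class ModelIndexNotFound(Exception):
    """Hypothetical stand-in for the 404 raised when model_index.json is missing."""


def load_with_fallback(
    model_id: str,
    primary_loader: Callable[[str, str], object],
    generic_loader: Optional[Callable[[str, str], object]] = None,
) -> object:
    """Try each candidate subfolder with the primary loader, then the generic one."""
    loaders = [primary_loader] + ([generic_loader] if generic_loader else [])
    last_error: Optional[Exception] = None
    for loader in loaders:
        for subfolder in CANDIDATE_SUBFOLDERS:
            try:
                return loader(model_id, subfolder)
            except ModelIndexNotFound as err:
                last_error = err  # remember the failure and keep trying
    raise RuntimeError(f"No loadable pipeline found for {model_id}") from last_error
```

Passing the loaders in as callables keeps the retry logic independent of any particular library and makes it easy to exercise without network access.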
      
      Time Estimation Improvements:
      - Add hardware detection (GPU model, VRAM, RAM, CPU cores)
      - Detect GPU tier (extreme/high/medium/low/very_low)
      - Calculate realistic time estimates based on GPU performance
      - Account for VRAM constraints and offloading penalty
      - Consider distributed/multi-GPU setups
      - More accurate model loading times (minutes, not seconds)
      - Account for resolution impact (quadratic relationship)
      - Add 20% overhead for memory management
      - Print hardware info for transparency
      
      GPU Tier Performance Multipliers:
      - Extreme (RTX 4090, A100, H100): 1.0x
      - High (RTX 4080, RTX 3090, V100): 1.5x
      - Medium (RTX 4070, RTX 3080, T4): 2.5x
      - Low (RTX 3060, RTX 2070): 4.0x
      - Very Low (GTX 1060, etc.): 8.0x
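
As a rough sketch of how these multipliers might combine with the resolution scaling and the 20% overhead listed earlier: the function name, the 512x512 reference resolution, and the `offload_penalty` parameter are assumptions for illustration, and "quadratic" is read here as scaling with pixel count (quadratic in the linear dimension).

```python
# Tier multipliers copied from the commit message above.
GPU_TIER_MULTIPLIER = {
    "extreme": 1.0,
    "high": 1.5,
    "medium": 2.5,
    "low": 4.0,
    "very_low": 8.0,
}


def estimate_generation_seconds(
    base_seconds: float,
    tier: str,
    width: int,
    height: int,
    base_width: int = 512,      # assumed reference resolution
    base_height: int = 512,
    offload_penalty: float = 1.0,  # assumed factor, > 1.0 when offloading to CPU
) -> float:
    """Scale a baseline time by GPU tier, pixel count, and offloading penalty."""
    pixel_ratio = (width * height) / (base_width * base_height)
    seconds = base_seconds * GPU_TIER_MULTIPLIER[tier] * pixel_ratio * offload_penalty
    return seconds * 1.20  # 20% overhead for memory management
```

For example, a 10-second baseline on a medium-tier GPU at 1024x1024 gives 10 × 2.5 × 4 × 1.2 = 120 seconds under these assumptions.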
    • Add video dubbing, translation, and subtitle features · 6505a00a
      Stefy Lanza (nextime / spora ) authored
      Features Added:
      - Video dubbing with voice preservation (--dub-video)
      - Automatic subtitle generation (--create-subtitles)
      - Subtitle translation (--translate-subtitles)
      - Burn subtitles into video (--burn-subtitles)
      - Audio transcription using Whisper (--transcribe)
      - Text translation using MarianMT models
      
      New Command-Line Arguments:
      - --transcribe: Transcribe audio from video
      - --whisper-model: Select Whisper model size (tiny/base/small/medium/large)
      - --source-lang: Source language code
      - --target-lang: Target language code for translation
      - --create-subtitles: Create SRT subtitles from video
      - --translate-subtitles: Translate subtitles to target language
      - --burn-subtitles: Burn subtitles into video
      - --subtitle-style: Customize subtitle appearance
      - --dub-video: Translate and dub video with voice preservation
      - --voice-clone/--no-voice-clone: Enable/disable voice cloning
      
      MCP Server Updates:
      - Added videogen_transcribe_video tool
      - Added videogen_create_subtitles tool
      - Added videogen_dub_video tool
      - Added videogen_translate_text tool
      
      Documentation Updates:
      - Updated SKILL.md with dubbing/translation section
      - Updated EXAMPLES.md with comprehensive examples
      - Updated requirements.txt with openai-whisper dependency
      
      Supported Languages:
      English, Spanish, French, German, Italian, Portuguese, Russian, Chinese, Japanese, Korean, Arabic, Hindi, Dutch, Polish, Turkish, Vietnamese, Thai, Indonesian, Swedish, Ukrainian
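
To illustrate the subtitle-generation step, here is a minimal sketch that turns timed transcript segments into SRT text. It assumes segments shaped like the `start`/`end`/`text` dictionaries that Whisper transcription produces; the function names are hypothetical and not taken from the project.

```python
from typing import Dict, Iterable


def format_srt_timestamp(seconds: float) -> str:
    """Format seconds as the HH:MM:SS,mmm timestamp used by SRT files."""
    total_ms = int(round(seconds * 1000))
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def segments_to_srt(segments: Iterable[Dict]) -> str:
    """Render segments as numbered SRT cues separated by blank lines."""
    blocks = []
    for index, seg in enumerate(segments, start=1):
        start = format_srt_timestamp(seg["start"])
        end = format_srt_timestamp(seg["end"])
        blocks.append(f"{index}\n{start} --> {end}\n{seg['text'].strip()}\n")
    return "\n".join(blocks)
```

Translating subtitles would then amount to replacing each segment's `text` before rendering, leaving the timestamps untouched.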
    • Add model type filters and update MCP server · 1c01f5b7
      Stefy Lanza (nextime / spora ) authored
      Features Added:
      - Model type filters: --t2i-only, --v2v-only, --v2i-only, --3d-only, --tts-only, --audio-only
      - Enhanced model list table with new capability columns (V2V, V2I, 3D, TTS)
      - Updated detect_model_type() to detect all model capabilities
      
      MCP Server Updates:
      - Added videogen_video_to_video tool for V2V style transfer
      - Added videogen_apply_video_filter tool for video filters
      - Added videogen_extract_frames tool for frame extraction
      - Added videogen_create_collage tool for thumbnail grids
      - Added videogen_upscale_video tool for AI upscaling
      - Added videogen_convert_3d tool for 2D-to-3D conversion
      - Added videogen_concat_videos tool for video concatenation
      - Updated model list filter to support all new types
      
      SKILL.md Updates:
      - Added V2V, V2I, 3D to generation types table
      - Added model filter examples
      - Added 8 new use cases for V2V, filters, frames, collage, upscale, 3D, concat
    • Add V2V, V2I, 2D-to-3D conversion, and cluster documentation · e69c2d81
      Stefy Lanza (nextime / spora ) authored
      Features Added:
      - Video-to-Video (V2V): Style transfer, filters, concatenation
      - Video-to-Image (V2I): Frame extraction, keyframes, collages
      - 2D-to-3D Conversion: SBS, anaglyph, VR 360 formats
      - Video upscaling with AI (ESRGAN, Real-ESRGAN, SwinIR)
      - Video filters (grayscale, sepia, blur, speed, slow-mo, etc.)
      
      Command-line Arguments:
      - --video: Input video file for V2V/V2I operations
      - --video-to-video: Enable V2V style transfer
      - --video-filter: Apply video filters
      - --extract-frame, --extract-keyframes, --extract-frames
      - --convert-3d-sbs, --convert-3d-anaglyph, --convert-vr
      - --upscale-video, --upscale-method
      
      Model Discovery:
      - Added depth estimation models to --update-models
      - Added 2D-to-3D model searches
      - Added V2V style transfer models
      
      Documentation:
      - Updated README.md with new features
      - Added comprehensive V2V/V2I/2D-to-3D examples
      - Added multi-node cluster setup guide
      - Added NFS shared storage configuration
    • Add V2V (Video-to-Video), V2I (Video-to-Image), and video processing features · 6f862e60
      Stefy Lanza (nextime / spora ) authored
      - Add video frame extraction (extract_video_frames, extract_keyframes)
      - Add video info retrieval (get_video_info)
      - Add frames to video conversion (frames_to_video)
      - Add video upscaling with AI support (upscale_video)
      - Add video-to-video style transfer (video_to_video_style_transfer)
      - Add video-to-image extraction (video_to_image)
      - Add video collage creation (create_video_collage)
      - Add video filters (apply_video_filter - grayscale, sepia, blur, etc.)
      - Add video concatenation (concat_videos)
      - Add image upscaling (upscale_image)
      
      Features:
      - Extract frames at specific FPS or timestamps
      - AI upscaling with ESRGAN/SwinIR support
      - Scene detection for keyframe extraction
      - Multiple video filters and effects
      - Video concatenation with re-encoding or stream copy
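
Two of the operations above, frame extraction at a fixed FPS and stream-copy concatenation, could be driven by ffmpeg along these lines. The commit does not say which backend is used, so ffmpeg is an assumption here, and the helper names are hypothetical; the functions only build argument lists and do not run anything.

```python
from pathlib import Path
from typing import List


def build_extract_frames_cmd(video: str, out_dir: str, fps: float) -> List[str]:
    """Build an ffmpeg command that samples `fps` frames per second as PNG files."""
    pattern = str(Path(out_dir) / "frame_%05d.png")
    return ["ffmpeg", "-i", video, "-vf", f"fps={fps}", pattern]


def build_concat_copy_cmd(list_file: str, output: str) -> List[str]:
    """Build an ffmpeg concat-demuxer command that copies streams without re-encoding.

    `list_file` is a text file with one "file 'path'" line per input clip.
    """
    return ["ffmpeg", "-f", "concat", "-safe", "0", "-i", list_file, "-c", "copy", output]
```

Either list can be passed to `subprocess.run(cmd, check=True)`. Stream copy is fast but only works when all clips share codec and encoding parameters; otherwise re-encoding is needed.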
    • Add character consistency features: IP-Adapter, InstantID, Character Profiles, LoRA Training · b0d20d0b
      Stefy Lanza (nextime / spora ) authored
      - Add IP-Adapter integration for character consistency using reference images
      - Add InstantID support for superior face identity preservation
      - Add Character Profile System to store reference images and face embeddings
      - Add LoRA Training Workflow for perfect character consistency
      - Add command-line arguments for all character consistency features
      - Update EXAMPLES.md with comprehensive character consistency documentation
      - Update requirements.txt with optional dependencies (insightface, onnxruntime)
      
      New commands:
      - --character: Use saved character profile
      - --create-character: Create new character profile from reference images
      - --list-characters: List all saved profiles
      - --show-character: Show profile details
      - --ipadapter: Enable IP-Adapter for consistency
      - --instantid: Enable InstantID for face identity
      - --train-lora: Train custom LoRA for character
    • Validate base model exists before adding LoRA to model list · 84d460f6
      Stefy Lanza (nextime / spora ) authored
      - When --update-models detects a LoRA adapter, validate that the base
        model exists on HuggingFace before adding it to the model list
      - Skip LoRAs whose base models are not found on HuggingFace
      - Added support for flux and sdxl base model detection
      - Print informative messages when skipping LoRAs with missing base models
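
The validation step might look something like the sketch below. The entry layout (`id`, `base_model`), the function name, and the injected `base_exists` callable are assumptions for illustration; in a real run the callable would perform a HuggingFace Hub lookup. Skipping entries with no declared base model is also an assumption.

```python
from typing import Callable, Dict, Iterable, List


def filter_loras_with_base(
    candidates: Iterable[Dict[str, str]],
    base_exists: Callable[[str], bool],
) -> List[Dict[str, str]]:
    """Keep only LoRA entries whose declared base model can be found."""
    kept: List[Dict[str, str]] = []
    cache: Dict[str, bool] = {}  # avoid repeating lookups for a shared base model
    for entry in candidates:
        base = entry.get("base_model")
        if not base:
            print(f"Skipping {entry['id']}: no base model declared")
            continue
        if base not in cache:
            cache[base] = base_exists(base)
        if cache[base]:
            kept.append(entry)
        else:
            print(f"Skipping {entry['id']}: base model {base} not found")
    return kept
```

Caching the lookup result matters when many adapters share one base model, since each check would otherwise be a separate network request.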
    • Fix: Add peft to requirements.txt for LoRA adapter support · 2e8b5bc7
      Stefy Lanza (nextime / spora ) authored
      PEFT (Parameter-Efficient Fine-Tuning) is required for loading LoRA
      adapters with pipe.load_lora_weights(). Without it, LoRA loading fails
      with: 'PEFT backend is required for this method.'
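
A caller could also guard against the missing dependency at runtime rather than relying on requirements.txt alone. The sketch below is an illustration only: the helper names are hypothetical, and `pipe` is assumed to be any object exposing the `load_lora_weights()` method mentioned above.

```python
import importlib.util


def peft_available() -> bool:
    """Return True if the peft package can be imported in this environment."""
    return importlib.util.find_spec("peft") is not None


def load_lora_if_possible(pipe, lora_id: str) -> bool:
    """Load a LoRA adapter only when peft is installed; report and skip otherwise."""
    if not peft_available():
        print("peft is not installed; skipping LoRA. Install it with: pip install peft")
        return False
    pipe.load_lora_weights(lora_id)
    return True
```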
    • Feat: Update models.json when pipeline mismatch is detected and corrected · 2b570a0a
      Stefy Lanza (nextime / spora ) authored
      - Add update_model_pipeline_class() function to update model config
      - Call function when main model pipeline mismatch is corrected
      - Call function when image model pipeline mismatch is corrected
      - Ensures future runs use the correct pipeline class automatically
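
A function of this kind might be sketched as follows. Only the name `update_model_pipeline_class` comes from the commit message; the signature, the layout of models.json (a mapping from model name to a settings object), and the `pipeline_class` key are assumptions made for illustration.

```python
import json
from pathlib import Path


def update_model_pipeline_class(config_path: str, model_name: str, pipeline_class: str) -> bool:
    """Persist a corrected pipeline class; return True if the file was changed."""
    path = Path(config_path)
    models = json.loads(path.read_text(encoding="utf-8"))
    entry = models.get(model_name)
    if entry is None or entry.get("pipeline_class") == pipeline_class:
        return False  # unknown model or nothing to update
    entry["pipeline_class"] = pipeline_class
    # Write to a temporary file first so a crash cannot leave a truncated config.
    tmp = path.with_suffix(".tmp")
    tmp.write_text(json.dumps(models, indent=2), encoding="utf-8")
    tmp.replace(path)
    return True
```

Returning a boolean lets the caller log the correction only when the stored value actually changed.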
  2. 24 Feb, 2026 16 commits