Add V2V, V2I, 2D-to-3D conversion, and cluster documentation

Features Added:
- Video-to-Video (V2V): Style transfer, filters, concatenation
- Video-to-Image (V2I): Frame extraction, keyframes, collages
- 2D-to-3D Conversion: SBS, anaglyph, VR 360 formats
- Video upscaling with AI (ESRGAN, Real-ESRGAN, SwinIR)
- Video filters (grayscale, sepia, blur, speed, slow-mo, etc.)

Command-line Arguments:
- --video: Input video file for V2V/V2I operations
- --video-to-video: Enable V2V style transfer
- --video-filter: Apply video filters
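- --extract-frame, --extract-keyframes, --extract-frames: Extract a single frame, keyframes, or all frames
- --convert-3d-sbs, --convert-3d-anaglyph, --convert-vr: Convert 2D video to 3D SBS, anaglyph, or VR 360
- --upscale-video, --upscale-method: Upscale video with AI and select the upscaling method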
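
For illustration only, the flags listed above could be declared with argparse roughly as follows. This is a minimal sketch, not the project's actual parser: which flags are boolean switches and which take a value is an assumption, and the function name build_parser is hypothetical.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Hypothetical parser covering the flags named in this commit message."""
    parser = argparse.ArgumentParser(description="V2V / V2I / 2D-to-3D options (sketch)")

    # Input video shared by the V2V and V2I operations.
    parser.add_argument("--video", metavar="PATH", help="input video file")

    # Assumption: style transfer is a switch, the filter takes a name.
    parser.add_argument("--video-to-video", action="store_true", help="enable V2V style transfer")
    parser.add_argument("--video-filter", metavar="NAME", help="video filter to apply")

    # Assumption: the extraction and conversion flags are plain switches.
    for flag in ("--extract-frame", "--extract-keyframes", "--extract-frames",
                 "--convert-3d-sbs", "--convert-3d-anaglyph", "--convert-vr",
                 "--upscale-video"):
        parser.add_argument(flag, action="store_true")

    # Assumption: the method is a free-form name (e.g. one of the upscalers listed above).
    parser.add_argument("--upscale-method", metavar="NAME", help="upscaling method")
    return parser


if __name__ == "__main__":
    print(build_parser().parse_args())
```

argparse turns dashes into underscores, so `--convert-3d-sbs` becomes `args.convert_3d_sbs` in the parsed namespace.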

Model Discovery:
- Added depth estimation models to --update-models
- Added 2D-to-3D model searches
- Added V2V style transfer models

Documentation:
- Updated README.md with new features
- Added comprehensive V2V/V2I/2D-to-3D examples
- Added multi-node cluster setup guide
- Added NFS shared storage configuration
parent 6f862e60
@@ -2,7 +2,7 @@
**Copyleft © 2026 Stefy <stefy@nexlab.net>**
-A comprehensive, GPU-accelerated video generation toolkit supporting Text-to-Video (T2V), Image-to-Video (I2V), Text-to-Image (T2I), and Image-to-Image (I2I) generation with audio synthesis, synchronization, and lip-sync capabilities.
+A comprehensive, GPU-accelerated video generation toolkit supporting Text-to-Video (T2V), Image-to-Video (I2V), Text-to-Image (T2I), Image-to-Image (I2I), Video-to-Video (V2V), Video-to-Image (V2I), and 2D-to-3D conversion with audio synthesis, synchronization, and lip-sync capabilities.
---
@@ -13,6 +13,20 @@ A comprehensive, GPU-accelerated video generation toolkit supporting Text-to-Vid
- **Image-to-Video (I2V)**: Animate static images
- **Text-to-Image (T2I)**: Generate high-quality images
- **Image-to-Image (I2I)**: Transform existing images
+- **Video-to-Video (V2V)**: Style transfer and filters for videos
+- **Video-to-Image (V2I)**: Extract frames and keyframes from videos
+### Video Processing
+- **Video Upscaling**: AI-powered video upscaling (ESRGAN, Real-ESRGAN, SwinIR)
+- **Video Filters**: Grayscale, sepia, blur, sharpen, contrast, speed, slow-mo, reverse, fade, denoise, stabilize
+- **Video Concatenation**: Join multiple videos
+- **Frame Extraction**: Extract single frames, keyframes, or all frames
+### 2D-to-3D Conversion
+- **3D Side-by-Side (SBS)**: Convert 2D videos to 3D SBS format for VR headsets and 3D TVs
+- **3D Anaglyph**: Convert to red/cyan anaglyph format for 3D glasses
+- **VR 360**: Convert 2D videos to VR 360 equirectangular format
+- **Depth Estimation**: AI-powered depth map generation
### Audio Capabilities
- **Text-to-Speech (TTS)**: Multiple voices via Bark and Edge-TTS
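
To make the 2D-to-3D formats named in this hunk concrete, here is a minimal NumPy sketch of how a side-by-side frame and a red/cyan anaglyph can be composed once a second view exists. It is not the toolkit's implementation: the function names, the RGB channel order, the depth map normalized to [0, 1], and the simple per-pixel column shift used to synthesize the second view are all assumptions made for illustration.

```python
import numpy as np


def shift_by_depth(frame: np.ndarray, depth: np.ndarray, max_disparity: int = 8) -> np.ndarray:
    """Synthesize a second view by sampling each pixel from a depth-dependent column.

    frame: (H, W, 3) image; depth: (H, W) floats in [0, 1] (assumed normalization).
    """
    h, w = depth.shape
    offsets = np.rint(depth * max_disparity).astype(np.intp)
    # Source column for every output pixel, clamped to the image bounds.
    cols = np.clip(np.arange(w)[None, :] + offsets, 0, w - 1)
    rows = np.arange(h)[:, None]
    return frame[rows, cols]


def make_sbs(left: np.ndarray, right: np.ndarray) -> np.ndarray:
    """Side-by-side frame: left view and right view concatenated along the width."""
    return np.concatenate([left, right], axis=1)


def make_anaglyph(left: np.ndarray, right: np.ndarray) -> np.ndarray:
    """Red/cyan anaglyph: red from the left view, green and blue from the right (RGB order assumed)."""
    out = right.copy()
    out[..., 0] = left[..., 0]
    return out


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    frame = rng.integers(0, 256, size=(4, 6, 3), dtype=np.uint8)
    depth = rng.random((4, 6))
    right = shift_by_depth(frame, depth, max_disparity=2)
    print(make_sbs(frame, right).shape, make_anaglyph(frame, right).shape)
```

A real pipeline would take the depth map from a depth estimation model and fill the holes that warping leaves behind; the column shift here only shows where the disparity enters.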