fix: Calculate 25% more VRAM for base models with weights/LoRAs

Instead of adding a fixed 2 GB overhead, the estimate now scales
the base model's VRAM requirement by 25% for models that will have
fine-tuned weights/tensors or LoRA adapters loaded on top.
parent 1162d3c0
@@ -3909,8 +3909,9 @@ def select_best_model(gen_type, models, vram_gb=24, prefer_quality=True, return_
         }
         # Check VRAM compatibility using base model requirements
-        # LoRAs add a small overhead (~1-2GB)
-        vram_est = parse_vram_estimate(base_model_info.get("vram", "~10 GB")) + 2
+        # LoRAs and fine-tuned weights add significant overhead (25% more)
+        base_vram = parse_vram_estimate(base_model_info.get("vram", "~10 GB"))
+        vram_est = base_vram * 1.25  # 25% more for weights/tensors/loras
         if allow_bigger_models:
             # If allowing bigger models, check if VRAM + 75% of available RAM is sufficient
             available_ram = get_available_ram_gb()
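The change above can be sketched as a small standalone snippet. Note that `parse_vram_estimate` is not shown in the diff, so the version below is a hypothetical re-implementation that parses strings like "~10 GB"; only the 25% scaling matches the actual change.

```python
import re

def parse_vram_estimate(vram_str):
    # Hypothetical helper for illustration: extract the first number
    # from a string such as "~10 GB" and return it as a float (GB).
    match = re.search(r"(\d+(?:\.\d+)?)", vram_str)
    return float(match.group(1)) if match else 10.0

def estimate_vram_with_adapters(base_vram_str="~10 GB"):
    # New behavior: scale the base requirement by 25% to account for
    # fine-tuned weights/tensors or LoRA adapters loaded on top,
    # instead of adding a fixed 2 GB.
    base_vram = parse_vram_estimate(base_vram_str)
    return base_vram * 1.25
```

With this scaling, a "~10 GB" base model is estimated at 12.5 GB, whereas a larger "~24 GB" model gets a proportionally larger 6 GB cushion rather than the same flat 2 GB.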