1. 10 Mar, 2026 3 commits
    • Your Name's avatar
      Add GGUF image model support in --loadall mode · e848dd47
      Your Name authored
      - Detect if image model is GGUF (ends with .gguf or contains 'gguf')
      - If GGUF, load using llama.cpp (same as text Vulkan models)
      - If diffusers model, load using Stable Diffusion pipeline
      - Fixed both locations where image model preloading happens
      - Now supports both GGUF and diffusers image generation models
      e848dd47
    • Your Name's avatar
      Fix image model preloading with --loadall flag · 2308d5b0
      Your Name authored
      - Fixed bug where image model wasn't actually being loaded when --loadall was specified
      - The code only printed messages but never loaded the diffusers pipeline
      - Now actually loads the Stable Diffusion pipeline using diffusers library
      - Tries StableDiffusionXLPipeline first, falls back to generic DiffusionPipeline
      - Moves to GPU if CUDA available, enables attention slicing for memory efficiency
      - Also fixes second location where image model is the only configured model
      
      - Debug command line output was already implemented
      2308d5b0
    • Your Name's avatar
      Fix --loadall model preloading and --debug command line output · 9193536a
      Your Name authored
      - Fixed undefined variable bug where model_name wasn't defined in scope
      - Fixed duplicate model loading when using --loadall/--loadswap with multiple models
      - First model is now only loaded once (skipped in loop if already loaded)
      - Loadall mode now properly preloads all models in VRAM respecting offload strategy
      - Loadswap mode properly loads additional models to RAM
      - Ondemand mode doesn't reload first model
      
      Feature 1: --debug now shows full command line as first output
      Feature 2: --loadall with multiple models now preloads all in VRAM
      9193536a
  2. 09 Mar, 2026 30 commits
  3. 08 Mar, 2026 7 commits