- 15 Mar, 2026 17 commits
-
-
Your Name authored
GGUF models are for text/LLM and cannot do image generation.
-
Your Name authored
-
Your Name authored
-
Your Name authored
diffusers is required for Stable Diffusion image generation
-
Your Name authored
The package is named 'stable_diffusion_cpp_python', not 'stable_diffusion_cpp'
-
Your Name authored
If --image-model is not specified, try to use the main --model as the image model fallback when requesting 'default' model.
-
Your Name authored
If loading a cached GGUF model fails with corruption indicators (invalid, corrupt, magic, header), delete the corrupted cache and re-download the model automatically.
-
Your Name authored
The cache filename format was inconsistent: - get_cached_model_path used: {hash}{ext} - load_model download used: {hash}_{filename} This caused cache lookups to always fail. Now both use {hash}_{filename} format to ensure cached models are properly found. -
Your Name authored
Change from detailed installation instructions to simple message: 'Model does not support image generation'
-
Your Name authored
- If request specifies a model, use that - If request doesn't specify a model (empty or 'image'), use default - Legacy 'image:' prefix also falls back to default - Error handling already exists for when no backend is available
-
Your Name authored
Change message from 'VulkanBackend will use CUDA backend' to 'GGUF model will use CUDA backend (forced by --backend nvidia)'
-
Your Name authored
When user explicitly passes --backend nvidia with a GGUF model, vulkan is now removed from the available backends list since llama-cpp-python will use CUDA instead of Vulkan.
-
Your Name authored
- When user specifies --backend nvidia with a GGUF model, show a note indicating that the vulkan backend will use CUDA - This clarifies that Vulkan isn't being used in this scenario
-
Your Name authored
- Add 'all' as a valid backend option - Change default from 'nvidia' to 'all' - Add comprehensive 'all' backend section that installs: - Base requirements - PyTorch with CUDA (nvidia backend) - llama-cpp-python with CUDA and Vulkan support - stable-diffusion-cpp-python with OpenCL - Additional requirements - Detect available hardware (CUDA, Vulkan, OpenCL) and enable accordingly - Show summary of available backends after installation
-
Your Name authored
- Store original backend before switching to vulkan for GGUF files - Pass original_backend to VulkanBackend constructor - Add force_cuda flag that triggers CUDA environment setup - Set CUDA_VISIBLE_DEVICES when force_cuda is True - Update success/error messages to reflect actual backend used - Add debug output for CUDA detection
-
Your Name authored
The local import of os inside the HTTPS block caused Python to treat os as a local variable throughout the main() function.
-
Your Name authored
- Add python-multipart to requirements.txt, requirements-nvidia.txt, requirements-vulkan.txt - Add llama-cpp-python to requirements-nvidia.txt for GGUF support - When using CUDA/nvidia backend with GGUF file, automatically use llama-cpp-python
-
- 14 Mar, 2026 23 commits
-
-
Your Name authored
- Add --vision-model for image/video-to-text models - When --file-path is set, return URL by default, base64 only if explicitly requested - Add --https flag with auto-certificate generation - Add --privkey and --pubkey for custom certificates
-
Your Name authored
- Add --image-cfg-scale CLI option (default 1.0) - Add get_cfg_scale() helper that auto-detects VRAM - If Vulkan and VRAM < 16GB, use cfg_scale=1.0 automatically
-
Your Name authored
- Add --image-backend, --audio-backend, --tts-backend CLI args - Add opencl to backend choices - Add OpenCL build target in build.sh
-
Your Name authored
-
Your Name authored
-
Your Name authored
- When --debug is enabled, show full command line coderai was called with - Add --nopreload flag to disable model preloading at startup - When --nopreload is not specified, skip checking for preloaded sd.cpp models (forces load in worker thread to avoid Vulkan context issues) - Fix image model preloading to respect --nopreload flag
-
Your Name authored
This helps prevent the GPU from staying at 100% utilization after image generation by allowing the Vulkan driver to transition from compute state to idle state.
-
Your Name authored
- Use keep_clip_on_cpu instead of clip_on_cpu - VAE tiling is handled internally in newer builds - Add n_threads using psutil.cpu_count() for optimal performance
-
Your Name authored
- Added --vae-tiling flag to enable VAE tiling for lower VRAM usage - Added --clip-on-cpu flag to run CLIP on CPU to save VRAM - Both options work with stable-diffusion-cpp-python
-
Your Name authored
- Added --image-seed argument to set default seed for image generation - Updated diffusers and sd.cpp code to use request seed or CLI default seed - Priority: request seed > CLI default seed > random
-
Your Name authored
- Added model_key initialization before sd.cpp loading in on-demand section - Added model_key assignment before adding model to manager
-
Your Name authored
- Fixed model_key variable scope issue in GGUF->sd.cpp fallback - Fixed model_path undefined in diffusers preloading section - These fixes prevent startup crashes when using --loadall
-
Your Name authored
- Reordered the image generation backend priority to try torch/diffusers first - If torch/diffusers fails (ImportError or other error), fallback to stable-diffusion-cpp-python - If both backends fail, return a helpful error message with installation instructions - Added dynamic loading of sd.cpp model if not pre-loaded
-
Your Name authored
-
Your Name authored
-
Your Name authored
-
Your Name authored
-
Your Name authored
-
Your Name authored
-
Your Name authored
-
Your Name authored
-
Your Name authored
-
Your Name authored
-