Force CUDA backend in llama-cpp-python when NVIDIA backend is requested with GGUF models
- Store original backend before switching to vulkan for GGUF files - Pass original_backend to VulkanBackend constructor - Add force_cuda flag that triggers CUDA environment setup - Set CUDA_VISIBLE_DEVICES when force_cuda is True - Update success/error messages to reflect actual backend used - Add debug output for CUDA detection
Showing
Please
register
or
sign in
to comment