-
Your Name authored
- Set GGML_DISABLE_VULKAN=1 and GGML_VULKAN_DEVICE='' before loading model - These must be set before llama_cpp import since it reads them at init - Restore Vulkan settings on cleanup so subsequent Vulkan models work - Addresses issue where GGUF models ran on CPU instead of CUDA with --backend nvidia
f77d34da
| Name |
Last commit
|
Last update |
|---|---|---|
| .vscode | ||
| .gitignore | ||
| LICENSE.md | ||
| README.md | ||
| aaa | ||
| build.sh | ||
| coder | ||
| coderai | ||
| requirements-nvidia.txt | ||
| requirements-vulkan.txt | ||
| requirements.txt | ||
| requirements.txt~ |