• Your Name's avatar
    Fix offload-strategy parameter passing to CUDA backend · bf1d3f52
    Your Name authored
    - Add offload_strategy to kwargs in _load_default_model and _load_model_by_name
    - Fix parameter name: ram -> manual_ram_gb to match backend expectation
    - Also pass load_in_4bit, load_in_8bit, and max_gpu_percent
    bf1d3f52
Name
Last commit
Last update
.vscode Loading commit data...
codai Loading commit data...
.gitignore Loading commit data...
LICENSE.md Loading commit data...
README.md Loading commit data...
build.sh Loading commit data...
coder Loading commit data...
coderai Loading commit data...
requirements-nvidia.txt Loading commit data...
requirements-vulkan.txt Loading commit data...
requirements.txt Loading commit data...