• Your Name's avatar
    Implement on-demand model swapping for multiple models · 362b8452
    Your Name authored
    - Add model_backend_types dict to track backend for each model
    - Update set_default_model to accept backend_type parameter
    - Modify get_model_for_request to swap models on-demand when in ondemand mode
    - Unload current model from VRAM and load new model when request arrives for different model
    - Respect --backend flag when loading models on-demand
    - Only activates when no --loadall or --loadswap flag is specified
    362b8452
Name
Last commit
Last update
.vscode Loading commit data...
.gitignore Loading commit data...
LICENSE.md Loading commit data...
README.md Loading commit data...
aaa Loading commit data...
build.sh Loading commit data...
coder Loading commit data...
coderai Loading commit data...
requirements-nvidia.txt Loading commit data...
requirements-vulkan.txt Loading commit data...
requirements.txt Loading commit data...
requirements.txt~ Loading commit data...