    admin: actually free VRAM on unload + show whisper-server as loaded · 84def90a
    Stefy Lanza (nextime / spora ) authored
    Two issues when unloading/reporting models on a multi-engine node:
    
    - Unload didn't free VRAM for pooled models. api_model_unload only popped
      multi_model_manager.models and never touched model_pools, so a model
      served with max_instances>1 (which lives only in the pool) kept all its
      instances resident. Now it searches both dicts and calls unload_model(),
      which cleans up the whole pool + runs gc/empty_cache. Also handles
      whisper-server models (their own subprocess) by stopping the server.
    
    - whisper-server showed as "not loaded". It runs as a subprocess tracked
      in whisper_servers, not in .models. Fold each running server (id +
      `audio:` alias) into both the model-loaded-status list and the
      /admin/api/status loaded_keys, so the models page, dashboard count and
      per-engine box all reflect it (incl. on a secondary engine).
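
The two fixes described above can be sketched roughly as follows. This is only an illustration under stated assumptions, not the code from the commit: the names `models`, `model_pools`, `whisper_servers`, `unload_model`, `api_model_unload` and the `audio:` alias come from the commit message, while the class layout, `FakeWhisperServer`, its `stop()` method, the `loaded_keys()` helper and the exact alias format (`audio:<id>`) are hypothetical.

```python
import gc


class FakeWhisperServer:
    """Hypothetical handle for a whisper-server subprocess."""

    def __init__(self):
        self.stopped = False

    def stop(self):
        # The real code would terminate the subprocess here.
        self.stopped = True


class MultiModelManager:
    """Hypothetical stand-in; only the three dict names come from the commit."""

    def __init__(self):
        self.models = {}           # model_id -> single loaded instance
        self.model_pools = {}      # model_id -> list of instances (max_instances > 1)
        self.whisper_servers = {}  # model_id -> whisper-server handle

    def unload_model(self, model_id):
        """Drop the single instance and the whole pool, then collect garbage."""
        found = self.models.pop(model_id, None) is not None
        pool = self.model_pools.pop(model_id, None)
        if pool is not None:
            pool.clear()
            found = True
        if found:
            gc.collect()
            # A GPU build would presumably also release cached memory here,
            # e.g. torch.cuda.empty_cache(); omitted to keep the sketch stdlib-only.
        return found


def api_model_unload(manager, model_id):
    """Unload a model wherever it lives: whisper server, .models, or a pool."""
    server = manager.whisper_servers.pop(model_id, None)
    if server is not None:
        server.stop()
        return True
    # Search both dicts, not just .models, so pooled models are freed too.
    if model_id in manager.models or model_id in manager.model_pools:
        return manager.unload_model(model_id)
    return False


def loaded_keys(manager):
    """Keys to report as loaded, with each running whisper server folded in."""
    # How pooled models are reported is not covered by the commit message,
    # so this sketch only starts from .models.
    keys = set(manager.models)
    for server_id in manager.whisper_servers:
        keys.add(server_id)
        keys.add(f"audio:{server_id}")  # assumed alias format
    return keys
```

In this sketch, unloading a model that exists only in `model_pools` removes the whole pool, and a running whisper server appears in `loaded_keys()` under both its id and its `audio:` alias until it is stopped.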
    Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>
Name
codai
docs
packaging
samples
tests
tools
.dockerignore
.gitignore
AI.PROMPT
CODERAI_API_DOCUMENTATION.md
CoderAI.gif
DISTRIBUTION.md
LICENSE.md
MULTIMODAL_CAPABILITIES.md
MULTIMODAL_UI_EXAMPLES.md
README.md
build-oci.sh
build.ps1
build.sh
coderai
coderai-broker-implementation-reference.md
coderai-integration.md
commands
osxbuild.sh
package-oci.sh
package-tarball.sh
requirements-nvidia.txt
requirements-vulkan.txt
requirements.txt
run-oci.sh
smoke-test-oci.sh
todo.md
video_editor.config.json