• Stefy Lanza (nextime / spora )'s avatar
    front: route admin model load/unload to the owning engine · 8abd66c7
    Stefy Lanza (nextime / spora ) authored
    model-load/model-unload were proxied to the primary engine, so unloading
    (or loading) a model that lives on a secondary engine hit the wrong process
    and silently no-op'd (was_loaded=False). Add front-proxy interceptors:
    
    - unload: find the engine whose loaded_models matches the path and forward
      the request there; fall back to the primary.
    - load: reuse an engine already serving the model, else the model's engine
      pin from models.json, else the primary.
    
    Registered before the catch-all proxy, mirroring /admin/api/engines.
    Co-Authored-By: 's avatarClaude Opus 4.8 <noreply@anthropic.com>
    8abd66c7
Name
Last commit
Last update
..
__init__.py Loading commit data...
app.py Loading commit data...
assignment.py Loading commit data...
engine_supervisor.py Loading commit data...
gpu_detect.py Loading commit data...
registry.py Loading commit data...
router.py Loading commit data...