• Stefy Lanza (nextime / spora )'s avatar
    front: per-model engine_fallback option for an unavailable pin · 84d085d7
    Stefy Lanza (nextime / spora ) authored
    By default a per-model engine pin is a hard constraint: if the pinned
    engine is down/incompatible the request fails (no duplicate on another
    card). Add an `engine_fallback` model-config flag (admin form checkbox +
    persisted in models.json) that opts into the old behaviour — fall back to
    a compatible engine when the pin can't be honoured. A pinned engine
    that's merely busy-but-alive is still routed to (queues) in both modes;
    fallback only applies when it's actually down or can't serve the model.
    
    Threads pin_fallback through pick_engine; the front reads engine_fallback
    via _load_pins/_model_info.
    Co-Authored-By: 's avatarClaude Opus 4.8 <noreply@anthropic.com>
    84d085d7
router.py 7.18 KB