codai/admin/templates · f0dcf7eb2a2687135132f13eb2662aea05585ebf · nexlab / coderai

admin: whisper gguf model auto-manages its runner (1:1) · 3d551444

Stefy Lanza (nextime / spora ) authored Jun 19, 2026

Model a whisper gguf as two things: a MODEL config (a .gguf entry with
backend=whisper-server and NO model_path — enables the model, holds load
strategy, shown on the GGUF row) and a RUNNER (backend=whisper-server WITH
model_path — the subprocess, shown in the whisper card).

- Enabling a .gguf with speech_to_text marks it backend=whisper-server and
  auto-creates exactly one runner (1:1) on a free port.
- Disabling the model removes + kills all its runners (cascade by model_path).
- Removing a runner (or model) now stops the subprocess + drops registry
  entries, instead of leaving it running until restart.
- cached-models shows the model config on the GGUF row but excludes runners;
  the whisper card shows only runners (require model_path).
- engine startup only launches runners (entries with model_path), never the
  bare model config.
Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>

3d551444

Name	Last commit	Last update
..
archive.html		Loading commit data...
base.html		Loading commit data...
change_password.html		Loading commit data...
chat.html		Loading commit data...
dashboard.html		Loading commit data...
login.html		Loading commit data...
models.html		Loading commit data...
settings.html		Loading commit data...
tasks.html		Loading commit data...
tokens.html		Loading commit data...
users.html		Loading commit data...