    fix: GGUF vision/mmproj routing + VRAM estimate; Tasks page it/s + history · ade800f9
    Stefy Lanza (nextime / spora ) authored
    - api_model_load: load a GGUF/text model via llama.cpp even when it's also
      bucketed under image/vision (respect the entry's primary model_type), so a
      gemma+mmproj LLM never hits the diffusers from_pretrained() path.
    - model config save: a GGUF LLM with an mmproj automatically gets the
      image_to_text capability and is kept out of the diffusers
      vision_models/image_models buckets.
    - VRAM estimate: _runtime_reserve_gb scales the KV-cache reserve by the cache
      quantization (q4_0 ≈ 0.27× f16) so quantized-KV models at large context aren't
      over-estimated into needless CPU offload.
    - Free disk (HF): quiet huggingface_hub's noisy not-found traceback and make the
      delete idempotent (repo already gone = success).
    - Tasks page: generation tasks now report it/s (or s/it when slow); text keeps
      tok/s. Throughput computed centrally in the task registry (live EMA + run
      average on finish). New "Recent tasks (last 10)" history section.
    Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>
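The VRAM-estimate item above can be illustrated with a minimal sketch. This is not the project's `_runtime_reserve_gb`; the function name `kv_cache_reserve_gb`, its parameters, and the q8_0 factor are assumptions made for illustration. Only the q4_0 ≈ 0.27× f16 ratio comes from the commit message. The f16 baseline uses the usual KV-cache size formula (keys and values, per layer, per position, per KV head, 2 bytes per element).

```python
# Hypothetical sketch, not the project's _runtime_reserve_gb.
# Cache size relative to f16. The q4_0 value is the figure quoted in the
# commit message; the q8_0 value is an assumption for illustration only.
KV_QUANT_FACTOR = {"f16": 1.0, "q8_0": 0.53, "q4_0": 0.27}


def kv_cache_reserve_gb(n_layers, n_ctx, n_kv_heads, head_dim, cache_type="f16"):
    """Estimate the KV-cache reserve in GiB for a given cache quantization."""
    # K and V tensors (factor 2), 2 bytes per f16 element.
    f16_bytes = 2 * n_layers * n_ctx * n_kv_heads * head_dim * 2
    # Unknown cache types fall back to the conservative f16 size.
    factor = KV_QUANT_FACTOR.get(cache_type, 1.0)
    return f16_bytes * factor / 1024**3


if __name__ == "__main__":
    # Example shape (assumed): 32 layers, 32k context, 8 KV heads, head_dim 128.
    for cache_type in ("f16", "q4_0"):
        gb = kv_cache_reserve_gb(32, 32768, 8, 128, cache_type)
        print(f"{cache_type}: {gb:.2f} GiB")
```

With the assumed shape, the f16 reserve is 4 GiB while the q4_0 reserve is about 1.08 GiB, which is the kind of gap that could otherwise push a model into unnecessary CPU offload.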
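The Tasks-page item in the commit message (it/s, or s/it when slow, with a live EMA and a run average on finish) could look roughly like the sketch below. The class `ThroughputTracker`, the function `format_rate`, the smoothing factor, and the 1 it/s switch-over threshold are all hypothetical; the actual task registry may compute and format these values differently.

```python
# Hypothetical sketch of centralized throughput tracking; names and
# thresholds are assumptions, not the project's task registry API.
class ThroughputTracker:
    def __init__(self, alpha=0.3):
        self.alpha = alpha    # EMA smoothing factor (assumed value)
        self.ema = None       # live exponential moving average, in it/s
        self.steps = 0        # total iterations seen so far
        self.elapsed = 0.0    # total seconds spent on those iterations

    def update(self, steps, seconds):
        """Record a progress report of `steps` iterations taking `seconds`."""
        if steps <= 0 or seconds <= 0:
            return
        rate = steps / seconds
        if self.ema is None:
            self.ema = rate
        else:
            self.ema = self.alpha * rate + (1 - self.alpha) * self.ema
        self.steps += steps
        self.elapsed += seconds

    def average(self):
        """Whole-run average rate, suitable for reporting when a task finishes."""
        return self.steps / self.elapsed if self.elapsed > 0 else None


def format_rate(rate):
    """Show it/s for fast tasks and s/it when slower than one iteration per second."""
    if rate is None or rate <= 0:
        return "n/a"
    if rate >= 1.0:
        return f"{rate:.2f} it/s"
    return f"{1.0 / rate:.2f} s/it"


if __name__ == "__main__":
    tracker = ThroughputTracker(alpha=0.5)
    tracker.update(10, 5.0)
    tracker.update(10, 10.0)
    print("live:", format_rate(tracker.ema))
    print("final:", format_rate(tracker.average()))
```

Keeping both values in one tracker means the live display can follow recent speed while the history entry records the average over the whole run.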
Name
codai
docs
packaging
samples
tests
tools
.dockerignore
.gitignore
AI.PROMPT
CODERAI_API_DOCUMENTATION.md
CoderAI.gif
DISTRIBUTION.md
LICENSE.md
MULTIMODAL_CAPABILITIES.md
MULTIMODAL_UI_EXAMPLES.md
README.md
build-oci.sh
build.ps1
build.sh
coderai
coderai-broker-implementation-reference.md
coderai-integration.md
commands
osxbuild.sh
package-oci.sh
package-tarball.sh
requirements-nvidia.txt
requirements-vulkan.txt
requirements.txt
run-oci.sh
smoke-test-oci.sh
todo.md
video_editor.config.json