-
Your Name authored
- Add offload_strategy to kwargs in _load_default_model and _load_model_by_name - Fix parameter name: ram -> manual_ram_gb to match backend expectation - Also pass load_in_4bit, load_in_8bit, and max_gpu_percent
bf1d3f52
| Name |
Last commit
|
Last update |
|---|---|---|
| .. | ||
| cache | ||
| __init__.py | ||
| capabilities.py | ||
| grammar.py | ||
| manager.py | ||
| parser.py | ||
| templates.py | ||
| tool_call_grammar.gbnf | ||
| utils.py |