    Add --offload-strategy none to disable CPU offloading and VRAM auto-detection · beded066
    Your Name authored
    - Add 'none' to --offload-strategy choices in cli.py
    - In cuda.py backend:
      - _get_vram_percentages_for_strategy() returns None for 'none' strategy
      - _get_vram_percentages_for_gpu() skips VRAM detection for 'none'
      - load_model() loads directly on GPU without max_memory constraints
    - Add startup status message in main.py for --offload-strategy none
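The changes listed above could be sketched roughly as follows. This is a minimal illustration and not the actual contents of cli.py or cuda.py: the strategy names other than 'none', the 0.8 fraction, the "64GiB" CPU budget, the `device_map` key, and the helper names `build_parser`, `vram_fraction_for_strategy`, and `build_load_kwargs` are all assumptions made for the example.

```python
import argparse
from typing import Optional

# Only 'none' is confirmed by the commit message; the other names are hypothetical.
OFFLOAD_STRATEGIES = ("auto", "balanced", "none")


def build_parser() -> argparse.ArgumentParser:
    """Build a parser that accepts 'none' as an --offload-strategy choice."""
    parser = argparse.ArgumentParser(description="offload-strategy sketch")
    parser.add_argument(
        "--offload-strategy",
        choices=OFFLOAD_STRATEGIES,
        default="auto",
        help="'none' disables CPU offloading and VRAM auto-detection",
    )
    return parser


def vram_fraction_for_strategy(strategy: str) -> Optional[float]:
    """Return the share of VRAM to budget, or None when offloading is disabled."""
    if strategy == "none":
        return None  # signals the caller to skip VRAM detection entirely
    return 0.8  # placeholder value, not taken from the real backend


def build_load_kwargs(strategy: str, total_vram_gib: float) -> dict:
    """Assemble loader keyword arguments (key names are assumed, not confirmed)."""
    fraction = vram_fraction_for_strategy(strategy)
    if fraction is None:
        # Load directly on the GPU with no max_memory constraint.
        return {"device_map": "cuda"}
    budget_gib = int(total_vram_gib * fraction)
    return {
        "device_map": "auto",
        "max_memory": {0: f"{budget_gib}GiB", "cpu": "64GiB"},
    }


if __name__ == "__main__":
    args = build_parser().parse_args()
    if args.offload_strategy == "none":
        # Startup status message, analogous to the one described for main.py.
        print("Offload strategy: none (CPU offloading and VRAM detection disabled)")
    print(build_load_kwargs(args.offload_strategy, total_vram_gib=24.0))
```

Returning None from the strategy helper, rather than a sentinel fraction such as 1.0, lets the loader distinguish "no constraint at all" from "use all detected VRAM", which matches the commit's description of skipping detection altogether.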
cuda.py 38.7 KB