- Remove auto-detection logic, just use download_model from cache - User can specify --download-file-pattern for non-GGUF models