-
Stefy Lanza (nextime / spora ) authored
- New --flash flag enables Flash Attention 2 installation - Works with nvidia and all backends (when CUDA available) - Installs with --no-build-isolation flag - Graceful error handling if installation fails - Updated usage instructions to show --flash-attn flag - Requirements: CUDA 11.6+, Linux, Ampere/Ada/Hopper GPU
f4a34bc3