Implement GPU prioritization and weight-based job distribution

- Added --weight parameter to client connections (default: 100)
- Modified cluster master to prioritize GPU-enabled clients for job distribution
- GPU clients get precedence over CPU-only clients that have the same model availability
- When no GPU worker has the required model, GPU clients are still preferred as targets for model distribution
- Client weights are combined with process weights for load balancing
- Higher weight = more jobs assigned to that client

Job distribution priority:
1. GPU clients with required model already loaded
2. CPU clients with required model already loaded
3. GPU clients (model will be sent)
4. CPU clients (model will be sent)

Within each category, clients are selected based on combined weight.
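
The priority order above can be illustrated with a minimal Python sketch. It is not the code from this commit: the `Client` class, its field names, and the helper functions are hypothetical. The commit also does not say how client and process weights are combined or how a client is picked within a category, so the sketch assumes a product of the two weights and a weighted random choice.

```python
import random
from dataclasses import dataclass, field


@dataclass
class Client:
    # Hypothetical record for a connected client; field names are assumptions.
    name: str
    has_gpu: bool
    weight: int = 100          # value of --weight (default: 100)
    process_weight: int = 1    # assumed per-process weight
    loaded_models: set = field(default_factory=set)


def priority_tier(client, model):
    """Return 1 (best) to 4 (worst), following the priority list above."""
    if model in client.loaded_models:
        return 1 if client.has_gpu else 2
    return 3 if client.has_gpu else 4


def combined_weight(client):
    # Assumption: weights are combined by multiplication.
    return client.weight * client.process_weight


def select_client(clients, model, rng=random):
    """Pick one client from the best non-empty tier, weighted by combined weight.

    Assumes at least one candidate has a positive combined weight.
    """
    if not clients:
        return None
    best = min(priority_tier(c, model) for c in clients)
    candidates = [c for c in clients if priority_tier(c, model) == best]
    weights = [combined_weight(c) for c in candidates]
    return rng.choices(candidates, weights=weights, k=1)[0]
```

With this sketch, a CPU client that already holds the model is chosen ahead of a GPU client that would need the model sent, regardless of weight. Weight only affects the choice among clients in the same category.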