Using a system with multiple gpus and multiple users, how can we share resources?

1 次查看(过去 30 天)
I am trying use a system with a Tesla K80 which has multiple GPU devices (8). How can I effectively share these resources without affecting other peoples work? I am currently selecting a GPU device according to available memory. Unfortunately, this is not foolproof. Several devices show that memory available is 'NaN'. Any advice on how to implement it properly would be appreciated!
Extra Info: Windows Multipoint Server Accessed via remote desktop

回答(1 个)

Joss Knight
Joss Knight 2017-10-16
This is difficult to answer fully without a lot more information about your system and environment. Probably the best way to deal with multiple users is to manage them via an MDCS cluster which people can connect to open pools or send batch jobs. The administrator can then make sure there is one worker per GPU and each worker has selected a specific GPU on startup. Another way would be to use nvidia-smi to put the devices in EXCLUSIVE_PROCESS mode. You can find some answers to similar questions here:

类别

Help CenterFile Exchange 中查找有关 Parallel Computing Fundamentals 的更多信息

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by