GPU usage by Kilosort4 when running with run_sorter_by_property #3591

Open
jazlynntan opened this issue Dec 19, 2024 · 4 comments
Labels
concurrency Related to parallel processing

Comments

@jazlynntan

Hello,

I'm running Kilosort4 for a single shank using run_sorter_by_property(). Running Kilosort4 independently in the same conda environment, the same data (with all 4 shanks) took about 1.5 h. However, a single shank within SpikeInterface took about 6 h, which leads me to suspect that the GPU is not being used.

This is the output while kilosort within spikeinterface was running:
[Screenshot: system resource monitor captured while Kilosort4 was running inside SpikeInterface]

The GPU memory seems to be used by the process, but the speed suggests that the GPU is not being used for computation. Meanwhile, CPU usage appeared to be maxed out.
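A minimal check, not from the original report, to confirm that PyTorch inside the environment SpikeInterface runs in can actually see the GPU (assuming torch is the same installation Kilosort4 uses):

    import torch

    # If this prints False, Kilosort4 will fall back to the CPU.
    print(torch.cuda.is_available())
    if torch.cuda.is_available():
        # Name of the CUDA device Kilosort4 would use
        print(torch.cuda.get_device_name(0))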

This is the code I'm using:

    sorted = si.run_sorter_by_property(
        'kilosort4',
        shank1,
        grouping_property='group',
        folder=os.path.join('shank1_output'),
        verbose=True,
        engine="joblib",
        engine_kwargs={"n_jobs": 16},
        **params_kilosort4,
    )

I tried using 'auto' and 'cuda' for the torch_device parameter, but both showed the same issue.
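For reference, a sketch of what that setting looks like, assuming params_kilosort4 is the dict of sorter parameters forwarded to Kilosort4:

    # Hedged sketch: force Kilosort4 onto the CUDA device via the sorter parameters.
    params_kilosort4 = dict(params_kilosort4)   # copy the existing parameters
    params_kilosort4["torch_device"] = "cuda"   # default is "auto"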

May I know if I am doing something wrong? Thank you!

@JoeZiminski
Collaborator

Hi @jazlynntan, I have no idea and this is just a quick guess, but I'm not sure what happens regarding GPU access when parallelising multiple sortings over separate cores. Presumably the separate processes are all attempting to compute on the GPU, but from the runtime it doesn't seem like they are accessing it in any useful way. It might be worth testing with n_jobs=1 to see if this results in the GPU being used.

@zm711
Collaborator

zm711 commented Dec 19, 2024

That was Sam's idea too in another issue (I forget which one). He recommended using engine='loop' instead, so that when n_jobs > 1 the sortings run serially rather than all jobs trying to access the GPU at the same time.
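A minimal sketch of the suggested call, assuming the same arguments as the snippet above with only the engine changed (the 'loop' engine runs the per-group sortings one after another):

    # Hedged sketch: run each group serially so only one process touches the GPU at a time.
    sorted = si.run_sorter_by_property(
        'kilosort4',
        shank1,
        grouping_property='group',
        folder='shank1_output',
        verbose=True,
        engine="loop",
        **params_kilosort4,
    )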

zm711 added the concurrency (Related to parallel processing) label Dec 19, 2024
@zm711
Collaborator

zm711 commented Dec 19, 2024

Also, we might need to add a note to our docs explaining that joblib might not play well with GPU-based sorters. Not sure, but this is the second issue related to this.

@jazlynntan
Author

Hi, I tried the first suggestion:

    sorted = si.run_sorter_by_property(
        'kilosort4',
        shank1,
        grouping_property='group',
        folder=os.path.join('shank1_output'),
        verbose=True,
        engine="joblib",
        engine_kwargs={"n_jobs": 1},
        **params_kilosort4,
    )

I think the same problem persists: GPU memory is used, but the GPU does not appear to be used for computation. The whole sorting for the single shank took about 5 h, and the resource report is as follows:

INFO:kilosort.run_kilosort:********************************************************
INFO:kilosort.run_kilosort:CPU usage:    18.80 %
INFO:kilosort.run_kilosort:Memory:       23.01 %     |     28.90   /   125.60 GB
INFO:kilosort.run_kilosort:------------------------------------------------------
INFO:kilosort.run_kilosort:GPU usage:    `conda install pynvml` for GPU usage
INFO:kilosort.run_kilosort:GPU memory:   44.24 %     |     10.47   /    23.67 GB
INFO:kilosort.run_kilosort:Allocated:     0.04 %     |      0.01   /    23.67 GB
INFO:kilosort.run_kilosort:Max alloc:     3.73 %     |      0.88   /    23.67 GB
INFO:kilosort.run_kilosort:********************************************************
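As a side note, a sketch of how GPU compute utilization could be read directly with pynvml (the package the Kilosort4 log above asks for), assuming device index 0 is the card in question:

    # Hedged sketch: query GPU utilization while the sorting is running.
    import pynvml

    pynvml.nvmlInit()
    handle = pynvml.nvmlDeviceGetHandleByIndex(0)
    util = pynvml.nvmlDeviceGetUtilizationRates(handle)
    print(f"GPU compute utilization: {util.gpu} %")
    print(f"GPU memory utilization:  {util.memory} %")
    pynvml.nvmlShutdown()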

I'm now attempting to use 'loop' for the engine with 16 jobs. I'll update again when it's done.
