What does the chunksize argument do in get_minirocket_features? Say I have 650,000 samples, each with 30 time steps and 4 dimensions per time step. If I want to calculate the MiniRocket features for these samples, shouldn't I be able to pass them all at once to get_minirocket_features, as long as I set chunksize so that each chunk fits in GPU memory? What I'm noticing is that even with a chunksize of 32 and 11 GB of GPU memory, I get out-of-memory errors when I run the code below. Thanks in advance for any help you can offer with this :)
Replies: 2 comments
I'm not sure why I get a CUDA error, but even if I didn't, I realize now that I don't think I have enough CPU memory to hold the features for all 650,000 samples. Each sample produces 10,000 features, and each feature is a float32 (4 bytes), so the output takes 650,000 samples x 10,000 features per sample x 4 bytes / 1e9 bytes per gigabyte = 26 gigabytes. That is 26 GB of CPU memory just to hold a single copy of the features, never mind any additional memory used while processing them. I guess the moral of the story is to be mindful of the number of samples you have when asking get_minirocket_features to calculate the features...
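For anyone who wants to check the arithmetic above or work around the limit, here is a minimal sketch, not something taken from tsai. `feature_matrix_gb` reproduces the 26 GB estimate. `features_to_memmap` shows one possible way to keep the full feature matrix on disk instead of in RAM; the `featurize` argument is a hypothetical callable that you would have to supply yourself (for example, a small wrapper around the feature extractor applied to one slice of the data at a time), and `rows_per_call` and `path` are likewise names made up for this illustration.

```python
import numpy as np


def feature_matrix_gb(n_samples, n_features, dtype=np.float32):
    """Size in GB (1e9 bytes) of a dense n_samples x n_features array."""
    return n_samples * n_features * np.dtype(dtype).itemsize / 1e9


def features_to_memmap(X, featurize, n_features, path, rows_per_call=10_000):
    """Compute features slice by slice and write them to a disk-backed .npy file.

    `featurize` is a hypothetical callable (an assumption of this sketch):
    it takes a slice of X and returns a 2D array of shape
    (len(slice), n_features).
    """
    # open_memmap creates a regular .npy file that is mapped into memory,
    # so only the slice currently being written needs to sit in RAM.
    out = np.lib.format.open_memmap(
        path, mode="w+", dtype=np.float32, shape=(len(X), n_features)
    )
    for start in range(0, len(X), rows_per_call):
        stop = min(start + rows_per_call, len(X))
        out[start:stop] = featurize(X[start:stop])
    out.flush()
    return out


# The estimate from the post: 650,000 samples x 10,000 float32 features.
print(feature_matrix_gb(650_000, 10_000))  # 26.0
```

The resulting file can later be opened lazily with `np.load(path, mmap_mode="r")`, so a downstream model can read it in batches. This only addresses the CPU-side memory problem; it does not explain the CUDA out-of-memory error.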
Hi @YojoNick,