gpuArray and memory management

Question

Gunnar Läthén 2012-5-7

Direct link to this question: https://ww2.mathworks.cn/matlabcentral/answers/37645-gpuarray-and-memory-management

Hi,

I have a loop in which I create a number of gpuArrays. To stay within memory limits, I clear some gpuArrays that hold intermediate results. In MATLAB R2011b everything was cleared nicely, but in R2012a the loop crashes with an out-of-memory exception (running the exact same code). I understand that I cannot completely trust the FreeMemory value reported by gpuDevice, but it does show memory being freed in R2011b and not in R2012a. Is there some way to force R2012a to release the memory (without a reset)?

Thanks!

2 comments

Edric Ellis 2012-5-8

It sounds like you're probably being hit by GPU memory fragmentation. Do you have any reproduction steps you could post?

Gunnar Läthén 2012-5-21

Returning again to this issue, I run into problems when running the full set of code (as opposed to the example in the comment below). In general, it feels like the memory management in R2012a is quite flaky compared to R2011b. I have put the complete code at http://www.itn.liu.se/~gunjo38/mem_test.zip if you are willing to try it out.

Answer 1

Ben Tordoff 2012-5-8

Direct link to this answer: https://ww2.mathworks.cn/matlabcentral/answers/37645-gpuarray-and-memory-management#answer_47066

Hi Gunnar,

This is more of a workaround than an answer, but try inserting a "wait(gpu)" after freeing the memory. For example:

gpu = gpuDevice();
bigData = parallel.gpu.GPUArray.rand(2000);
% do lots of computations
clear bigData;
wait(gpu);

In R2012a and above, the GPU might still be running when you get to the "clear" command, so MATLAB may need to hold onto the memory. Using "wait" to ensure all computations have completed allows the memory to be released safely.

However, this shouldn't be necessary. If memory runs low, MATLAB should wait and free up some memory automatically. Could you post a snippet of code that shows how to hit the problem so that I can see why this isn't happening for you? In particular, which function runs out of memory - is it a creation function (zeros, ones, rand etc) or an operation (fft, multiply etc)?

Thanks

Ben

1 comment

Gunnar Läthén 2012-5-8

I've tried to reduce the code to something manageable. I removed the kernel executions and replaced them with pure memory allocations and some bogus calculations. The code doesn't make sense, but it reproduces the problem on my machine at least. It seems like adding a wait() at the end of the loop fixes things, but maybe the example can be of use to you!

In R2012a I get the output (without the wait()):

1.4222e+09
384090112
CUDA_ERROR_OUT_OF_MEMORY

In R2011b I get the output:

1.4221e+009
572706816
...and so on...

%%
reset(gpuDevice);
g = gpuDevice;
disp(g.FreeMemory);

dim = [288 320 256];
data = parallel.gpu.GPUArray.zeros(dim, 'single');
V = parallel.gpu.GPUArray.zeros(size(data), 'single');
eig1 = parallel.gpu.GPUArray.zeros(size(data), 'single');
eig2 = parallel.gpu.GPUArray.zeros(size(data), 'single');
eig3 = parallel.gpu.GPUArray.zeros(size(data), 'single');

for ind = 1:10
    fxx = parallel.gpu.GPUArray.zeros(size(data), 'single');
    fxy = parallel.gpu.GPUArray.zeros(size(data), 'single');
    fxz = parallel.gpu.GPUArray.zeros(size(data), 'single');
    fyy = parallel.gpu.GPUArray.zeros(size(data), 'single');
    fyz = parallel.gpu.GPUArray.zeros(size(data), 'single');
    fzz = parallel.gpu.GPUArray.zeros(size(data), 'single');

    eig1 = fxx + fyy;
    eig2 = fxy.*fyz;
    eig3 = fxz - fzz;

    clear fxx;
    clear fxy;
    clear fxz;
    clear fyy;
    clear fyz;
    clear fzz;

    v = parallel.gpu.GPUArray.zeros(size(data), 'single');
    v = eig1.*eig2.*eig3;
    V = v;
    clear v;

    %wait(g);
    disp(g.FreeMemory);
end
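
For anyone trying to interpret the FreeMemory values printed above, here is a rough back-of-the-envelope check. It is only a sketch: it assumes 4 bytes per 'single' element, uses nothing but the numbers quoted in this comment, and needs no GPU to run. The variable names are made up for illustration, and the starting values are the rounded figures that disp printed, so the results are approximate.

```matlab
% Size of one array in the reproduction script.
dim = [288 320 256];
bytesPerArray = prod(dim) * 4;   % 'single' is assumed to be 4 bytes per element

% FreeMemory before any allocation and after the first loop iteration,
% copied from the output quoted above (start values are rounded by disp).
freeStart_R2012a = 1.4222e9;  freeAfter_R2012a = 384090112;
freeStart_R2011b = 1.4221e9;  freeAfter_R2011b = 572706816;

% Express the memory in use after one iteration as a number of arrays.
heldArrays_R2012a = (freeStart_R2012a - freeAfter_R2012a) / bytesPerArray;
heldArrays_R2011b = (freeStart_R2011b - freeAfter_R2011b) / bytesPerArray;

fprintf('Bytes per array: %d (%.0f MiB)\n', bytesPerArray, bytesPerArray / 2^20);
fprintf('R2012a: about %.1f arrays still allocated\n', heldArrays_R2012a);
fprintf('R2011b: about %.1f arrays still allocated\n', heldArrays_R2011b);
```

On these numbers each array is 90 MiB, and after the first iteration roughly 11 array-sized blocks are in use in R2012a versus roughly 9 in R2011b, while the workspace holds only five gpuArray variables at that point (data, V, eig1, eig2, eig3). That is an inference from the printed values only; it says nothing definite about how either release manages GPU memory internally.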
