Fast 2D distance calculation

Question

Neuropragmatist 2019-7-29

Direct link to this question: https://ww2.mathworks.cn/matlabcentral/answers/473890-fast-2d-distance-calculation

Commented: Neuropragmatist 2019-8-3

Hi all,

Many of the codes I am currently using depend on a simple calculation: the distance between a single point and a set of other points.

In one example, using the matlab profiler I see that this single calculation takes 50% of the total function time, so I would like to optimise it as far as possible.

I have looked around and haven't found anything more optimal than:

p1 = rand(1,2); % single point
pn = rand(1000000,2); % random points
tic
d = sqrt(sum((p1-pn).^2,2)); % calculate the distance between these
toc

Does anyone else have a clever idea that would optimise this - even just by a tiny fraction? Is there any way to speed these calculations up on the GPU or using a mex? I would be really happy to see any suggestions.

I suspect this might already be as mathematically simple as possible, but I'm frustrated because I need to calculate this a lot.

I have already vectorised my code as far as possible.

Thanks for any help,

R.
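
For readers comparing options, here is a minimal sketch of a few equivalent formulations of the same calculation. It assumes R2016b or later (implicit expansion, as in the code above); the radius r is a hypothetical threshold introduced only for illustration. Whether any variant is actually faster depends on the MATLAB release and hardware, so each should be timed rather than assumed.

```matlab
p1 = rand(1,2);          % single point
pn = rand(1000000,2);    % random points

% Reference: the formulation from the question.
dRef = sqrt(sum((p1-pn).^2,2));

% Variant A: per-column arithmetic, which avoids the sum() reduction.
dx = pn(:,1) - p1(1);
dy = pn(:,2) - p1(2);
dA = sqrt(dx.*dx + dy.*dy);

% Variant B: built-in hypot, which also guards against overflow/underflow.
dB = hypot(dx, dy);

% Variant C: skip sqrt when only comparisons are needed, e.g. finding
% points within a radius r (r is a hypothetical threshold).
r = 0.1;
inside = (dx.*dx + dy.*dy) <= r^2;
```

Variant C only applies if the downstream code compares distances rather than using their values.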


Neuropragmatist 2019-7-29

Well the example is just a toy to show what I have come up with so far.

In reality I have to compute the distance between 60000x2 points (minimum) and a 1x2 point 2704 times and I have to do this about 40000 times.

Just doing this once takes my function 10s and 50% of that is purely the line computing the distance equation, so obviously cutting that down would help me a lot time wise.

Unfortunately there is no way for me to do this in a fully vectorised or pairwise way, because the number of points varies a lot and a full pairwise computation can easily require too much memory.

Hope this makes more sense,

R.
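
One possible middle ground between a full pairwise matrix and one query point at a time is to process the query points in blocks. The sketch below uses the sizes mentioned in this comment; blockSize is an assumed tuning parameter, and the per-query minimum is only a placeholder reduction, since the thread does not say what is done with the distances afterwards. It assumes R2016b or later for implicit expansion.

```matlab
pn = rand(60000,2);      % data points (size taken from the comment above)
q  = rand(2704,2);       % query points (size taken from the comment above)
blockSize = 64;          % query points per block (assumed tuning parameter)

nQ   = size(q,1);
minD = zeros(nQ,1);      % placeholder result: nearest distance per query point
for s = 1:blockSize:nQ
    e  = min(s+blockSize-1, nQ);
    dx = pn(:,1) - q(s:e,1).';     % N-by-B block via implicit expansion
    dy = pn(:,2) - q(s:e,2).';
    D  = sqrt(dx.^2 + dy.^2);      % N-by-B block of distances
    minD(s:e) = min(D,[],1).';     % reduce before moving to the next block
end
```

With these sizes each N-by-B temporary holds 60000 x 64 doubles (about 31 MB), so memory use is bounded by the block size rather than by the number of query points.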

Neuropragmatist 2019-8-3


I have run the following code several times and I get slightly different results:

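xi = 1:.5:8;                          % exponents: 10^1 up to 10^8 points
t1 = NaN(1,numel(xi));                % timings for the manual calculation
t2 = NaN(1,numel(xi));                % timings for pdist2
for p = 1:length(xi)
    p1 = rand(1,2);
    pn = rand(ceil(10^xi(p)),2);
    tic;
    d1 = sqrt(sum((pn-p1).^2,2));     % manual Euclidean distance
    t1(p) = toc;
    tic;
    d2 = pdist2(p1,pn);               % needs Statistics and Machine Learning Toolbox
    t2(p) = toc;
end
figure
plot(xi,t1,'r',xi,t2,'b');
legend({'Manual','Pdist2'})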

This probably suggests that any difference in time between pdist2 and the manual calculation is negligible and depends more on the current background load on the CPU.

However, generally the manual calculation is slightly faster or both methods are the same.
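
Since single tic/toc measurements vary from run to run, a steadier comparison can be made with timeit, which calls a function handle several times and returns the median of the measurements. This is a minimal sketch rather than a rigorous benchmark; the array size is the one from the original question, and the pdist2 call is guarded because it needs Statistics and Machine Learning Toolbox.

```matlab
p1 = rand(1,2);
pn = rand(1000000,2);

% timeit runs the handle repeatedly and returns the median time.
tManual = timeit(@() sqrt(sum((pn-p1).^2,2)));
fprintf('Manual: %.4g s\n', tManual);

% Only time pdist2 if it is available on the path.
if ~isempty(which('pdist2'))
    tPdist2 = timeit(@() pdist2(p1,pn));
    fprintf('pdist2: %.4g s\n', tPdist2);
end
```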


Answer 1

Matt J 2019-7-29

Direct link to this answer: https://ww2.mathworks.cn/matlabcentral/answers/473890-fast-2d-distance-calculation#answer_385282

Edited: Matt J 2019-7-29

If you have the Parallel Computing Toolbox, you can execute the computations on the GPU just by building p1 and pn as gpuArrays. That should definitely speed things up.

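gd = gpuDevice;                  % handle to the currently selected GPU
p1 = gpuArray.rand(1,2);         % single point, created directly on the GPU
pn = gpuArray.rand(1000000,2);   % random points, created directly on the GPU
tic
d = sqrt(sum((p1-pn).^2,2));
wait(gd);                        % block until the GPU has finished so that toc is meaningful
toc  % Elapsed time is 0.001429 seconds.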
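
Note that the timing above covers data that already lives on the GPU. If the points start out in host memory, the upload and the download of the result count too. The following sketch separates the two cases; it assumes Parallel Computing Toolbox is installed (gpuDeviceCount and gputimeit come from that toolbox) and falls back to a CPU-only timing when no supported GPU is present.

```matlab
p1 = rand(1,2);
pn = rand(1000000,2);

% CPU reference timing.
tCpu = timeit(@() sqrt(sum((p1-pn).^2,2)));

if gpuDeviceCount > 0
    % Compute only: the data is already on the GPU.
    g1 = gpuArray(p1);
    gn = gpuArray(pn);
    tGpuCompute = gputimeit(@() sqrt(sum((g1-gn).^2,2)));

    % End to end: upload, compute, and download the result.
    tGpuTotal = gputimeit(@() gather(sqrt(sum((gpuArray(p1)-gpuArray(pn)).^2,2))));

    fprintf('CPU %.4g s, GPU compute %.4g s, GPU incl. transfers %.4g s\n', ...
            tCpu, tGpuCompute, tGpuTotal);
else
    fprintf('No supported GPU found; CPU time %.4g s\n', tCpu);
end
```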


Neuropragmatist 2019-7-29

I looked into this and it seems that my (work) PC only has a crappy integrated GPU that is not Matlab compatible...

Matt J 2019-7-29

Well, I don't think the question can be taken any further until we know what parallel computing resources you do have, or can remote connect to. I think you are at the limits of performance already with standard Matlab.


Answer 2

Joss Knight 2019-8-3

Direct link to this answer: https://ww2.mathworks.cn/matlabcentral/answers/473890-fast-2d-distance-calculation#answer_386022

pdist2 is the usual way to do this, if you have Statistics and Machine Learning Toolbox.


Neuropragmatist 2019-8-3

From what I have read, computing the distance directly using the equation is faster than using pdist or pdist2 as it avoids the overheads involved with calling the function.

I don't know if this remains true if you are computing pairwise distances, but I only need to know the distance from one point to many other points.

Joss Knight 2019-8-3

But pdist2 does that. Input x is a 1-by-2 vector, and input y is an N-by-2 array of N points.

You may be right that it is no faster than implementing it manually.
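
To make the one-point-to-many usage concrete, here is a small sketch comparing pdist2 with the manual formula. It assumes Statistics and Machine Learning Toolbox is installed and R2016b or later for implicit expansion; the array size is arbitrary. The main practical difference is the output shape, which depends on the argument order.

```matlab
p1 = rand(1,2);                        % single point
pn = rand(1000,2);                     % N points

dManual = sqrt(sum((pn-p1).^2,2));     % N-by-1 column
dRow    = pdist2(p1, pn);              % 1-by-N row (one row per point in p1)
dCol    = pdist2(pn, p1);              % N-by-1 column, same shape as dManual

maxDiff = max(abs(dCol - dManual));    % should be at rounding-error level
```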

