Closest Points between two datasets without using pdist2

Question

0 个投票

Hi, i have two matrices A, of size mx2, and B, of size nx2.

Each row of both matrices identifies a point in 2D space. What i want to do is to write a code, that does not involve loops and pdist2, as fast as possible, that tells me the indices of the row of both A and B such that the distance squared of the two points is the minimum one.

Example:

A=[5 6;

1 2;

3 4

1 8];

B=[3 0;

2 1;

4 1;

3 5;

1 2];

My function must be like [indA,indB]=function(A_matrix,B_matrix)

I want as output [2,5]=function(A,B)

I found a solution using for-loops but i really would like to find a solution using repmat that involves vectorization.

Thanks

4 个评论
显示 2更早的评论隐藏 2更早的评论

Walter Roberson 2020-10-25

repmat is slower than implicit expansion in many cases.

There are vectorized ways to get indices of the minimum, but they are not necessarily faster than using find() (would have to be tested) and would have problems with ties.

Alberto Belvedere 2020-10-25

How could i use implicit expansion in this case? Furthermore, is it better to use a combination of find+min or to use sort twice (first by rows, then by columns) on the resultant matrix?

请先登录，再进行评论。

请先登录，再回答此问题。

Follow Question

Answer 1

Walter Roberson 2020-10-25

在 MATLAB Online 中打开

0 个投票

A=[5 6;
     1 2;
     3 4
     1 8];
B=[3 0;
     2 1;
     4 1;
     3 5;
     1 2];
P = permute(sum((A-permute(B,[3 2 1])).^2,2),[1 3 2])   %will be size(A,1) by size(B,1)
P = 4×5
    40    34    26     5    32
     8     2    10    13     0
    16    10    10     1     8
    68    50    58    13    36
●
A_idx_in_B = sum(cumprod(P ~= min(P,[],1),1),1)+1
A_idx_in_B = 1×5
     2     2     2     3     2
●
B_idx_in_A = sum(cumprod(P ~= min(P,[],2),2),2)+1
B_idx_in_A = 4×1
     4
     5
     4
     4
●

This code resolves ties in favor of the first match.

3 个评论
显示 1更早的评论隐藏 1更早的评论

Walter Roberson 2020-10-25

The idx_in variables are the indices of the minimum distance. For example the closest entry in B to A(4,:) is the 4th entry of A_idx_in_B which is 3, so A(4,:) is closest to B(3,:)

Alberto Belvedere 2020-10-25

Got it, thanks!

请先登录，再进行评论。

Answer 2

Mitchell Thurston 2020-10-24

编辑：Mitchell Thurston 2020-10-24

在 MATLAB Online 中打开

1 个投票

Came up with a solution:

[m,~] = size(A);
[n,~] = size(B);
A_rep = repmat(A,n,1);
B_rep = B';
B_rep = repmat(B_rep(:)',m,1);
dist = hypot( A_rep(:,1)-B_rep(:,1:2:end), A_rep(:,2)-B_rep(:,2:2:end) );
ind = find(dist == min(dist));
indB = floor((ind-1)./m)+1
indA = mod(ind-(indB-1)*m,n)

3 个评论
显示 1更早的评论隐藏 1更早的评论

Mitchell Thurston 2020-10-24

Not as far as I know, this is the method I've always used for cases like this. What is nice about this though is if there's a tie for the closest it will return all of those indicies of the tie.

Alberto Belvedere 2020-10-25

I'll try some minor tweaks tomorrow starting from your excellent job. Thank you so much for your time!

请先登录，再进行评论。

Closest Points between two datasets without using pdist2

4 个评论
显示 2更早的评论隐藏 2更早的评论

采纳的回答

3 个评论
显示 1更早的评论隐藏 1更早的评论

更多回答（1 个）

3 个评论
显示 1更早的评论隐藏 1更早的评论

类别

产品

标签

Community Treasure Hunt

Closest Points between two datasets without using pdist2

4 个评论 显示 2更早的评论 隐藏 2更早的评论

采纳的回答

3 个评论 显示 1更早的评论 隐藏 1更早的评论

更多回答（1 个）

3 个评论 显示 1更早的评论 隐藏 1更早的评论

类别

产品

标签

另请参阅

Community Treasure Hunt

4 个评论
显示 2更早的评论隐藏 2更早的评论

3 个评论
显示 1更早的评论隐藏 1更早的评论

3 个评论
显示 1更早的评论隐藏 1更早的评论