Why do I get correlation result NaN?

Bharath kumar boyanapalli 2021-8-13

Direct link to this question: https://ww2.mathworks.cn/matlabcentral/answers/897592-why-do-i-get-correlation-result-nan

Commented: Walter Roberson 2021-8-17

I have two matrices, A and B, each of size 1*1058. Both contain some NaN values. Is there any way to compute the correlation between these two matrices?

Accepted Answer

Ive J 2021-8-15

Direct link to this answer: https://ww2.mathworks.cn/matlabcentral/answers/897592-why-do-i-get-correlation-result-nan#answer_767337

Unless you have a good reason to impute your missing data, you can remove missing values from both vectors.

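nanidx = isnan(A) | isnan(B);   % true wherever either vector has a NaN
% corr treats columns as variables, so transpose the 1-by-N rows into columns
corr(A(~nanidx).', B(~nanidx).')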
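
As a rough sketch of why the NaN appears and of two equivalent ways around it: with its default settings, corr returns NaN as soon as either input contains a NaN. The sample values below are made up for illustration and are not the asker's data. The 'Rows','complete' option of corr and corrcoef drops incomplete pairs internally, which should match the manual masking above.

```matlab
% Illustrative row vectors with missing values (hypothetical data)
A = [1 2 NaN 4 5 6];
B = [2 1 4 NaN 7 5];

% Default behaviour: any NaN in the inputs makes the result NaN
r0 = corr(A(:), B(:));

% Option 1: mask out every position where either vector is NaN
nanidx = isnan(A) | isnan(B);
r1 = corr(A(~nanidx).', B(~nanidx).');

% Option 2: let corr discard incomplete pairs itself
r2 = corr(A(:), B(:), 'Rows', 'complete');

% Option 3: corrcoef (base MATLAB, no toolbox needed) with the same option
C  = corrcoef(A, B, 'Rows', 'complete');
r3 = C(1, 2);

fprintf('default: %g, masked: %.4f, corr complete: %.4f, corrcoef: %.4f\n', ...
        r0, r1, r2, r3);
```

A(:) forces a column vector regardless of the original orientation, which matters because corr interprets each column as one variable.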

Ive J 2021-8-16

Yes, that's the whole idea!

% step 0-create two sample vectors with 5 missing values
sz = 100;
A = rand(sz, 1);
B = rand(sz, 1);
A(randperm(sz, 5)) = nan;
B(randperm(sz, 5)) = nan;
% step 1-find missing values in both vectors
nanidx = isnan(A) | isnan(B);
% step 2- remove the indices in step 1
cleanA = A(~nanidx);
cleanB = B(~nanidx);
% step 3- calculate the correlation coeff.
R = corr(cleanA, cleanB); % NOTE: by default Pearson correlation is used in corr function
% step 4- report it
fprintf('Pearson R is %.2f\n', R)
Pearson R is -0.08

Walter Roberson 2021-8-16

That's what Ive J's code does: removes all locations for which X is nan or Y is nan.

More Answers (1)

Chunru 2021-8-13

Direct link to this answer: https://ww2.mathworks.cn/matlabcentral/answers/897592-why-do-i-get-correlation-result-nan#answer_766232

Use "fillmissing" to fill in the NaNs before computing the correlation. See doc fillmissing for more details.

In any case, if your data contain that many NaNs, you should question the data before questioning the processing technique. There is no foolproof technique for filling in missing data. It all depends on what data you have and what you want.

Walter Roberson 2021-8-17

Mathematically, if you have vectors A and B, then

cAB = corr(A,B);
P = randperm(numel(A));
pA = A(P);
pB = B(P);
cpAB = corr(pA, pB);

then cAB needs to equal cpAB to within round-off. The order of the elements within each vector does not matter: only the correspondence between the two vectors matters.

If, though, you were to compare fillmissing(A) with fillmissing(pA), you would get different results, because fillmissing works from nearby values, under the assumption that there is some kind of smooth continuity. This is not really compatible with the mathematics of correlation, which does not care about order within the sequence.
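
A minimal sketch of that difference, under assumed sample data (the vectors, the seed, and the choice of the 'linear' method are illustrative, not taken from the question): deleting incomplete pairs gives the same correlation before and after permuting, whereas interpolating with fillmissing generally does not.

```matlab
rng(0);                          % arbitrary fixed seed for repeatability
n = 20;
A = (1:n).';                     % smooth, ordered sample data (hypothetical)
B = 2*A + randn(n, 1);
A([5 12]) = NaN;                 % knock out two values in A

P  = randperm(n);                % apply the same permutation to both vectors
pA = A(P);
pB = B(P);

% Deleting incomplete pairs: independent of element order
r  = corr(A,  B,  'Rows', 'complete');
rp = corr(pA, pB, 'Rows', 'complete');

% Linear interpolation: filled values depend on neighbours, hence on order
fA  = fillmissing(A,  'linear');
fpA = fillmissing(pA, 'linear');
rf  = corr(fA,  B);
rfp = corr(fpA, pB);

fprintf('pairwise deletion: %.4f vs %.4f\n', r,  rp);
fprintf('fillmissing:       %.4f vs %.4f\n', rf, rfp);
```

The first pair of numbers should agree to within round-off; the second pair will usually differ, because after shuffling the neighbours of each gap are unrelated values.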

If you have some prediction function for your vectors, then yes, it might make sense to apply that prediction function. In some cases it might even make sense to apply something like narx to predict the missing values. But that would have to be done based on knowledge of what the vectors represent. fillmissing() has no knowledge of what they represent.

Categories

Signal Processing > Signal Processing Toolbox > Transforms, Correlation, and Modeling > Correlation and Convolution

Products

Statistics and Machine Learning Toolbox

Release

R2019b
