How to avoid NaN when comparing data
49 次查看(过去 30 天)
显示 更早的评论
Hello
I have a quite generalized question that has been troubling me. When comparing data such as examples with real, we sometimes have 0 or NaN values. As is the case, I thought to work around it in three ways:
1.I replaced the NaN with a number (e.g. 0)
2.Replace NaN with values interpolated scheme
3. Replace NaN with the next / or previous value
Although when faced with a very big series of real values i noticed that by interpolating or replacing the NaN my results are changed.
How can we compare the two vector for a process , for example their coefficient but with not considering the NaN values and their corresponding simulated value.
p.s. am aware of the available options in matlab such as nanmean which do not taken into account the nan values of a vector, I am wondering though how can you compare two vectors (values) and when you detect a NaN to skip the NaN value and the simulated/other value that is to be compared.
I try to solve it on an example I have done, trying the functionality
% generate a random valuable and insert NaN
A=rand(500,1)
b=randi(500,20,1)
A(b)=NaN
plot(A)
%generate a same size random variable to be compared with A
B=rand(500,1)
% Just visual confirmation of the location of the NaN's in A, by replaing
% them with 0
c=A
c(isnan(c))=0
c(c==0)=-1
plot(c)
% process of comparison
corrcoef(A,B)
The objective is where the NaN are located they are not taken into account and the corresponding B value that was to be compared is also ignored.
Thank you
0 个评论
采纳的回答
Adam
2014-9-25
编辑:Adam
2014-9-25
idx = isnan( A );
A(idx) = [];
B(idx) = [];
corrcoef(A,B);
would work. I was playing around with inline if in a function handle to return NaN (which can then be filtered out of the result) if either of the inputs to the function is NaN, but I didn't quite get my syntax right and in the end figured that pre-processing as above is simpler.
Obviously you can put the NaN-less versions in new variables if you don't want to lose the values from B that correspond to NaNs in A.
Maybe the function handle wrapper approach would work better if they are huge arrays where taking copies minus NaNs would present memory issues, but otherwise pre-processing seems easiest.
2 个评论
Adam
2014-9-25
编辑:Adam
2014-9-25
You could generalise it slightly by writing a function such as:
function result = nanfunc( func, dataWithNans, otherData )
idx = isnan( dataWithNans );
dataWithNans(idx) = [];
otherData(idx) = [];
result = func( dataWithNans, otherData );
to call as e.g.
res = nanfunc( @corrcoef, A, B );
if you want to do numerous operations on your two arrays. Such a function does embed numerous assumptions though, partially highlighted by my variable naming, that the first data input is expected to be the one with NaNs and the second NaN-free (though you could easily change the function to remove elements corresponding to NaNs in either input). Also there is an assumption that the function handle you pass in is one that takes two numeric input arguments and produces one output.
Personally I don't like the name nanmean, but I named that function to be consistent. To me a function called nanmean suggests something more the opposite of what it actually does!
更多回答(0 个)
另请参阅
类别
在 Help Center 和 File Exchange 中查找有关 NaNs 的更多信息
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!