Removing outliers from a matrix
4 次查看(过去 30 天)
显示 更早的评论
I removed outliers from my each column of my feature matrix, but now I have feature vectors with differing lengths. I cannot make them the same length because I will lose data from a certain range. If I shuffle, then the correlation between my features and my outcome becomes messy. How can I do that? My matrix of features and outcomes is a 55000*14 matrix. The last column is the outputs.
1 个评论
Davide Masiello
2022-10-11
Maybe, instead of removing the outliers, you could replace them with an interpolated value.
回答(1 个)
Bjorn Gustavsson
2022-10-11
编辑:Bjorn Gustavsson
2022-10-11
QD-answer: The best you can do for single pairs of columns is to only use the rows where neither are outliers. With the cov-function you can get this handled with the options 'omitrows' or 'partialrows' if you replace each outlier with a nan. If you want to use the corrcoef-function it has a slightly different interface. It takes a parameter-value pair to set the options for 'rows', see the help and documentation for description.
HTH
0 个评论
另请参阅
类别
在 Help Center 和 File Exchange 中查找有关 Descriptive Statistics 的更多信息
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!