Removing outliers from a matrix

I removed outliers from each column of my feature matrix, but now the feature vectors have different lengths. I cannot truncate them to the same length, because I would lose data from a certain range. If I shuffle them, the correlation between my features and my outcome gets scrambled. How can I deal with this? My matrix of features and outcomes is 55000-by-14; the last column holds the outcomes.
1 comment
Davide Masiello 2022-10-11
Maybe, instead of removing the outliers, you could replace them with an interpolated value.
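
A minimal sketch of that idea, assuming a MATLAB release that ships `isoutlier` and `filloutliers`. The matrix `X` below is a random stand-in for the real 55000-by-14 data, the planted outliers are purely illustrative, and the default detection rule (more than three scaled median absolute deviations from the column median) may not be the rule used in the question.

```matlab
% Hypothetical stand-in for the real matrix:
% features in columns 1-13, outcome in column 14.
rng(0);
X = randn(200, 14);
X(17, 3)  = 40;     % planted outliers, for illustration only
X(120, 9) = -35;

features = X(:, 1:13);

% isoutlier works column by column and returns a logical mask
% with the same size as its input.
isOut = isoutlier(features);

% Replace each flagged entry by linear interpolation between the
% neighbouring non-outlier values in the same column.
cleaned = filloutliers(features, 'linear');

% Same size as X, so every row still lines up with its outcome.
Xclean = [cleaned, X(:, 14)];
```

Interpolating along a column only makes sense if the row order carries meaning (for example, samples in time order). If it does not, a fill method such as `'clip'` or `'center'` may be a better fit.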


Answers (1)

Bjorn Gustavsson 2022-10-11
Edited: Bjorn Gustavsson 2022-10-11
QD-answer: For a single pair of columns, the best you can do is to use only the rows where neither column has an outlier. If you replace each outlier with NaN, the cov function handles this for you through the options 'omitrows' or 'partialrows'. The corrcoef function has a slightly different interface: it takes a parameter-value pair to set the 'Rows' option; see the help and documentation for a description.
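
A rough sketch of the NaN approach, under stated assumptions: `X` is a random stand-in for the real matrix, outliers are flagged with `isoutlier` and its default rule (the question does not say which rule was used), and only the 13 feature columns are checked, not the outcome column.

```matlab
% Hypothetical stand-in: 13 feature columns plus an outcome in column 14.
rng(1);
X = randn(500, 14);
X(:, 14) = X(:, 1) + 0.5*randn(500, 1);   % outcome depends on feature 1
X(10, 1) = 50;                            % planted outliers, illustration only
X(20, 2) = -60;

% Mark outliers in the feature columns and overwrite them with NaN.
% The matrix keeps its size, so rows stay aligned with the outcome.
mask = false(size(X));
mask(:, 1:13) = isoutlier(X(:, 1:13));
Xnan = X;
Xnan(mask) = NaN;

% Covariance: drop every row that contains any NaN ...
C_omit = cov(Xnan, 'omitrows');
% ... or, for each pair of columns, drop only rows with a NaN in that pair.
C_partial = cov(Xnan, 'partialrows');

% Correlation coefficients with the equivalent 'Rows' settings.
R_complete = corrcoef(Xnan, 'Rows', 'complete');
R_pairwise = corrcoef(Xnan, 'Rows', 'pairwise');

% Correlation of each feature with the outcome (last column).
r = R_pairwise(1:13, 14);
```

With 13 feature columns, the row-wise options (`'omitrows'`, `'complete'`) can discard many rows, since a single outlier anywhere in a row removes it. The pairwise options keep more data, but the resulting matrix is built from different row subsets for each column pair and is not guaranteed to be positive semidefinite.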
HTH
