How to extract the variable IDs after dimensionality reduction via pca?

1 次查看(过去 30 天)
I have a dataset X(voxel*protein), it's size is 3694200*10. I used pca to reduce dimensionality and got a matrix of size 3694200*4. The question is that I need to know which 4 proteins are remained. How can I get the original column IDs(0-9) of the 4 proteins?
My code: [COEEF,SCORE,LATENT] = pca(X); cumVar = cumsum(LATENT)./sum(LATENT); reducedData = SCORE(:,1:4);

采纳的回答

the cyclist
the cyclist 2014-7-8
PCA doesn't select a subset of variable. (Specifically, in your case, PCA does not select 4 out of your 10 variables).
Instead, PCA is identifying linear combinations of your original variables that explain the overall variation. If a small number of these linear combinations capture most of the variation, then it makes sense to limit to those combinations. The first output variable tells you the coefficients of those linear combinations.

更多回答(0 个)

类别

Help CenterFile Exchange 中查找有关 Dimensionality Reduction and Feature Extraction 的更多信息

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by