How to extract the variable IDs after dimensionality reduction via pca?
3 次查看(过去 30 天)
显示 更早的评论
I have a dataset X(voxel*protein), it's size is 3694200*10. I used pca to reduce dimensionality and got a matrix of size 3694200*4. The question is that I need to know which 4 proteins are remained. How can I get the original column IDs(0-9) of the 4 proteins?
My code: [COEEF,SCORE,LATENT] = pca(X); cumVar = cumsum(LATENT)./sum(LATENT); reducedData = SCORE(:,1:4);
0 个评论
采纳的回答
the cyclist
2014-7-8
PCA doesn't select a subset of variable. (Specifically, in your case, PCA does not select 4 out of your 10 variables).
Instead, PCA is identifying linear combinations of your original variables that explain the overall variation. If a small number of these linear combinations capture most of the variation, then it makes sense to limit to those combinations. The first output variable tells you the coefficients of those linear combinations.
更多回答(0 个)
另请参阅
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!