Partial least-squares regression evalutaion

Question

SM 2020-2-6

0
链接

此问题的直接链接

https://ww2.mathworks.cn/matlabcentral/answers/503948-partial-least-squares-regression-evalutaion

编辑： Vladimir Sovkov 2020-2-7

Hi

I am new to PLSR and I went throguh several literature that indicate the eqution of Y=BX+E as the PLSR eqution.

I am trying to use :[XL,YL,XS,YS] = plsregress(X,Y,ncomp) but I cannot match the output with the general form of eqution.

From the matlab description I could figure out the B=XL and I guess that YL should be a coefficient for Y, but I am not sure about neither YL nor XS,YS.

Can anyone help me with explaning how to match them?

and also how can I use them to predict other values?

Thanks

0 个评论
显示 -2更早的评论隐藏 -2更早的评论

请先登录，再进行评论。

请先登录，再回答此问题。

Answer 1

Vladimir Sovkov 2020-2-6

0
链接

此回答的直接链接

https://ww2.mathworks.cn/matlabcentral/answers/503948-partial-least-squares-regression-evalutaion#answer_414084

编辑：Vladimir Sovkov 2020-2-6

The equation considered in PLS of Matlab is rather

Y = X(:,:)*B(2:end,:) + repmat( B(1,:),n,1 ) + E,

where:

X is an n-by-p matrix of predictor variables: in analitical chemistry this can be experimentlal data (e.g., spectroscopic measurements), every row of which contains the full set of measurements dealing with one "standard sample"; p measurements (predictors) for each of n samples;

Y is an n-by-m response matrix (sometimes called "scores"), every row of which contains known values of responces (e.g. concentrations of some elements) in the corresponding "standard sample"; m "concentrations" (responces) for each of n samples;

B is the (p+1)-by-m matrix of coefficients ("loadings") to compute scores ("concentrations") from predictors ("spectra"); its 1st row presents intercepts.

E is unknown noise (is not used further on at all, introduced by some authors for a better undestanding of physics).

Matlab command:

[XL,YL,XS,YS,B,PCTVAR,MSE] = plsregress(X,Y,ncomp)

For predicting "concentrations" from new "spectroscopic data", only B is needed.

To understand other output, you should get deeper into the theory of PLS, which is hard to explain in a few words.

Many of those outputs serve the purpose of finding the appropriate number ncomp of components, but you can just estimate the quality of the approximation with different ncomp directly via the initial equation.

When you already understand the PLS method, you can address the Matlab documentation to deciper their designations; it also contains an example, which is able to make the things clearer.

2 个评论
显示无隐藏无

SM 2020-2-6

Thank you for your detailed answer.

I am still confused in the names and match it with the Matlab code:

(1) Y=XB+E , that you mentioned E is not needed and beta or B is equal to B in the equation (1).

(2) T=WX

(3) Y=TQ+E

1-I am wondering that what exactly are XL and YL and XS and YS?

2- Regarding the prediction is it: Y_new=X_new*B(2:end,:) + repmat( B(1,:),n,1 ) ?

Thanks

Vladimir Sovkov 2020-2-7

编辑：Vladimir Sovkov 2020-2-7

(1)

Matlab documentation states that XL = (XS\X0)' = X0'*XS, YL = (XS\Y0)' = Y0'*XS

where X0 and Y0 are the centered versions of X and Y : X0 = X - mean(X,1); Y0 = Y - mean(Y,1).

Hence, approximately (i.e., via separating out some noisy components): X0 ~= XS*XL', Y0 ~= XS*YL'

The last equation looks similar to your Y=TQ+E with T=XS, Q=YL' and Y -> Y0.

Factually, Y ~= XS*YL' + mean(Y,1)

(2)

Yes, Y_new=X_new*B(2:end,:) + repmat( B(1,:),n,1 ) = [ones(n,1),X_new]*B

请先登录，再进行评论。

Partial least-squares regression evalutaion

0 个评论
显示 -2更早的评论隐藏 -2更早的评论

采纳的回答

2 个评论
显示无隐藏无

更多回答（0 个）

另请参阅

类别

标签

Community Treasure Hunt

Partial least-squares regression evalutaion

0 个评论 显示 -2更早的评论隐藏 -2更早的评论

采纳的回答

2 个评论 显示 无隐藏 无

更多回答（0 个）

另请参阅

类别

标签

Community Treasure Hunt

0 个评论
显示 -2更早的评论隐藏 -2更早的评论

2 个评论
显示无隐藏无