press

版本 1.5.0.0 (3.4 KB) 作者: Antonio Trujillo-Ortiz

Prediction error sum of squares.

关注

4.7

(3)

3.0K 次下载

更新时间 2013/9/3

查看许可证

This m-file returns a useful residual scaling, the prediction error sum of squares (PRESS). To calculate PRESS, select an observation i. Fit the regression model to the remaining n-1 observations and use this equation to predict the withheld observation y_i. Denoting this predicted value by ye_(i), we may find the prediction error for point i as e_(i)=y_i - ye_(i). The prediction error is often called the ith PRESS residual. This procedure is repeated for each observation i = 1,2,...,n, producing a set of n PRESS residuals e_(1),e_(2),...,e_(n). Then the PRESS statistic is defined as the sum of squares of the n PRESS residuals as in,

PRESS = i_Sum_n e_(i)^2 = i_Sum_n [y_i - ye_(i)]^2

Thus PRESS uses such possible subset of n-1 observations as an estimation data set, and every observation in turn is used to form a prediction data set. In the construction of this m-file, we use this statistical approach.

As we have seen that calculating PRESS requires fitting n different regressions, also it is possible to calculate it from the results of a single least squares fit to all n observations. It turns out that the ith PRESS residual is,

e_(i) = e_i/(1 - h_ii)

Thus, because PRESS is just the sum of the squares of the PRESS residuals, a simple computing formula is

PRESS = i_Sum_n [e_i/(1 - h_ii)]^2

It is easy to see that the PRESS residual is just the ordinary residual weighted according to the diagonal elements of the hat matrix h_ii. Also, for all the interested people, here we just indicate, in an inactive form, this statistical approaching.

Data points for which h_ii are large will have large PRESS residuals. These observations will generally be high influence points. Generally, a large difference between the ordinary residual and the PRESS residual will indicate a point where the model fits the data well, but a model built without that point predicts poorly.

This is also known as leave-one-out cross-validation (LOOCV) in linear models as a measure of the accuracy. [an anon's suggestion]

In order to improve the matrix script for avoiding the squares condition number in the regression parameter estimation are by using a pivoted QR factorization of X.

Syntax: function x = press(D)

Inputs:
D - matrix data (=[X Y]) (last column must be the Y-dependent variable).
(X-independent variables).

Output:
x - prediction error sum of squares (PRESS).

引用格式

Antonio Trujillo-Ortiz (2024). press (https://www.mathworks.com/matlabcentral/fileexchange/14564-press), MATLAB Central File Exchange. 检索时间: 2024/11/22.

MATLAB 版本兼容性

创建方式 R14

兼容任何版本

平台兼容性

Windows macOS Linux

类别

AI and Statistics > Statistics and Machine Learning Toolbox > Regression >

在 Help Center 和 MATLAB Answers 中查找有关 Regression 的更多信息

标签添加标签

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

press(D)

版本	已发布	发行说明
1.5.0.0	2013/9/3	Text was improved. [The anon suggestion was taken into account.]	下载
1.3.0.0	2013/8/27	Text was improved.	下载
1.1.0.0	2013/8/26	Code was improved. Taking into account the mathematical suggestions done by Bart and John D'Errico (08-26-2013).	下载
1.0.0.0	2007/4/11	Text was improved.	下载

press

引用格式

必需项

MATLAB 版本兼容性

平台兼容性

类别

标签添加标签

Community Treasure Hunt

探索实时编辑器

press

引用格式

必需项

MATLAB 版本兼容性

平台兼容性

类别

标签 添加标签

Community Treasure Hunt

探索实时编辑器

标签添加标签