MMSE Forecasting Regression Models with ARIMA Errors
What Are MMSE Forecasts?
An objective of time series analysis is generating forecasts for responses over a future time horizon. That is, you can generate predictions for yT + 1,yT + 2,...,yT + h given the following:
An observed series y1, y2,...,yT
A forecast horizon h
Nonstochastic predictors x1,x2,...,xT,...,xT + h, where xk is an r-vector containing the measurements of r predictors observed at time k
A regression model with ARIMA errors
where H(L) and N(L) are compound autoregressive and moving average lag operator polynomials (possibly containing integration), respectively.
Let denote a forecast for the process at time t + 1, conditional on the history of the process up to time t (Ht), and assume that the predictors are fixed. The minimum mean square error (MMSE) forecast is the forecast that minimizes expected square loss,
Minimizing this loss function yields the MMSE forecast,
How forecast Generates MMSE Forecasts
forecast
generates MMSE forecasts
recursively. When you call forecast
, you must specify a regARIMA
model (Mdl
) and the forecast horizon. You
can also specify presample observations (Y0
), predictors
(X0
), innovations (E0
), and conditional
disturbances (U0
) using name-value pair arguments.
To begin forecasting yt starting at time T + 1, use the last few observations of yt and Xt as presample responses and predictors to initialize the forecast. Alternatively, you can specify presample unconditional disturbances or innovations.
However, when you specify presample data:
If you provide presample predictor data (
X0
), then you must also provide predictor forecasts (XF
). It is best practice to setX0
to the same predictor matrix that estimates the parameters. If you do not provide presample and future predictors, thenforecast
ignores the regression component in the model.If the error process in
Mdl
contains a seasonal or nonseasonal autoregressive component, or seasonal or nonseasonal integration, thenforecast
requires a minimum of P presample unconditional disturbances to initialize the forecast. The propertyP
ofMdl
stores P.If the error process in
Mdl
contains a seasonal or nonseasonal moving average component, thenforecast
requires a minimum of Q presample innovations to initialize the forecast. The propertyQ
ofMdl
stores Q.If you provide a sufficient amount of presample unconditional disturbances, then
forecast
ignoresY0
andX0
. If you also do not provideE0
, but provide enough presample unconditional disturbances, thenforecast
infers the required amount of presample innovations from the ARIMA error model andU0
.If you provide a sufficient amount of presample responses and predictors (and do not provide
U0
), thenforecast
uses the regression model to infer the presample unconditional disturbances.If you do not provide presample observations, then
forecast
sets the required amount of presample unconditional disturbances and innovations to 0.If you provide an insufficient amount of presample observations, then
forecast
returns an error.
Consider generating forecasts from a regression model with ARMA(3,2) errors:
where a(L) and B(L) are lag operator polynomials. The largest AR lag is 3, the largest MA lag is 2. This model does not contain any seasonal lags nor integration. Therefore, P = 3 and Q = 2. To forecast this model, you need three presample responses and predictors, or three presample unconditional disturbances, and two presample innovations.
Given presample unconditional disturbances presample innovations and future predictors you can forecast the model as follows:
...
Note that:
Future innovations take on their unconditional mean, 0.
For stationary error processes, such as this one:
The forecasted unconditional disturbances converge to their unconditional mean,
c + Xtβ governs the long-term behavior of the forecasted responses.
Forecast Error
The forecast error for an s-step ahead forecast of a regression model with ARIMA errors is
where the dividend ψ(L) is an infinite lag operator polynomial, and σ2 is the innovation variance.
If the error process is stationary, then the coefficients of ψ(L) are absolutely summable. Therefore, the MSE (mean square error) converges to the unconditional variance of the process [1].
If the error process is not stationary, then the MSE grows with increasing s.
References
[1] Box, G. E. P., G. M. Jenkins, and G. C. Reinsel. Time Series Analysis: Forecasting and Control. 3rd ed. Englewood Cliffs, NJ: Prentice Hall, 1994.
See Also
Related Examples
- Verify Predictive Ability Robustness of a regARIMA Model
- Forecast a Regression Model with Multiplicative Seasonal ARIMA Errors
- Forecast a Regression Model with ARIMA Errors