Main Content

regARIMA Model Estimation Using Equality Constraints

estimate requires a regARIMA model and a vector of univariate response data to estimate a regression model with ARIMA errors. Without predictor data, the model specifies the parametric form of an intercept-only regression component with an ARIMA error model. This is not the same as a conditional mean model with a constant. For details, see Alternative ARIMA Model Representations. If you specify a T-by-r matrix of predictor data, then estimate includes a linear regression component for the r series.

estimate returns fitted values for any parameters in the input model with NaN values. For example, if you specify a default regARIMA model and pass a T-by-r matrix of predictor data, then the software sets all parameters to NaN including the r regression coefficients, and estimates them all. If you specify non-NaN values for any parameters, then estimate views these values as equality constraints and honors them during estimation.

For example, suppose residual diagnostics from a linear regression suggest integrated unconditional disturbances. Since the regression intercept is unidentifiable in integrated models, you decide to set the intercept to 0. Specify 'Intercept',0 in the regARIMA model that you pass into estimate. The software views this non-NaN value as an equality constraint, and does not estimate the intercept, its standard error, and its covariance with other estimates. To illustrate further, suppose the true model for a response series yt is

yt=0+utut=εt,

where εt is Gaussian with variance 1. The loglikelihood function for a simulated data set from this model can resemble the surface in the following figure over a grid of variances and intercepts.

rng(1); % For reproducibility
e = randn(100,1);
Variance = 1;
Intercept = 0;
Mdl0 = regARIMA('Intercept',Intercept,'Variance',Variance);
y = filter(Mdl0,e);

gridLength = 25;
intGrid1 = linspace(-1,1,gridLength);
varGrid1 = linspace(0.1,4,gridLength);
[varGrid2,intGrid2] = meshgrid(varGrid1,intGrid1);
LogLGrid = zeros(numel(varGrid1),numel(intGrid1));

for k1 = 1:numel(intGrid1)
    for k2 = 1:numel(varGrid1)
        Mdl = regARIMA('Intercept',...
            intGrid1(k1),'Variance',varGrid1(k2));
        [~,~,LogLGrid(k1,k2)] = estimate(Mdl,y,'Display','off');
    end
end

figure
surf(intGrid2,varGrid2,LogLGrid) % 3D loglikelihood plot        
xlabel 'Intercept';
ylabel 'Variance';
zlabel 'Loglikelihood';
shading interp

Figure contains an axes object. The axes object with xlabel Intercept, ylabel Variance contains an object of type surface.

Notice that the maximum (yellow region) occurs around where the intercept is 0 and the variance is 1. If you apply an equality constraint, then the optimizer views a two-dimensional slice (in this example) of the loglikelihood function at that constraint. The following plots display the loglikelihood at several different equality constraints on the intercept.

intValue = [intGrid1(5), intGrid1(10),...
    intGrid1(15), intGrid1(20)];

figure
for k = 1:4
    subplot(2,2,k)
    plot(varGrid1,LogLGrid(intGrid2 == intValue(k)))
    title(sprintf('Loglikelihood, Intercpet = %.3f',intValue(k)))
    xlabel 'Variance';
    ylabel 'Loglikelihood';
    hold on
    h1 = gca;
    plot([Variance Variance],h1.YLim,'r:')
    hold off
end

Figure contains 4 axes objects. Axes object 1 with title Loglikelihood, Intercpet = -0.667, xlabel Variance, ylabel Loglikelihood contains 2 objects of type line. Axes object 2 with title Loglikelihood, Intercpet = -0.250, xlabel Variance, ylabel Loglikelihood contains 2 objects of type line. Axes object 3 with title Loglikelihood, Intercpet = 0.167, xlabel Variance, ylabel Loglikelihood contains 2 objects of type line. Axes object 4 with title Loglikelihood, Intercpet = 0.583, xlabel Variance, ylabel Loglikelihood contains 2 objects of type line.

In each case, the maximum of the loglikelihood function occurs close to the true value of the variance.

Rather than constrain the intercept, the following plots display the likelihood function using several equality constraints on the variance.

varValue = [varGrid1(5),varGrid1(10),varGrid1(15),varGrid1(20)];

figure
for k = 1:4
    subplot(2,2,k)
    plot(intGrid1,LogLGrid(varGrid2 == varValue(k)))
    title(sprintf('Loglikelihood, Variance = %.3f',varValue(k)))
    xlabel('Intercept')
    ylabel('Loglikelihood')
    hold on
    h2 = gca;
    plot([Intercept Intercept],h2.YLim,'r:')
    hold off
end

Figure contains 4 axes objects. Axes object 1 with title Loglikelihood, Variance = 0.750, xlabel Intercept, ylabel Loglikelihood contains 2 objects of type line. Axes object 2 with title Loglikelihood, Variance = 1.563, xlabel Intercept, ylabel Loglikelihood contains 2 objects of type line. Axes object 3 with title Loglikelihood, Variance = 2.375, xlabel Intercept, ylabel Loglikelihood contains 2 objects of type line. Axes object 4 with title Loglikelihood, Variance = 3.188, xlabel Intercept, ylabel Loglikelihood contains 2 objects of type line.

In each case, the maximum of the loglikelihood function occurs close to the true value of the intercept.

estimate also honors a subset of equality constraints while estimating all other parameters set to NaN. For example, suppose r=3, and you know that β2=5. Specify Beta = [NaN; 5; NaN] in the regARIMA model, and pass this model with the data to estimate.

estimate optionally returns the estimated covariance matrix for the estimated parameters. If any parameter known to the optimizer has an equality constraint, then the corresponding row and column of the variance-covariance matrix is composed of zeros.

See Also

|

Related Topics