Main Content

Tobit

Create Tobit model object for loss given default

Since R2021a

Description

Create and analyze a Tobit model object to calculate loss given default (LGD) using this workflow:

  1. Use fitLGDModel to create a Tobit model object.

  2. Use predict to predict the LGD.

  3. Use modelDiscrimination to return AUROC and ROC data. You can plot the results using modelDiscriminationPlot.

  4. Use modelCalibration to return the R-squared, RMSE, correlation, and sample mean error of predicted and observed LGD data. You can plot the results using modelCalibrationPlot.

Creation

Description

TobitLGDModel = fitLGDModel(data,ModelType) creates a Tobit LGD model object.

example

TobitLGDModel = fitLGDModel(___,Name,Value) specifies options using one or more name-value arguments in addition to the input arguments in the previous syntax. The optional name-value arguments set the model object properties. For example, lgdModel = fitLGDModel(data,'tobit',PredictorVars={'LTV' 'Age' 'Type'},ResponseVar="LGD",CensoringSide="left",LeftLimit=1e-4,WeightsVar="Weights") creates a lgdModel object using a Tobit model type.

example

Input Arguments

expand all

Data for loss given default, specified as a table.

Data Types: table

Model type, specified as a string with the value of "Tobit" or a character vector with the value of 'Tobit'.

Data Types: char | string

Name-Value Arguments

Specify optional pairs of arguments as Name1=Value1,...,NameN=ValueN, where Name is the argument name and Value is the corresponding value. Name-value arguments must appear after other arguments, but the order of the pairs does not matter.

Before R2021a, use commas to separate each name and value, and enclose Name in quotes.

Example: lgdModel = fitLGDModel(data,'tobit',PredictorVars={'LTV' 'Age' 'Type'},ResponseVar="LGD",CensoringSide="left",LeftLimit=1e-4)

User-defined model ID, specified as the comma-separated pair consisting of 'ModelID' and a string or character vector. The software uses the ModelID text to format outputs and is expected to be short.

Data Types: string | char

User-defined description for model, specified as the comma-separated pair consisting of 'Description' and a string or character vector.

Data Types: string | char

Predictor variables, specified as the comma-separated pair consisting of 'PredictorVars' and a string array or cell array of character vectors. PredictorVars indicates which columns in the data input contain the predictor information. By default, PredictorVars is set to all the columns in the data input except for ResponseVar.

Data Types: string | cell

Response variable, specified as the comma-separated pair consisting of 'ResponseVar' and a string or character vector. The response variable contains the LGD data and must be a numeric variable. An LGD value of 0 indicates no loss (full recovery), 1 indicates total loss (no recovery), and values between 0 and 1 indicate a partial loss. By default, ResponseVar is set to the last column.

Data Types: string | char

Censoring side, specified as the comma-separated pair consisting of 'CensoringSide' and a character vector or string. CensoringSide indicates whether the desired Tobit model is left-censored, right-censored, or censored on both sides.

Data Types: string | char

Left-censoring limit, specified as the comma-separated pair consisting of 'LeftLimit' and a scalar numeric between 0 and 1.

Data Types: double

Right-censoring limit, specified as the comma-separated pair consisting of 'RightLimit' and a scalar numeric between 0 and 1.

Data Types: double

Options for fitting, specified as the comma-separated pair consisting of 'SolverOptions' and an optimoptions object that is created using optimoptions from Optimization Toolbox™. The defaults for the optimoptions object are:

  • "Display""none"

  • "Algorithm""sqp"

  • "MaxFunctionEvaluations"500 ✕ Number of model coefficients

  • "MaxIterations" — The number of Tobit model coefficients is determined at run time, it depends on the number of predictors and the number of categories in the categorical predictors.

Note

When using optimoptions with a Tobit model, specify the SolverName as fmincon.

Data Types: object

Column name of the input table containing weights, specified as a string scalar.

Note

The default value ("") results in a weight of 1 for each row in data. All weight values in data must be nonnegative.

For an example using WeightsVar, see Create Weighted LGD Model.

Properties

expand all

User-defined model ID, returned as a string.

Data Types: string

User-defined description, returned as a string.

Data Types: string

This property is read-only.

Underlying statistical model, returned as a compact linear model object. The compact version of the underlying regression model is an instance of the classreg.regr.CompactLinearModel class. For more information, see fitlm and CompactLinearModel.

Data Types: CompactLinearModel

Predictor variables, returned as a string array.

Data Types: string

Response variable, returned as a string.

Data Types: string

This property is read-only.

Censoring side, returned as a string.

Data Types: string

This property is read-only.

Left-censoring limit, returned as a scalar numeric between 0 and 1.

Data Types: double

This property is read-only.

Right-censoring limit, returned as a scalar numeric between 0 and 1.

Data Types: double

Column name of the input table containing weights, returned as a string scalar. This property is also used to determine the weights variable for validation data when you use the modelDiscrimination or modelCalibration functions.

Object Functions

predictPredict loss given default
modelDiscriminationCompute AUROC and ROC data
modelDiscriminationPlotPlot ROC curve
modelCalibrationCompute R-square, RMSE, correlation, and sample mean error of predicted and observed LGDs
modelCalibrationPlotScatter plot of predicted and observed LGDs

Examples

collapse all

This example shows how to use fitLGDModel to create a Tobit model for loss given default (LGD).

Load LGD Data

Load the LGD data.

load LGDData.mat
head(data)
      LTV        Age         Type           LGD   
    _______    _______    ___________    _________

    0.89101    0.39716    residential     0.032659
    0.70176     2.0939    residential      0.43564
    0.72078     2.7948    residential    0.0064766
    0.37013      1.237    residential     0.007947
    0.36492     2.5818    residential            0
      0.796     1.5957    residential      0.14572
    0.60203     1.1599    residential     0.025688
    0.92005    0.50253    investment      0.063182
rng('default');
NumObs = height(data);
c = cvpartition(NumObs,'HoldOut',0.4);
TrainingInd = training(c);
TestInd = test(c);

Create Tobit LGD Model

Use fitLGDModel to create a Tobit model using the TrainingInd data.

lgdModel = fitLGDModel(data(TrainingInd,:),'Tobit',...
   'ModelID','Example Tobit',...
   'PredictorVars',{'LTV' 'Age' 'Type'},...
   'ResponseVar','LGD',...
   'CensoringSide','left',...
   'LeftLimit',1e-4);
disp(lgdModel)
  Tobit with properties:

      CensoringSide: "left"
          LeftLimit: 1.0000e-04
         RightLimit: 1
            Weights: [0x1 double]
            ModelID: "Example Tobit"
        Description: ""
    UnderlyingModel: [1x1 risk.internal.credit.TobitModel]
      PredictorVars: ["LTV"    "Age"    "Type"]
        ResponseVar: "LGD"
         WeightsVar: ""

Display the underlying model. The underlying model is a left-censored Tobit model. Use the 'CensoringSide' argument and the 'LeftLimit' and 'RightLimit' arguments to modify the underlying Tobit model.

disp(lgdModel.UnderlyingModel)
Tobit regression model, left-censored:
     LGD = max(0.0001,Y*)
     Y* ~ 1 + LTV + Age + Type

Estimated coefficients:
                       Estimate       SE         tStat       pValue  
                       ________    _________    _______    __________

    (Intercept)        0.057356     0.026618     2.1548       0.03129
    LTV                  0.2003     0.030651     6.5346     7.984e-11
    Age                -0.09405    0.0072992    -12.885             0
    Type_investment     0.10071     0.017917      5.621    2.1528e-08
    (Sigma)             0.28833    0.0055226     52.209             0

Number of observations: 2093
Number of left-censored observations: 547
Number of uncensored observations: 1546
Number of right-censored observations: 0
Log-likelihood: -638.353

Predict LGD

For Tobit models, use predict to calculate the predicted LGD value, which is the unconditional expected value of the response, given the predictor values.

predictedLGD = predict(lgdModel,data(TestInd,:))
predictedLGD = 1394×1

    0.0871
    0.1228
    0.3181
    0.0926
    0.1654
    0.2215
    0.2347
    0.0102
    0.1576
    0.1969
      ⋮

Validate LGD Model

Use modelDiscriminationPlot to plot the ROC curve.

modelDiscriminationPlot(lgdModel,data(TestInd,:))

Figure contains an axes object. The axes object with title ROC Example Tobit, AUROC = 0.67989, xlabel False Positive Rate, ylabel True Positive Rate contains an object of type line. This object represents Example Tobit.

Use modelCalibrationPlot to show a scatter plot of the predictions.

modelCalibrationPlot(lgdModel,data(TestInd,:))

Figure contains an axes object. The axes object with title Scatter Example Tobit, R-Squared: 0.0855, xlabel LGD Predicted, ylabel LGD Observed contains 2 objects of type scatter, line. These objects represent Data, Fit.

More About

expand all

References

[1] Baesens, Bart, Daniel Roesch, and Harald Scheule. Credit Risk Analytics: Measurement Techniques, Applications, and Examples in SAS. Wiley, 2016.

[2] Bellini, Tiziano. IFRS 9 and CECL Credit Risk Modelling and Validation: A Practical Guide with Examples Worked in R and SAS. San Diego, CA: Elsevier, 2019.

Version History

Introduced in R2021a

expand all