Probit: removing groups that perfectly predict failures

Question

Tian 2021-7-29

0
链接

此问题的直接链接

https://ww2.mathworks.cn/matlabcentral/answers/888680-probit-removing-groups-that-perfectly-predict-failures

回答： Kumar Pallav 2021-8-4

采纳的回答： Kumar Pallav

data.mat

在 MATLAB Online 中打开

Hi all,

I have a group-year panel data as attahced. Apologies the data is very low quality.

There are 3 groups, each has 20 observations. Outcome y is a dummy variable for success. The first column in x is a continuous variable "effort". The second column is a dummy indicates group A. The third column is a dummy for group B. There is no dummy for group C to avoid collinearity.

I want to predict the probability of success using the probit model. The code I try is:

b = glmfit(x,y,'binomial','Link','probit');
b =
    0.1857 (constant)
   -1.8149 (effort)   
  -16.1148 (group A)
  -16.2994 (group B)

As you can see in the data, all outcomes for group A are failures. So the second column in x predicts y == 0 perfectly. Matlab also raises a warning:

  Warning: The estimated coefficients perfectly separate failures from successes. This means the theoretical best estimates are not finite.
For the fitted linear combination XB of the predictors, the sample proportions P of Y=N in the data satisfy:
   XB<-0.834093: P=0
   XB=-0.834093: P=1
   XB>-0.834093: P=0 

However, it still returns an estimated coefficient for group A dummy, which is b(3) = -16.1148.

Question:

Since x(:,2) perfectly predict failures, b(3) should be 0. Is there an option in glmfit to remove observations for group A within glmfit function, then return the coefficient as 0 for this column? So I can get something like:

   b =
    0.1857 (constant)
   -1.8149 (effort)   
     0     (group A)
    xxx    (group B)

Stata does this automatically using the command:

probit y effort i.group

It turns out the estiamtes for the constant and effort are the same. So the perfect failure issue only affects the group dummies coefficients...

Thank you!!!

0 个评论
显示 -2更早的评论隐藏 -2更早的评论

请先登录，再进行评论。

请先登录，再回答此问题。

Answer 1

Kumar Pallav 2021-8-4

0
链接

此回答的直接链接

https://ww2.mathworks.cn/matlabcentral/answers/888680-probit-removing-groups-that-perfectly-predict-failures#answer_760702

在 MATLAB Online 中打开

From my understanding ,for the coefficient vector b, you expect the b(3)=0 as you mentioned that the second column of x (group A dummies) are failures(that is 0). But , after inspecting the data, I see that the second column of x are not all zeros.

%check if any non-zero value in the vector
containsNonZero = any(x(:,2)) %returns 1 if true

However, if you change the values of second column of x to zero

%change second column values of x to zero
x(:,2)=0;
b = glmfit(x,y,'binomial','Link','probit')

Then, the b(3) value becomes 0.

0 个评论
显示 -2更早的评论隐藏 -2更早的评论

请先登录，再进行评论。

Probit: removing groups that perfectly predict failures

0 个评论
显示 -2更早的评论隐藏 -2更早的评论

采纳的回答

0 个评论
显示 -2更早的评论隐藏 -2更早的评论

更多回答（0 个）

另请参阅

类别

标签

Community Treasure Hunt

Probit: removing groups that perfectly predict failures

0 个评论 显示 -2更早的评论隐藏 -2更早的评论

采纳的回答

0 个评论 显示 -2更早的评论隐藏 -2更早的评论

更多回答（0 个）

另请参阅

类别

标签

Community Treasure Hunt

0 个评论
显示 -2更早的评论隐藏 -2更早的评论

0 个评论
显示 -2更早的评论隐藏 -2更早的评论