Importance of predictors within an Optimizable GPR model

5 views (last 30 days)
Hello everyone,
Related to a previous topic I posted (https://it.mathworks.com/matlabcentral/answers/2011417-regression-learner-app-relative-weights-of-variables#answer_1296851?s_tid=prof_contriblnk), I'd like to know whether it is possible to find the predictor importance of my model at a global level, something like the first plot shown at https://it.mathworks.com/help/stats/lime.plot.html, but for the whole model rather than for an individual queryPoint.
thanks again!
best,
Laura

Accepted Answer

Ive J on 11 Sep 2023
It's almost always good to look at both local and global levels. In the case of LIME or SHAP, you can compute the values on a random subset of your training dataset and take the average absolute SHAP value per predictor (this can be quite computationally heavy for GPR, though).
numSample = 1000; % number of randomly selected observations
shap_idx = randsample(1:height(x_train), numSample);

% x_train is a table whose last column is the target variable
shap_train = x_train(shap_idx, 1:end-1);

% check the other shapley arguments, e.g. if categorical features are present
explainer = shapley(yourGPRmodel, shap_train, UseParallel=true);

% compute Shapley values for each sampled observation
shap = zeros(size(shap_train, 1), size(shap_train, 2));
for k = 1:size(shap_train, 1)
    explainer = fit(explainer, shap_train(k, :), UseParallel=true);
    shap(k, :) = explainer.ShapleyValues.ShapleyValue';
end

% mean absolute Shapley value per predictor = global importance
shap_mean_abs = mean(abs(shap), 1)';
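To rank predictors globally, you can then plot the averaged values. A minimal sketch, assuming the shap_train and shap_mean_abs variables from above and that the Shapley values follow the column order of shap_train:
% Bar chart of mean absolute Shapley values (global importance)
predictorNames = shap_train.Properties.VariableNames;
[sortedVals, order] = sort(shap_mean_abs, 'descend');
figure
barh(sortedVals)
yticks(1:numel(sortedVals))
yticklabels(predictorNames(order))
xlabel('Mean |Shapley value|')
title('Global predictor importance (averaged SHAP)')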
At a global level you can also try
doc plotPartialDependence
Also check the ICE (individual conditional expectation) plots produced by the same function if you want curves for individual observations; see the sketch below.
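For instance, a minimal sketch (the predictor name 'x1' is a placeholder; yourGPRmodel and x_train as above):
% Partial dependence plot for one predictor (global trend)
plotPartialDependence(yourGPRmodel, 'x1', x_train(:, 1:end-1))

% ICE curves for the same predictor, one line per observation
figure
plotPartialDependence(yourGPRmodel, 'x1', x_train(:, 1:end-1), Conditional='absolute')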

More Answers (0)
