How is root node value chosen in regression decision tree?

Question

Christiana Sasser 2020-9-8

0
链接

此问题的直接链接

https://ww2.mathworks.cn/matlabcentral/answers/590758-how-is-root-node-value-chosen-in-regression-decision-tree

回答： Ayush Aniket 2025-6-4

I understand the criteria for node splitting and how the root node variable is chosen but I do not understand how the actual value for the inequality at the root node is chosen. Is it just local optimization of the numbers? For example, I have a variety of whole number values ranging from 3 to 25 and the root node is chosing 9.5. This is not the median or mean, so why is this number chosen? Is it because the decision tree analyzed all potential values to see what had the lowest MSE to start with? If so, why did it chose a decimal number when all my data points are whole numbers?

Thank you for your help!

0 个评论
显示 -2更早的评论隐藏 -2更早的评论

请先登录，再进行评论。

请先登录，再回答此问题。

Answer 1

Ayush Aniket 2025-6-4

0
链接

此回答的直接链接

https://ww2.mathworks.cn/matlabcentral/answers/590758-how-is-root-node-value-chosen-in-regression-decision-tree#answer_1565883

在 MATLAB Online 中打开

The split value at the root node in a decision tree is chosen based on optimization criteria, not necessarily the median or mean. Decision trees aim to minimize impurity (for classification) or reduce variance/MSE (for regression).The algorithm evaluates all possible split points and selects the one that maximizes information gain or minimizes error.

Why a Decimal Value Instead of Whole Numbers?

Even if your dataset contains only whole numbers, the tree considers midpoints between consecutive values as potential split points.
For example, if your sorted values are {3, 5, 7, 9, 11, 13, ...}, the tree might evaluate splits at {4, 6, 8, 10, 12, ...}.
The split at 9.5 means the algorithm found that separating values below 9.5 from those above 9.5 resulted in the best reduction in impurity or error.

In MATLAB, you can visualize the tree using:

view(SVModelTree, 'Mode', 'graph');

Refer the following documentation to learn more about the viewing options: https://www.mathworks.com/help/stats/view-decision-tree.html

0 个评论
显示 -2更早的评论隐藏 -2更早的评论

请先登录，再进行评论。

How is root node value chosen in regression decision tree?

0 个评论
显示 -2更早的评论隐藏 -2更早的评论

回答（1 个）

0 个评论
显示 -2更早的评论隐藏 -2更早的评论

另请参阅

类别

标签

Community Treasure Hunt

How is root node value chosen in regression decision tree?

0 个评论 显示 -2更早的评论隐藏 -2更早的评论

回答（1 个）

0 个评论 显示 -2更早的评论隐藏 -2更早的评论

另请参阅

类别

标签

Community Treasure Hunt

0 个评论
显示 -2更早的评论隐藏 -2更早的评论

0 个评论
显示 -2更早的评论隐藏 -2更早的评论