how does fitglm treat categorical variables?

13 次查看(过去 30 天)
Dear all,
I'd like to verify something about categorical variables in fitglm. As input I use a table which also contains categorical variables (sex (0 vs 1), and education (on a scale 1 to 7). If I understand the documentation correctly, fitglm automatically treats these as categorical, and it also automatically dummy codes when necessary? (for education in this case). Is this correct?
Thanks very much.
Best
Hans van der Horn

采纳的回答

Jeff Miller
Jeff Miller 2022-3-6
I don't that is correct. WIth numerical values in the table, I think you have to mark the variables as categorical with something like:
tbl.sex = categorical(tbl.sex);
tbl.education = categorical(tbl.education);
Where there are three or more categories (e.g., education) you can check the df's to make sure that the variable has been treated as categorical (6 dfs) rather than numerical (1 df, essentially a regression slope across 1-7).
  2 个评论
Hans van der Horn
Dear Jeff,
Thanks for your answer. I tried as you suggested and indeed it works better, with now dummy variables defined in the model.
Best Hans
Jeff Miller
Jeff Miller 2022-3-7
Hi Hans,
That sounds good. If this answer solves the problem that you were having, then please accept it using the "Accept" button (so that the question no longer appears to be open).
Jeff

请先登录,再进行评论。

更多回答(0 个)

类别

Help CenterFile Exchange 中查找有关 Dimensionality Reduction and Feature Extraction 的更多信息

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by