What formula is used to calculate perplexity in fitlda?

Question

Stephen Bruestle 2019-1-22

0
链接

此问题的直接链接

https://ww2.mathworks.cn/matlabcentral/answers/440960-what-formula-is-used-to-calculate-perplexity-in-fitlda

回答： Ilya 2019-3-13

采纳的回答： Ilya

Many sources have different formulas. I want to make sure that I am referencing the correct formula.

0 个评论
显示 -2更早的评论隐藏 -2更早的评论

请先登录，再进行评论。

请先登录，再回答此问题。

Answer 1

Ilya 2019-3-13

1
链接

此回答的直接链接

https://ww2.mathworks.cn/matlabcentral/answers/440960-what-formula-is-used-to-calculate-perplexity-in-fitlda#answer_365172

If you are asking about the 2nd output from the logp method, document log-probabilities are estimated using the Mean-Field Approximation described in the paper cited at the bottom of that doc page. Perplexity is then

exp(-sum(logprob)/Nwords)

where Nwords is the total word count across all documents.

If you are asking about perplexity displayed during training when you pass 'Verbose' to fitlda, those document log-probabilities are computed using current estimates of topic probabilities per document. The perplexity formula is the same as above. Because document log-probabilities are evaluated at the max likelihood estimates of topic probabilities per document, these document probabilities are overestimated and perplexity is therefore underestimated. This is done for speed. The MFA approach gives a more accurate estimate by integtrating over topic probabilities at the cost of longer runtime.

0 个评论
显示 -2更早的评论隐藏 -2更早的评论

请先登录，再进行评论。

What formula is used to calculate perplexity in fitlda?

0 个评论
显示 -2更早的评论隐藏 -2更早的评论

采纳的回答

0 个评论
显示 -2更早的评论隐藏 -2更早的评论

更多回答（0 个）

另请参阅

类别

标签

产品

Community Treasure Hunt

What formula is used to calculate perplexity in fitlda?

0 个评论 显示 -2更早的评论隐藏 -2更早的评论

采纳的回答

0 个评论 显示 -2更早的评论隐藏 -2更早的评论

更多回答（0 个）

另请参阅

类别

标签

产品

Community Treasure Hunt

0 个评论
显示 -2更早的评论隐藏 -2更早的评论

0 个评论
显示 -2更早的评论隐藏 -2更早的评论