sonnetsCounts.mat file

8 次查看(过去 30 天)
Does anyone know how the sonnetsCounts.mat file was created on the following MATLAB page: https://uk.mathworks.com/help/textanalytics/ref/ldamodel.predict.html
Predict Top LDA Topics of Word Count Matrix
Load the example data. sonnetsCounts.mat contains a matrix of word counts and a corresponding vocabulary of preprocessed versions of Shakespeare's sonnets.
load sonnetsCounts.mat
size(counts)
ans = 1×2
154 3092
When I open the sonnetsCounts.mat file, it has the following data
val =
(1,1) 1
(106,1) 1
(131,1) 2
(154,1) 1
(1,2) 1
(143,2) 1
I presume the second column in the frequency of words. But I'm not sure if the vector in the first column represents two words?
Peter

采纳的回答

Walter Roberson
Walter Roberson 2018-12-24
编辑:Walter Roberson 2018-12-24
The counts is a sparse matrix.
(143,2) 1
means that sonnet #143, unique word #2, had a count of 1.
  4 个评论
Peter Mayhew
Peter Mayhew 2018-12-26
编辑:Peter Mayhew 2018-12-26
OK, so if I understand correctly. I would perform the following command
bag = bagOfWords(documents);
Then check the counts property of variable bag.
Walter Roberson
Walter Roberson 2018-12-26
Counts with a capital C, but Yes.

请先登录,再进行评论。

更多回答(0 个)

类别

Help CenterFile Exchange 中查找有关 Statistics and Machine Learning Toolbox 的更多信息

标签

产品


版本

R2018b

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by