normalizeWords
Stem or lemmatize words
Syntax
updatedDocuments = normalizeWords(documents)
updatedWords = normalizeWords(words)
updatedWords = normalizeWords(words,'Language',language)
Description
Use normalizeWords to reduce words to a root form. To lemmatize English words (reduce them to their dictionary forms), set the 'Style' option to 'lemma'.
The function supports English, Japanese, German, and Korean text.
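As a minimal sketch of the 'Style' option mentioned above, the following lemmatizes a short English document. The sample sentence and variable names are illustrative assumptions, not part of the original page. The call to addPartOfSpeechDetails is included on the assumption that part-of-speech details help the lemmatizer, and the code requires Text Analytics Toolbox.

```matlab
% Illustrative input (an assumption; any English text works).
str = "The children were running through the gardens";

% Tokenize the text so that normalizeWords can operate on a document.
documents = tokenizedDocument(str);

% Add part-of-speech details before lemmatizing.
documents = addPartOfSpeechDetails(documents);

% Reduce each word to its dictionary form instead of a stem.
lemmatizedDocuments = normalizeWords(documents,'Style','lemma');
```

Stemming can produce tokens that are not real words, so lemmatization may be the better choice when the normalized words need to remain readable.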
updatedDocuments = normalizeWords(documents) reduces the words in documents to a root form. For English and German text, the function stems the words by default, using the Porter stemmer for the respective language. For Japanese and Korean text, the function lemmatizes the words by default, using the MeCab tokenizer.
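The default behavior for documents can be sketched as follows. The input strings and variable names are illustrative assumptions; the text is English, so the default here is stemming. The code requires Text Analytics Toolbox.

```matlab
% Illustrative input (an assumption): two short English documents.
str = [
    "a strongly worded collection of words"
    "another collection of words"];

% Convert the strings to a tokenizedDocument array.
documents = tokenizedDocument(str);

% Stem the words using the default style for English text.
updatedDocuments = normalizeWords(documents);
```

The output is a tokenizedDocument array with one element per input document, so it can be passed on to functions such as bagOfWords.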
updatedWords = normalizeWords(words) reduces each word in the string array words to a root form.
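For word input, a minimal sketch looks like this. The word list is an illustrative assumption, and the code requires Text Analytics Toolbox.

```matlab
% Illustrative input (an assumption): a string array of single words.
words = ["a" "strongly" "worded" "collection" "of" "words"];

% Stem each word; each element of the array is treated as one word.
updatedWords = normalizeWords(words);
```

This form is convenient when the text has already been split into words by some other means and a tokenizedDocument is not needed.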
updatedWords = normalizeWords(words,'Language',language) reduces the words and also specifies the word language.
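A short sketch of the 'Language' option, assuming German input. The word list is an illustrative assumption, and 'de' is used here as the language code for German. The code requires Text Analytics Toolbox.

```matlab
% Illustrative input (an assumption): a string array of German words.
words = ["Häuser" "laufen" "schönes"];

% Stem the words, stating explicitly that they are German.
updatedWords = normalizeWords(words,'Language','de');
```

Stating the language explicitly is useful for string arrays of words, which carry no language details of their own.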