Get Started with Text Analytics Toolbox
Text Analytics Toolbox provides algorithms and visualizations for preprocessing, analyzing, and modeling text data. Models created with the toolbox can be used in applications such as sentiment analysis, predictive maintenance, and topic modeling.
| Function Name | Description |
|---|---|
| wordcloud | Create word cloud chart from bag-of-words or LDA model |
| wordCloudCounts | Count words for word cloud creation |
| textscatter | 2-D scatter plot of text |
| textscatter3 | 3-D scatter plot of text |
| heatmap | Create heatmap chart |
| histcounts | Histogram bin counts |
| discretize | Group data into bins or categories |
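
As a small illustration of the two binning helpers listed above, the sketch below bins a made-up vector of document lengths with `histcounts` and `discretize`. The data and the bin edges are hypothetical and chosen only for illustration.

```matlab
% Hypothetical data: number of words in each of eight documents
docLengths = [12 45 7 88 23 51 5 67];

% Assumed bin edges. Each bin includes its left edge, and the last bin
% also includes its right edge.
edges = [0 25 50 75 100];

% Count how many documents fall into each bin
counts = histcounts(docLengths, edges);

% Map each document to the index of the bin that contains it
binIdx = discretize(docLengths, edges);

disp(counts)
disp(binIdx)
```

The bin indices can serve as group labels, for example to compare short and long documents in a later chart.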

| Function Name | Description |
|---|---|
|  | Read word embedding from text file |
|  | Train word embedding |
|  | Maps words to embedding vectors |
|  | Latent Dirichlet allocation (LDA) model |
|  | Latent semantic analysis (LSA) model |
|  | Bag-of-words model |
|  | Fit latent Dirichlet allocation (LDA) model |
|  | Fit a latent semantic analysis (LSA) model |
|  | Predict top LDA topics of documents |
|  | Fit probability distribution object to data |
|  | Fit linear regression model to high-dimensional data |
|  | Fit linear classification model to high-dimensional data |
|  | Fit multiclass models for classifiers |
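
The modeling rows above describe a workflow of building a bag-of-words model, fitting an LDA model to it, and predicting the top topic of each document. The sketch below shows one way this might look. The table does not name the functions, so the use of `tokenizedDocument`, `bagOfWords`, `fitlda`, and `predict` here is an assumption based on the descriptions, and the four-document corpus is hypothetical. A corpus this small will not produce meaningful topics; the point is only the sequence of calls.

```matlab
% Hypothetical corpus: two maintenance notes and two billing notes
textData = [
    "pump bearing vibration noise pump"
    "bearing wear vibration alarm"
    "invoice payment overdue customer"
    "customer payment refund invoice"];

% Split each note into words, then count word occurrences per document
documents = tokenizedDocument(textData);
bag = bagOfWords(documents);

% Fit an LDA model with two topics (assumed topic count); fix the random
% seed so repeated runs start from the same state
rng("default")
mdl = fitlda(bag, 2, 'Verbose', 0);

% Index of the most probable topic for each document
topicIdx = predict(mdl, documents);
disp(topicIdx)
```

The number of topics is a modeling choice; in practice it is usually selected by comparing models on held-out documents.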

| Function Name | Description |
|---|---|
|  | Read from PDF, Microsoft Word, and plain text |
|  | Read formatted data from text file or string |
|  | Create table from file |
|  | Convert data into formatted string array |
|  | Read Microsoft Excel spreadsheet file |
|  | Read content from RESTful web service |
|  | Datastore for tabular text files |
|  | Datastore with custom file reader |
|  | Datastore for spreadsheet files |
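
To make the import rows above concrete, the sketch below writes two small temporary files and reads them back. It assumes that "Create table from file" corresponds to `readtable` and that "Read from PDF, Microsoft Word, and plain text" corresponds to `extractFileText`; the table itself does not name the functions. The file names and contents are hypothetical.

```matlab
% Write a hypothetical CSV file of reviews to the temporary folder
csvFile = fullfile(tempdir, "reviews_example.csv");
T = table(["Great product"; "Stopped working after a week"], [5; 1], ...
    'VariableNames', {'Review', 'Rating'});
writetable(T, csvFile);

% Read the file back as a table, importing text columns as strings
data = readtable(csvFile, 'TextType', 'string');

% Write a hypothetical plain-text note and read it as a single string
txtFile = fullfile(tempdir, "note_example.txt");
fid = fopen(txtFile, "w");
fprintf(fid, "%s", "Pump vibration noted during inspection.");
fclose(fid);
str = extractFileText(txtFile);

disp(data)
disp(str)
```

Importing text as strings keeps the result compatible with the tokenization and string functions used in later steps.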

| Function Name | Description |
|---|---|
|  | Split documents into collections of words |
|  | Remove inflections from words using the Porter stemmer |
|  | Bag-of-words model |
|  | Stop word list |
|  | Search documents for word occurrences in context |
|  | Remove selected words from document or bag-of-words |
|  | Remove long words from documents or bag-of-words |
|  | Remove short words from documents or bag-of-words |
|  | Remove words with low counts from bag-of-words model |
|  | Erase punctuation from text and documents |
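
The preprocessing rows above are typically chained into a pipeline. The sketch below is one possible ordering, not a prescribed one. The table does not name the functions, so `tokenizedDocument`, `erasePunctuation`, `stopWords`, `removeWords`, `removeShortWords`, `normalizeWords`, `bagOfWords`, and `removeInfrequentWords` are assumed here from the descriptions, and the two sentences are hypothetical.

```matlab
% Hypothetical raw text
str = [
    "The quick brown fox jumps over the lazy dog."
    "A lazy dog sleeps; the quick fox runs away!"];

% Split into words, strip punctuation, and convert to lowercase
documents = tokenizedDocument(str);
documents = erasePunctuation(documents);
documents = lower(documents);

% Drop stop words and words with two or fewer characters
documents = removeWords(documents, stopWords);
documents = removeShortWords(documents, 2);

% Reduce words to a root form (stemming by default for English text)
documents = normalizeWords(documents);

% Count words per document, then drop words that appear at most once
bag = bagOfWords(documents);
bag = removeInfrequentWords(bag, 1);

disp(bag.Vocabulary)
```

The order of the steps matters: removing stop words before stemming avoids stemmed forms that no longer match the stop word list.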

| Function Name | Description |
|---|---|
|  | Declare a string variable |
|  | Declare a string array |
|  | Convert a character vector C to a string |
|  | Convert a string to double-precision numbers |
|  | Return the length of strings |
|  | Determine if input is string array |
|  | Combine strings |
|  | Split strings in string array |
|  | Split string at newline characters |
|  | Find and replace substrings in string array |
|  | Determine if pattern is in string |
|  | Delete substrings within strings |
|  | Extract substrings between indicators |
|  | Extract substring after specified position |
|  | Extract substring before specified position |
|  | Compare strings |
|  | Match regular expression (case sensitive) |
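
The string operations described above can be tried on a single example string. The table does not name the functions, so the mapping used in this sketch (`strlength`, `isstring`, `split`, `join`, `splitlines`, `replace`, `contains`, `erase`, `extractBetween`, `extractAfter`, `extractBefore`, `strcmp`, `str2double`, `regexp`) is an assumption based on the descriptions. The example values are hypothetical.

```matlab
% Create a string scalar and convert a character vector to a string
str = "Text Analytics Toolbox";
fromChar = string('Text Analytics Toolbox');

% Basic queries
n = strlength(str);                      % number of characters
tf = isstring(str);                      % true for string arrays
same = strcmp(str, fromChar);            % case-sensitive comparison

% Split on whitespace and recombine with a different delimiter
words = split(str);
joined = join(words, "-");

% Split a two-line string at the newline character
rows = splitlines("first line" + newline + "second line");

% Search, replace, and delete substrings
hasTool = contains(str, "Tool");
swapped = replace(str, "Toolbox", "Functions");
trimmed = erase(str, " Toolbox");

% Extract substrings relative to other substrings
middle = extractBetween(str, "Text ", " Toolbox");
tail = extractAfter(str, "Text ");
head = extractBefore(str, " Analytics");

% Convert numeric text to a double and match capitalized words
num = str2double("3.14");
caps = regexp(str, "[A-Z]\w+", "match");
```

Most of these functions also accept string arrays and operate on each element, which is what makes them convenient for cleaning whole columns of text.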