HTML file scraping for Fields in a Table
2 次查看(过去 30 天)
显示 更早的评论
The HTML file that I am working on, is a long one and contains particulars as given in the attached text file. Although the structure is simple and repetitive, due to the large number of characters in between the data fields, I am having hard time in scraping the required data. The objective is to get a two-column excel spreadsheet containing Name in the first column and Email in the second column. How to obtain these required fields in the xlsx file ? Thanks.
0 个评论
回答(1 个)
Sean de Wolski
2020-6-15
Start playing with htmlTree in the Text analytics toolbox.
t = htmlTree(fileread('clientdata.txt'))
t.findElement('TD').extractHTMLText
另请参阅
类别
在 Help Center 和 File Exchange 中查找有关 Text Data Preparation 的更多信息
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!