How to separate table data from HTML?

I am reading a web page using the HTTP POST method with the urlread function. How can I separate the HTML table data from the rest of the HTML (str)?
ur='https://www.w3schools.com/html/html_tables.asp';
str=urlread(ur)

Accepted Answer

Paolo 2018-6-6
Edited: Paolo 2018-6-6
You should use webread rather than urlread, as the documentation recommends. You can use regexp to extract HTML elements from an HTML document. The regexp documentation page has examples of how to do so.
Use the following code to extract the <table> elements.
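ur = 'https://www.w3schools.com/html/html_tables.asp';
str = webread(ur);
% Using urlread instead will give the same output.
% str = urlread(ur);
% Lazily match from each opening <table ...> tag to the next closing </table> tag.
% The token \1 refers back to the captured tag name, "table".
expression = '<(table).*?</\1>';
% 'match' returns the matched substrings as a cell array.
matches = regexp(str,expression,'match');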
matches contains the two <table> elements in the HTML.
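To go one step further and pull the cell contents out of a matched table, the same regexp approach can be applied to rows and cells. The following is only a minimal sketch, assuming a simple table with no nested tables and lowercase tags. The variable tableHtml is a hypothetical stand-in for one element of matches (for example matches{1}), and the variable names rows, cells and data are illustrative.

```matlab
% Hypothetical sample input standing in for one element of `matches`.
tableHtml = ['<table id="customers">' ...
    '<tr><th>Company</th><th>Contact</th><th>Country</th></tr>' ...
    '<tr><td>Alfreds Futterkiste</td><td>Maria Anders</td><td>Germany</td></tr>' ...
    '<tr><td>Centro comercial Moctezuma</td><td>Francisco Chang</td><td>Mexico</td></tr>' ...
    '</table>'];

% Split the table into rows: capture everything between <tr ...> and </tr>.
% (?:\s[^>]*)? allows optional attributes without matching other tag names.
rows = regexp(tableHtml, '<tr(?:\s[^>]*)?>(.*?)</tr>', 'tokens');

data = cell(numel(rows), 0);
for r = 1:numel(rows)
    % Capture the contents of each header (<th>) or data (<td>) cell.
    cells = regexp(rows{r}{1}, '<t[hd](?:\s[^>]*)?>(.*?)</t[hd]>', 'tokens');
    for c = 1:numel(cells)
        % Remove any nested tags and trim surrounding whitespace.
        data{r, c} = strtrim(regexprep(cells{c}{1}, '<[^>]*>', ''));
    end
end

disp(data)
```

Here data is a cell array with one row per <tr> and one column per cell. Regular expressions are fragile on real-world HTML (nested tables, uppercase tags, malformed markup), so treat this as a starting point rather than a general parser.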


Release

R2017a
