Reading data from a website

7 次查看(过去 30 天)
Salma fathi
Salma fathi 2022-10-19
编辑: Salma fathi 2022-10-24
Hello, Ia m trying to read some data from the follwing website, I tried to use webread to acheive this but it would give me an array of charcters with size 1x1534400 char where we would like our data to be read into a table. I used the following lines
url="https://lgdc.uml.edu/common/DIDBGetValues?ursiCode=JI91J&charName=foF2,foF1,foE,foEs,hmF2,hmF1,hmE&DMUF=3000&fromDate=1997%2F01%2F01+00%3A00%3A00&toDate=1997%2F12%2F31+11%3A59%3A00";
%options = weboptions("ContentType", "text");
data = webread(url);
the website looks like the image belwo and we would like to ignore the first text lines and start reading from the heading of the table
if anyone can help with this, thanks in advance

回答(1 个)

Walter Roberson
Walter Roberson 2022-10-19
After you read the characters you can pass them to textscan as the first parameter. You should see either pass Headerlines or CommentStyle to skip the header. Use a %{}T format to describe the datetime and %s for the text fields.
  1 个评论
Salma fathi
Salma fathi 2022-10-24
编辑:Salma fathi 2022-10-24
Thank you for the help, I tried what you suggested and it worked fine except that I am facing one issue:
  1. I am able to read only to the third column, I think the issue is that forth column has the characters "/_", so I tried to ignore these columns using the '*', but I would still get the following fields to that as empty cells. Attached is an image to the cell array I get.
This is the line that I am using
C = textscan(data, '%{uuuu-MM-dd''T''HH:mm:ss.SSS''Z}D%f%f%*s%f%*s%f%*s%f%*s%f%*s%f%*s%f%*s' , 'whitespace', ' ', 'CommentStyle' , '#');
Any advice?

请先登录,再进行评论。

类别

Help CenterFile Exchange 中查找有关 String Parsing 的更多信息

产品


版本

R2021b

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by