webread not yielding actual website

2 次查看（过去 30 天）

Jakob Sievers 2022-7-13

0
链接

此问题的直接链接

https://ww2.mathworks.cn/matlabcentral/answers/1759285-webread-not-yielding-actual-website

评论： Jakob Sievers 2022-7-14

Hi there. I'm trying to learn how to extract information from websites. As an example, i'm trying to extract text from Facebook posts but webread gives me something which appears to be quite different from what I'm actually seeing on the website. I'm a complete Noob at this particular type of task and so I was hoping I could get some pointers concerning how to get the text, as I see it, rather than some obscured version. Thanks in advance!

3 个评论
显示 1更早的评论隐藏 1更早的评论

DGM 2022-7-14

编辑：DGM 2022-7-14

Considering the source, I'm going to guess it's dynamic content.

https://www.mathworks.com/matlabcentral/answers/1750720-webread-not-returning-full-html-contents

Without knowing what page and what content exactly is being targeted, it's hard to be sure.

Jakob Sievers 2022-7-14

@DGM: reading through the references in the thread you're referring to, I think it may be the exact problem that the stuff you're seeing on sites like Facebook is created not by basic HTML but by tons of scripts and such, which webread then is not able to extract.

Is there no way to dig deeper than webread, using matlab? I'd really like to stay on the Matlab platform, which I'm most familiar with, before considering other alternatives

请先登录，再进行评论。

请先登录，再回答此问题。

回答（0 个）

请先登录，再回答此问题。

类别

MATLAB Environment and Settings

在 Help Center 和 File Exchange 中查找有关 Environment and Settings 的更多信息

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by

webread not yielding actual website

3 个评论
显示 1更早的评论隐藏 1更早的评论

回答（0 个）

另请参阅

类别

标签

Community Treasure Hunt

webread not yielding actual website

3 个评论 显示 1更早的评论隐藏 1更早的评论

回答（0 个）

另请参阅

类别

标签

Community Treasure Hunt

3 个评论
显示 1更早的评论隐藏 1更早的评论