
Java HTML parser doesn't read the whole page

I'm parsing HTML pages to get specific information, but for some pages I can't get all the information that is displayed in the browser. For example, on this page I can't get the review information.

By the way, if you look at the page's source code, there are a lot of empty lines, and the review information doesn't appear.

Do you know why? Is there a library that can read this type of page?

Thanks


I'm willing to bet they are using JavaScript to load the review information after the initial page load, which would explain why it doesn't appear in the HTML source your parser receives. To access that information, you need to either mimic the request the script makes or evaluate the JavaScript and then parse the resulting page. I would suggest examining their JavaScript and mimicking the request it uses to download the review information, as that will be much easier than attempting to evaluate the JavaScript in your code.
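To make the "mimic the request" idea concrete, here is a minimal sketch using the `java.net.http` client that ships with Java 11 and later. Everything specific in it is an assumption for illustration: the endpoint URL, the `productId` and `page` parameter names, and the headers are hypothetical. You would find the real ones by opening the browser's developer tools, watching the Network tab while the reviews load, and copying the request the page actually makes.

```java
import java.io.IOException;
import java.net.URI;
import java.net.URLEncoder;
import java.net.http.HttpClient;
import java.net.http.HttpRequest;
import java.net.http.HttpResponse;
import java.nio.charset.StandardCharsets;

class ReviewFetcher {

    // Builds a GET request shaped like the one the page's script might send.
    // The query parameter names ("productId", "page") are hypothetical.
    static HttpRequest buildReviewsRequest(String endpoint, String productId, int page) {
        String query = "productId=" + URLEncoder.encode(productId, StandardCharsets.UTF_8)
                + "&page=" + page;
        return HttpRequest.newBuilder(URI.create(endpoint + "?" + query))
                // Some sites only answer requests that look like AJAX calls;
                // whether this header is needed is an assumption.
                .header("X-Requested-With", "XMLHttpRequest")
                .header("Accept", "application/json")
                .GET()
                .build();
    }

    // Sends the request and returns the raw response body (often JSON or an
    // HTML fragment), which you can then hand to your existing parser.
    static String fetchReviews(String endpoint, String productId, int page)
            throws IOException, InterruptedException {
        HttpClient client = HttpClient.newHttpClient();
        HttpResponse<String> response = client.send(
                buildReviewsRequest(endpoint, productId, page),
                HttpResponse.BodyHandlers.ofString());
        if (response.statusCode() != 200) {
            throw new IOException("Unexpected status: " + response.statusCode());
        }
        return response.body();
    }
}
```

You would call it with something like `ReviewFetcher.fetchReviews("https://example.com/ajax/reviews", "12345", 1)`, where both the URL and the ID are placeholders. If the site's request turns out to be a POST, or needs cookies from the original page, the builder would have to be adjusted to match what you see in the Network tab.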
