开发者

Web page monitoring question

There are a number of different websites that let yo开发者_JS百科u monitor specifi web pages for any changes, such as watchthatpage.com or page2rss.com

I'm interested in the way how those sites are working, meaning how do they determine whether some web page is updated. Do they just copy all the text from the page, store it in memory and compare it later to the content of a site's page? Or maybe they look for some specific html elements and compare theirs values?

Please help me to find the answer.


How it works: http://www.watchthatpage.com/information.jsp


I suspect that they store the entire contents, and every time they check, they compare. If different, send alert, otherwise don't.


There's two ways this can be done just off the top of my head.

The first is to pull the HTML and do a simple string.compare.

The second way, would be to do a HEAD request See, section 9.4 here

0

上一篇:

下一篇:

精彩评论

暂无评论...
验证码 换一张
取 消

最新问答

问答排行榜