Reading more of an RSS Feed with Java

I'm writing a little Java application to search RSS feeds for news I care about (I don't really care whether such an app already exists; I'm writing it mostly for learning purposes). I'm currently using the information I found here to build the reader. The code is great and all, but I've run into a problem. RSS feeds, like for example here: only contain the last few entries, whereas Google Reader shows the entire history of a feed as far back as I want. How would I go about reading further back in an RSS feed to get more entries?


You can only get the data provided.

Google have the history because they have been polling the feed for a long period of time and kept all the articles that have ever appeared in it.


RSS feeds are just another way of formatting content in an agreed-upon standard. Each time your application requests the content from the RSS feed URL, a web server processes that request and returns the content (presumably formatted to some RSS XML standard).

If you want to make sure you don't miss any items, then you will have to poll the RSS feed regularly and keep your own history.
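
As a rough illustration of keeping your own history, here is a minimal sketch. It assumes the feed is RSS 2.0 with `<item>` elements that carry a `<guid>` or `<link>`. The class name `FeedHistory` and the method `merge` are made up for this example. Fetching the XML from the feed URL, scheduling the polls, and saving the history to disk are left out.

```java
import java.io.StringReader;
import java.util.LinkedHashMap;
import java.util.Map;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Element;
import org.w3c.dom.NodeList;
import org.xml.sax.InputSource;

// Hypothetical helper: accumulates every item ever seen across repeated polls.
class FeedHistory {
    // Key: guid (or link if there is no guid). Value: item title.
    private final Map<String, String> seen = new LinkedHashMap<>();

    /** Parses one polled RSS document, stores unseen items, returns how many were new. */
    public int merge(String rssXml) {
        Document doc;
        try {
            doc = DocumentBuilderFactory.newInstance().newDocumentBuilder()
                    .parse(new InputSource(new StringReader(rssXml)));
        } catch (Exception e) {
            throw new IllegalArgumentException("Could not parse feed XML", e);
        }
        NodeList items = doc.getElementsByTagName("item");
        int added = 0;
        for (int i = 0; i < items.getLength(); i++) {
            Element item = (Element) items.item(i);
            String key = text(item, "guid");
            if (key.isEmpty()) {
                key = text(item, "link");
            }
            if (key.isEmpty()) {
                continue; // nothing to identify the item by, so skip it
            }
            // putIfAbsent returns null only when the key was not stored before
            if (seen.putIfAbsent(key, text(item, "title")) == null) {
                added++;
            }
        }
        return added;
    }

    public int size() {
        return seen.size();
    }

    // Text of the first child element with the given tag, or "" if absent.
    private static String text(Element parent, String tag) {
        NodeList nodes = parent.getElementsByTagName(tag);
        return nodes.getLength() == 0 ? "" : nodes.item(0).getTextContent().trim();
    }

    public static void main(String[] args) {
        FeedHistory history = new FeedHistory();
        // Two simulated polls; the feed has dropped item "a" by the second one.
        String first = "<rss><channel>"
                + "<item><guid>a</guid><title>First</title></item>"
                + "<item><guid>b</guid><title>Second</title></item>"
                + "</channel></rss>";
        String second = "<rss><channel>"
                + "<item><guid>b</guid><title>Second</title></item>"
                + "<item><guid>c</guid><title>Third</title></item>"
                + "</channel></rss>";
        System.out.println(history.merge(first));  // 2 new items
        System.out.println(history.merge(second)); // 1 new item
        System.out.println(history.size());        // 3 items kept in total
    }
}
```

In a real reader you would call `merge` with the body downloaded from the feed URL every so often. How often depends on how quickly the feed rotates items out.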


Since you mentioned learning purposes, you could go totally nuts and use Nutch to crawl and index the RSS content you are looking for. It's actually pretty easy to set up if you run it on a single node rather than on a Hadoop cluster. You can also use Droid to spider the content.

Or just a simple RSS parser http://www.vogella.de/articles/RSSFeed/article.html#overview_java :)
