Website downloader library

I'm putting together a little project for myself, and I need some functionality to download a page for offline viewing. Is there a library that will download a given page and its embedded images, and edit the img tags to point to the local copies of the images?

I know there are a lot of website downloaders out there, but I can't find something that I can use directly in my code.

I have some basic scripts done in Python, so Python is very welcome, but pretty much any language will do.


Yes: BeautifulSoup plus Python's urllib module.
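Roughly how the two fit together, as a minimal sketch rather than a finished tool. It assumes Python 3 and the bs4 package; save_page, fetch, the fetcher parameter, the numbered file names and index.html are all made-up names for illustration. It only handles img tags, so no CSS, scripts or srcset.

```python
import os
from urllib.parse import urljoin, urlparse
from urllib.request import urlopen

from bs4 import BeautifulSoup


def fetch(url):
    """Return the raw bytes at url (plain urllib, no retries or custom headers)."""
    with urlopen(url) as response:
        return response.read()


def save_page(page_url, out_dir, fetcher=fetch):
    """Download page_url and its <img> files into out_dir; return the HTML path.

    fetcher is a hypothetical hook so the download step can be swapped out.
    """
    os.makedirs(out_dir, exist_ok=True)
    soup = BeautifulSoup(fetcher(page_url), "html.parser")

    for index, img in enumerate(soup.find_all("img", src=True)):
        image_url = urljoin(page_url, img["src"])  # resolve relative src values
        # Prefix with a counter so two images with the same basename don't collide.
        name = os.path.basename(urlparse(image_url).path) or "image"
        local_name = f"{index}_{name}"
        with open(os.path.join(out_dir, local_name), "wb") as f:
            f.write(fetcher(image_url))
        img["src"] = local_name  # point the tag at the local copy

    html_path = os.path.join(out_dir, "index.html")
    with open(html_path, "w", encoding="utf-8") as f:
        f.write(str(soup))
    return html_path
```

Something like save_page("http://example.com/", "offline_copy") should then leave index.html and the images side by side in offline_copy, with the img tags rewritten to the local file names.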


You're looking for BeautifulSoup.


How about the Python web crawler? http://code.google.com/p/pywebcrawler/

Or Anemone (Ruby)? http://anemone.rubyforge.org/


The simplest solution I can think of:

wget -p example.com