PHP Based Web Crawler or JAVA Based Web Crawler [closed]
Want to improve this question? Update the question so it focuses on one problem only by editing this post.
Closed 6 years ago.
Improve this questioni have some doubt about PHP based web crawlers,can it run like the java thread based one? i am asking it b开发者_如何学Cecause, in java the thread can be executed again and again, i dont think, PHP have something like thread function, can you guys please say, which web crawler will be more use full?A PHP Based or A Java Based
Instead of writing your own use one of the following. Btw, Java based web crawlers are preferred. My fav Nutch.
Java based: Nutch, Heritrix, JSpider, JoBo (simple crawler)
PHP based: PHPCrawl
In general, you will need to jump through more hoops to run long-running tasks in PHP, as it's much more of an request/response-based setup.
精彩评论