How to index my website [closed]
Want to improve this question? Update the question so it focuses on one problem only by editing this post.
Closed 4 years ago.开发者_StackOverflow社区
Improve this questionI just come across Elastic Search, I have done installation and sample indexing and searchs. Now I want to use this for my website. For that how to create indexes for my website? do I need to store entire web pages content as index??
Thanks, Nehatha
ES doesn't come with a crawler built-in - You will need to write a crawler to index websites. There are plenty of F/OSS crawlers/spiders you can use - Nutch being one of them.
Here is a thread that might be of interest: http://elasticsearch-users.115913.n3.nabble.com/Suggestions-for-places-to-start-for-a-crawler-td900225.html
If your needs are lightweight, you can easily hack together a crawler in your fave language within a few hours.
精彩评论