How to write a file indexing software in java?
I want to write a software to search my hard disk for pdf files and index them in a library by analysing the content .
P开发者_开发问答lease help me how to achieve it using Java platform.
You should look at Lucene, it is THE indexing and searching framework in Java.
For indexing PDF documents, you can use PDFBox that integrates nicely with Lucene.
May be a simpler solution for PDF and other document files take a look at Tika.
This open source project might be a start for you. It uses Apache Lucene and PDFBox:
- https://github.com/WolfgangFahl/pdfindexer
精彩评论