How to write file indexing software in Java?

I want to write software that searches my hard disk for PDF files and indexes them in a library by analysing their content.

Please help me work out how to achieve this on the Java platform.


You should look at Lucene; it is THE indexing and searching framework in Java.

For indexing PDF documents, you can use PDFBox, which integrates nicely with Lucene.
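Before PDFBox or Lucene come into play, the PDF files have to be found on disk. Here is a minimal sketch of that discovery step using only the JDK (`java.nio.file`). The class name `PdfFinder` and the method `findPdfs` are made up for illustration. Text extraction and indexing are deliberately left out; each returned path would be handed to PDFBox and Lucene.

```java
import java.io.IOException;
import java.nio.file.FileVisitResult;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.nio.file.SimpleFileVisitor;
import java.nio.file.attribute.BasicFileAttributes;
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;

// Hypothetical helper: collects the PDF files under a directory tree.
public class PdfFinder {

    /** Returns every regular file under root whose name ends in ".pdf", ignoring case. */
    public static List<Path> findPdfs(Path root) throws IOException {
        List<Path> found = new ArrayList<>();
        Files.walkFileTree(root, new SimpleFileVisitor<Path>() {
            @Override
            public FileVisitResult visitFile(Path file, BasicFileAttributes attrs) {
                String name = file.getFileName().toString().toLowerCase(Locale.ROOT);
                if (attrs.isRegularFile() && name.endsWith(".pdf")) {
                    found.add(file);
                }
                return FileVisitResult.CONTINUE;
            }

            @Override
            public FileVisitResult visitFileFailed(Path file, IOException exc) {
                // Skip unreadable files and directories instead of aborting the whole scan.
                return FileVisitResult.CONTINUE;
            }
        });
        return found;
    }

    public static void main(String[] args) throws IOException {
        // Scan the directory given on the command line, or the current directory.
        Path root = Paths.get(args.length > 0 ? args[0] : ".");
        for (Path pdf : findPdfs(root)) {
            System.out.println(pdf);
        }
    }
}
```

The sketch uses `Files.walkFileTree` rather than `Files.walk` because a scan of a whole hard disk will almost certainly hit directories it is not allowed to read, and `visitFileFailed` lets the walk continue past them. Matching on the file extension is an assumption; files without a `.pdf` suffix are missed.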


For a possibly simpler solution that handles PDF and other document formats, take a look at Tika.


This open source project might be a start for you. It uses Apache Lucene and PDFBox:

  • https://github.com/WolfgangFahl/pdfindexer