Using Zend Lucene to search Office 2003 or older files
I know there are alre开发者_运维知识库ady objects supporting Office 2007 files, but is there any native Office 2003 or earlier support ?
There doesn't seem to be anything bundled with Zend_Search_Lucene
, for those.
Still, considering it can index HTML documents, if you can find a way to convert your Office 2003 documents to HTML (at least, for indexing -- keeping to original version alonside the HTML one, for consultation), you might be able to index those...
I would recommend indexing the documents with Solr and Tika together and using JSON to search your Solr/Lucene index from PHP. See the ExtractingRequestHandler (Solr wiki page) article for more information.
精彩评论