I am using ExtractingRequestHandler in Solr for getting document content and index it. It works fine for all Microsoft Documents, but for PDFs, the content being extracted is empty. I have also tried
I know there are alre开发者_运维知识库ady objects supporting Office 2007 files, but is there any native Office 2003 or earlier support ?There doesn\'t seem to be anything bundled with Zend_Search_Luce