PDF text extractor class in PHP

Is there a PHP class that extracts all the text from a PDF file, so that I can store it in a MySQL database? My PDF contains many kinds of elements: images, tables, plain text, form elements, charts, etc.

I have looked at many text-extraction classes over the last two days, but none of them extracts the complete text from the PDF.

I want to extract all the text from a given PDF file, even when the text is inside a table.

Does anyone know of such a class? :)

Thanks a lot. Have a nice day :)


See the link below:

Reading the clean text from PDF with PHP


If you are running this on a Linux server, you could try pdf2text: call it via exec, then grab the contents of the output file.

Note that several PDF-to-text scripts are around, and the quality of the results varies from one to another.
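
The exec approach could look like the sketch below. It makes some assumptions: it uses `pdftotext` from poppler-utils (swapped in here because its options are well documented; it is not necessarily the tool meant above), it assumes a Linux shell, and the function names `buildPdfToTextCommand` and `extractPdfText` are hypothetical. It reads the text from standard output instead of a temporary output file.

```php
<?php
// Hypothetical helper: builds the shell command for poppler's pdftotext.
// "-layout" asks the tool to preserve the physical layout (useful for tables),
// and the trailing "-" sends the extracted text to stdout instead of a file.
function buildPdfToTextCommand(string $pdfPath): string
{
    return 'pdftotext -layout ' . escapeshellarg($pdfPath) . ' -';
}

// Hypothetical helper: runs the command and returns the extracted text.
function extractPdfText(string $pdfPath): string
{
    if (!is_readable($pdfPath)) {
        throw new InvalidArgumentException("Cannot read file: $pdfPath");
    }

    // exec() fills $lines with one entry per output line
    // (trailing whitespace on each line is stripped by PHP).
    exec(buildPdfToTextCommand($pdfPath), $lines, $exitCode);

    if ($exitCode !== 0) {
        throw new RuntimeException("pdftotext failed with exit code $exitCode");
    }

    return implode("\n", $lines);
}
```

Calling `extractPdfText('/path/to/file.pdf')` returns a string that can then be saved with a prepared statement. As noted above, how complete the text is depends on the tool and on how the PDF was produced.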


I've tested many command-line programs, but none of them gives a 100% result. So I've started my own library in PHP:

https://github.com/smalot/pdfparser

Currently it is text-oriented, but image support is planned.

If you encounter issues, please send me your PDF and, if possible, describe how you created it.
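
A minimal usage sketch for this library, assuming it was installed with `composer require smalot/pdfparser` and that `vendor/autoload.php` sits next to the script. The wrapper function `extractTextWithPdfParser` is a hypothetical name used only for illustration.

```php
<?php
// Load Composer's autoloader if it is present (assumed location).
$autoload = __DIR__ . '/vendor/autoload.php';
if (is_file($autoload)) {
    require $autoload;
}

// Hypothetical wrapper around the library's Parser class.
function extractTextWithPdfParser(string $pdfPath): string
{
    if (!class_exists(\Smalot\PdfParser\Parser::class)) {
        throw new RuntimeException('smalot/pdfparser is not installed');
    }

    $parser = new \Smalot\PdfParser\Parser();
    $pdf = $parser->parseFile($pdfPath); // parse the whole document

    return $pdf->getText();              // text from all pages
}
```

The returned string can be stored in a MySQL text column with a prepared statement. Text inside images or charts will not be included, since the library is currently text-oriented.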
