开发者

Get plain text from PDF file

I know that there was a few questions about this topic.

I need an framework / script / solution to get plain text from PDF file by php.

Does exist a solution how to get plain text from PDF file?

(And yes, I searched for t开发者_运维问答his hours. This is the last chance).


On linux system, simply use pdftotext (with shell_exec)

echo `pdftotext in.pdf out.txt`;

PS: Take a look on this article, I don't know if it's useful or not.


I believe you might work something out by using the google docs API which handles OCR.

It converts an image to plaintext so you first have to convert the pdf to image with GhostScript and ImageMagick or something else

0

上一篇:

下一篇:

精彩评论

暂无评论...
验证码 换一张
取 消

最新问答

问答排行榜