Extract Data from .PDF files [duplicate]

2023-02-08 20:25 问答作者：

This question already has answers here: 开发者_运维技巧 Extract Data from .PDF files (4 answers) Closed 8 years ago.

I need to extract data from .PDF files and load it in to SQL 2008. Can any one tell me how to proceed??

You will need to use a PDF library such as iTextSharp to extract the data from the PDF.

At this point, you have the data and can insert it into a database.

Text extraction works good with iText until you don't have a requirement to extract text from columns instead of rows (like Adobe Reader and Foxit Reader do when you copy the text from a PDF document. To extract text column by column the tool need to calculate a position and coordinates for text on a page

The commercial tool ByteScout PDF Extractor SDK capable of doing such text extraction with both row by row and column by column modes for text extraction (or can simply extract data as the structured XML)

DISCLAIMER: I work for ByteScout currently

Extract Data from .PDF files [duplicate]

更多精彩内容

精彩评论

最新问答

央视是哪个频道？

请问买过的朋友，舒提啦旅行箱实际使用体验如何？？

检查不孕不育需要的费用？

海信ULED电视画质有什么不同的地方?？

钉子可以挂的住画框幕布吗？

问答排行榜

王昌瑞《潜梦追凶》剧组庆生新锐演员未来可期？

Is it allowed to ask users to enter credit card details for own payment method?

Escaping "<" in Perl-generated XML

imessage会显示已读吗？

微信重新建群怎么建？

更多精彩内容

精彩评论

最新问答

央视是哪个频道？

请问买过的朋友，舒提啦旅行箱实际使用体验如何？？

检查不孕不育需要的费用？

海信ULED电视画质有什么不同的地方?？

钉子可以挂的住画框幕布吗？

问答排行榜

王昌瑞《潜梦追凶》剧组庆生 新锐演员未来可期？

Is it allowed to ask users to enter credit card details for own payment method?

Escaping "<" in Perl-generated XML

imessage会显示已读吗？

微信重新建群怎么建？

王昌瑞《潜梦追凶》剧组庆生新锐演员未来可期？