how to convert PDF to XML
I am facing problem finding any solution for converting my PDF to HTMl/xml.This requirement comes because I have a certificate as a P开发者_Go百科DF form.I need to make this certificate available to the user in the form of a HTML page(specifically using JSF). User can edit this form and can update the values.These values will be read by backing bean so that I can be saved in the database.Problem is these certificates kept on changing after 3 months or so. So if I go by trivial approach of writing all the components in the HTML, I will be kind of hell to maintain and to change. So I want that if there is any JAVA utility which I can tweak so that I can render the page using XML.
Looking forward for the answer. thanks in advance!!!
You could give this a try: pdftohtml. It looks pretty old, latest release is from 2006.
If you need just the raw text: consider using itext, parse the raw text and fill it into a xml document.
精彩评论