How to design a unit test for generating a PDF document?

Asked 2023-01-22 08:31

I'm late to the party when it comes to unit testing and am still trying to figure out best practices. My question is: given a class that is responsible for generating a PDF (or Doc/HTML/XML/etc.), how would I go about testing that the final output file is correct? For a text-based file (XML), I figure I could just check whether the strings match, but what about a binary file (PDF)? Should I just check against an MD5 hash? Should I even be testing this?

Thanks in advance.

I use PDFBox to extract the text from the generated PDF and check whether it contains the data it should. This doesn't check whether the data is in the correct place, but I don't go that deep with PDF testing. You need to think about how deep you want to go: the deeper you go, the more time you will spend fixing the tests after a change. (I have never had a bug where text was in the wrong place, and maybe that's why I don't test for it.)
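A minimal sketch of the "does the extracted text contain the data" check, written in Python with the standard library only. The extraction step itself is not shown: it is assumed that a library such as PDFBox (via its `PDFTextStripper`) has already turned the PDF into a plain string. The helper names `normalize` and `missing_fragments` and the sample invoice text are hypothetical, made up for illustration. Whitespace is collapsed before comparing, since extracted text often has line breaks in unexpected places.

```python
import re
from typing import List, Sequence


def normalize(text: str) -> str:
    """Collapse every run of whitespace to one space and lower-case the text."""
    return re.sub(r"\s+", " ", text).strip().lower()


def missing_fragments(extracted_text: str, expected: Sequence[str]) -> List[str]:
    """Return the expected fragments that do NOT occur in the extracted text."""
    haystack = normalize(extracted_text)
    return [frag for frag in expected if normalize(frag) not in haystack]


# Hypothetical text, as an extractor might return it (arbitrary line breaks).
sample = "Invoice  No. 1042\nCustomer:\n  Jane Doe\nTotal due:   99.50 EUR"

print(missing_fragments(sample, ["Invoice No. 1042", "Jane Doe", "Total due: 99.50 EUR"]))  # []
print(missing_fragments(sample, ["John Smith"]))  # ['John Smith']
```

In a unit test, the assertion would be that `missing_fragments(...)` returns an empty list; returning the missing items rather than a boolean makes the failure message say which data was absent.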

Another way would be to use the same PDF library you use to write the file to read it back, or to use something like iText if you generate the PDF from a template using some framework.

For mission-critical PDFs (e.g. those sent out to a customer), I don't think checking the text is enough. You'd want to check layout, font sizes, text wrapping, etc., for the same reasons that we use Selenium to check web pages.

I took the approach of turning the PDF into an image, and comparing that image against a known "correct" image. Our PDFs didn't change very often, and didn't contain anything that changed over time (e.g. "today's" date). So this approach worked well - using the same input data, we could always generate the same output PDF.
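The comparison step could look roughly like the sketch below. It assumes each page has already been rendered to an image (for example with a tool such as `pdftoppm`) and flattened into a sequence of 8-bit grayscale pixel values; that rendering step is not shown. The function names and the default thresholds (`tolerance=8`, `max_fraction=0.001`) are assumptions picked for illustration, not values from the approach described above.

```python
from typing import Sequence


def differing_fraction(actual: Sequence[int], reference: Sequence[int],
                       tolerance: int = 8) -> float:
    """Fraction of pixels whose gray values differ by more than `tolerance`."""
    if len(actual) != len(reference):
        raise ValueError("page images differ in size")
    if not actual:
        return 0.0
    differing = sum(1 for a, r in zip(actual, reference) if abs(a - r) > tolerance)
    return differing / len(actual)


def pages_match(actual: Sequence[int], reference: Sequence[int],
                tolerance: int = 8, max_fraction: float = 0.001) -> bool:
    """True if at most `max_fraction` of the pixels differ noticeably."""
    return differing_fraction(actual, reference, tolerance) <= max_fraction


# Hypothetical pixel data standing in for rendered pages.
reference = [255] * 1000                 # all-white "known correct" page
almost_same = [253] * 1000               # tiny rendering noise
shifted = [255] * 900 + [0] * 100        # a block of content in the wrong place

print(pages_match(almost_same, reference))  # True
print(pages_match(shifted, reference))      # False
```

A small tolerance is used instead of exact equality because anti-aliasing can vary slightly between renderer versions; with a fully deterministic renderer, both thresholds could be set to zero.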

I think PDFUnit now has built-in support for doing this, plus a lot more: http://www.pdfunit.com/en/documentation/java/testscope/rendered-pages.html
