Replacing images in PDF documents with Python?

2023-03-01 03:54 问答作者：

We generate PDF documents with RGB images stored in a CMS.

As part of the PDF processing we sometimes have the need to conver开发者_StackOverflow社区t the RGB images to CMYK (for print productions).

Converting the images from RGB to CMYK seems to be feasible with Python using LittleCMS and the PyLittleCMS bindings (plus the ICC color profiles for the RGB input and CMYK output device).

However is there some Python-based option to iterate over the images inside a PDF, extracting the image data and replacing them with the processed CMYK variants?

I don't think there's any free Python tools that do exactly what you want. Here are some options:

PoDoFo doesn't have mature Python bindings but can read and write PDFs, has support for PDF images and color spaces.

PDFMiner is a pure-Python PDF parser but it doesn't do much with images. This is a start, but would probably take quite a bit of work to do what you want.

The commercial version of ReportLab may be able to do what you want with PageCatcher; I haven't used it in a few years but you might investigate it. (The free ReportLab only writes PDFs, it doesn't read them.)

继续阅读：cmyk pdf python rgb

Replacing images in PDF documents with Python?

更多精彩内容

精彩评论

最新问答

央视是哪个频道？

请问买过的朋友，舒提啦旅行箱实际使用体验如何？？

检查不孕不育需要的费用？

海信ULED电视画质有什么不同的地方?？

钉子可以挂的住画框幕布吗？

问答排行榜

河神2九牛入海钓河妖是第几集河妖什么来历可活吞牛？

Easiest way to get words of one line from istream into a vector?

Infinite gtk warnings when I right click on the icon

Best solution for private video database [closed]

国内夏季避暑旅游胜地有哪些？