开发者

Microsoft Powerpoint Python Parser [closed]

As it currently stands, this question is not a good fit for our Q&A format. We expect answers to be supported by facts, references, or expertise, but this question will likely solicit开发者_开发问答 debate, arguments, polling, or extended discussion. If you feel that this question can be improved and possibly reopened, visit the help center for guidance. Closed 9 years ago.

I am looking for a python based microsoft office parser - specifically powerpoint.

I want to be able to parse PPT in python and extract things like text and images from the powerpoint file.

Is there a library available?


I don't think there is such a library.

What you can do is use pywin32 package to access PowerPoint's COM.

Here is a very nice introduction to using the win32com module to automate tasks in PowerPoint someone has written: http://www.s-anand.net/blog/automating-powerpoint-with-python/


You might find such a beast, but I'd bet against it; you're looking for two rare properties together.

You might consider instead using the Open Office SDK, which already has vast amounts of machinery to read power point files, and abuse it for your purposes. This is all Java, not Python, but my guess is the learning curve to learn Java is much smaller than the learning curve to figure out how to read PowerPoint files.

0

上一篇:

下一篇:

精彩评论

暂无评论...
验证码 换一张
取 消

最新问答

问答排行榜