Python separating images and text from MS office files
Is there a way to separate the text and images from MS Office files like word, excel, ppt etc. and getting the position of the image in a document( where the image starts in the document between text)?
The application needs to be developed for Linux bo开发者_如何学编程x.
Please suggest.
You may want to look at the Python UNO bindings for OpenOffice - wiki at http://wiki.services.openoffice.org/wiki/Python - this should let you open and work with MSOffice docs on linux.
What exactly are you trying to accomplish - a new way to HTML-ize Office docs?
精彩评论