> Is there some reason you really want to convert to PDF first? You can
> get much better HTML right from the Word doc. You'll lose a lot of info
> going from PDF to HTML.
Right now, two reasons: Printing to PDF allows me to create the PDF "for
the web" which means it has a much smaller filesize n
google won't do a good job with .doc files but they may do pdf to html
and back.. It's per each I just mentioned it to make fun of them here
is my resume converted from a monster.com .doc file
http://docs.google.com/View?docid=dftrj73t_3cfwjdv
[EMAIL PROTECTED] wrote:
> Alexander Klingenstein w
Alexander Klingenstein wrote:
> I need to take a bunch of .doc files (word 2000) which have a little text
> including some tables/layout and mostly pictures and comvert them to a pdf
> and extract the text and images > separately too. If I have a pdf, I can do
> create the html with pdftohtml ca
Alexander Klingenstein wrote:
> I need to take a bunch of .doc files (word 2000) which have a little text
> including some tables/layout and mostly pictures and comvert them to a pdf
> and extract the text and images separately too. If I have a pdf, I can do
> create the html with pdftohtml call
I need to take a bunch of .doc files (word 2000) which have a little text
including some tables/layout and mostly pictures and comvert them to a pdf and
extract the text and images separately too. If I have a pdf, I can do create
the html with pdftohtml called from python with popen. However I n