El 27/02/2005 a las 14:22, roberto_andrich <[EMAIL PROTECTED]>
dijo, en su mensaje "Problems converting htm file":

> 1)Do you know if there is problems when converting *.htm files, created
> using Microsoft Word 2000, to plucker format? 
> 2)How can I work around this problem?

The problem, with all probability, is due to the lousy conversion
system included in Word. It includes all sort of non-HTML constructs
on which Plucker croaks invariably.

The possible solutions involve in all cases additional tools:

* Download Tidy <http://tidy.sourceforge.net/> and use it to process
  every document and convert it in a standard-compliant version. Tidy
  even has a special setting to deal with the cruft included in files
  converted by Word 2000.

* Download and install Mozilla and open and resave every file with the
  Mozilla Composer application (which should at least convert them to a
  more standard compliant version)

* Download and install OpenOffice and use a Python script running
  PyUNO to reconvert automatically all documents (I adapted
  successfully a script used for other means and I happily
  batch-convert doc and rtf files to HTML to be able to "pluck" them
  afterwards). I have a script I can offer to you to this purpose;
  OpenOffice includes a Python interpreter which you use to run this
  script.

The easiest, of course, seems to be the first one. For the long term,
however, I would recommend you to consider the last option.
  
-- 
                    o-=< Marcelo >=-o

elegante. Elefante de fiesta.
  --Del "Bichonario" (Gim�nez/Wright)

_______________________________________________
plucker-list mailing list
[email protected]
http://lists.rubberchicken.org/mailman/listinfo/plucker-list

Reply via email to