You could have your users upload  MSWord documents and do the >html
conversion for them on the server using something like wvware.

-----Original Message-----
From: Philip M. Gollucci [mailto:[EMAIL PROTECTED]]
Sent: Wednesday, January 23, 2002 10:23 AM
To: [EMAIL PROTECTED]; [EMAIL PROTECTED]; [EMAIL PROTECTED];
[EMAIL PROTECTED]; [EMAIL PROTECTED]
Subject: MS+HTML -> Unix


Say I have a webpage where I want to offer people the ability to upload
either a .txt or a .html file.  Now these people basically are computer
illierate, and don't even konw that UNIX is different from Microsh$t.

At anyrate, they will use "Save as (HTML) from MSWord 97/2000, "Save as
(txt)", or worse yet, "Save as RTF".
Then upload that.

Big surprise it gets it really wrong basically meaning it doesn't format
correctly before or after they use the site in any Browser.
One file, tidy told me had over 300 errors and that was just with HTML4.01
not XHTML1.0.

Is there anyway I can on the fly take the messed up HTML file I get and
covert it to what they meant to give me.

Important cases :
  Parrell Columns not in a table
  Bullets
  <DIR> tags
  actually closing <u> tags so the whole page isn't underlined.

I've see the demoronizer port, but don't know that much about it, and I
don't think its quite what I want.

Basically I have to take html given me and make the html they mean.


Any Great Ideas


END
----------------------------------------------------------------------------
--
Philip M. Gollucci (p6m7g8) [EMAIL PROTECTED] 301.314.3118

Science, Discovery, & the Universe (UMCP)
        Webmaster & Webship Teacher
        URL: http://www.sdu.umd.edu

EJPress.com
        Database/PERL Programmer & System Admin
        URL : http://www.ejournalpress.com

Resume      : http://www.p6m7g8.com/resume.txt



Reply via email to