Re: [CODE4LIB] tidy

2006-06-06 Thread Eric Lease Morgan
On Jun 6, 2006, at 4:40 PM, Chris Gray wrote: The Sourceforge page for Tidy has links to the bindings for various languages (Perl included) and also note that there is a link to a separate sourceforge project for an Apache mod for tidy. The mod_tidy might be cool

Re: [CODE4LIB] tidy

2006-06-06 Thread Eric Lease Morgan
On Jun 6, 2006, at 4:35 PM, Thomas Dowling wrote: - It looks like something's in the works, but not yet a going concern. I'm not above a system call myself: tidy -asxml html4doc.html > xhtml1doc.html Aside from assurances of well-formedness, is there a p

Re: [CODE4LIB] tidy

2006-06-06 Thread Chris Gray
Eric, The Sourceforge page for Tidy has links to the bindings for various languages (Perl included) and also note that there is a link to a separate sourceforge project for an Apache mod for tidy. Chris On Tue, 6 Jun 2006, Eric Lease Morgan wrote: Is there a coo

Re: [CODE4LIB] tidy

2006-06-06 Thread Dan Scott
You could shell out to the original HTML Tidy (http://tidy.sourceforge.net/ * I've used that very successfully in past lives) or get the HTML::Tidy Perl module (can't vouch for it personally, but it seems to be the Perl version of what you're looking for). Dan Systems Librarian, Bibliothèque J

Re: [CODE4LIB] tidy

2006-06-06 Thread Thomas Dowling
On 6/6/2006 4:23 PM, Eric Lease Morgan wrote: > Is there a cool Tidy (Perl) module that will convert (force) dirty > HTML documents into XHTML documents? I took a look at CPAN but I > wasn't sure which one(s) do the job. > - It looks like something's in the

Re: [CODE4LIB] tidy

2006-06-06 Thread Roy Tennant
Eric, I can't speak to the Perl module aspect, but tidy itself has a switch that tries to do that: -asxhtml convert HTML to well formed XHTML But it won't do such things as dump tables used only for formatting purposes and generate an appropriate stylesheet, so the output will not nece

[CODE4LIB] tidy

2006-06-06 Thread Eric Lease Morgan
Is there a cool Tidy (Perl) module that will convert (force) dirty HTML documents into XHTML documents? I took a look at CPAN but I wasn't sure which one(s) do the job. -- Eric "()()()()()" Morgan