On Thursday 2004.06.17 00:46:31 -0400, John Cowan wrote:
> Edward H. Trager scripsit:
>
> > What about vim (vi clone: http://www.vim.org)? I just converted
> > a very large UTF-8-encoded HTML document (more than 15000
> > lines) to standards-compliant XHTML-1.0 and found the advanced
> > regular-expression-based substitution facilities in vi(m) absolutely
> > indispensable for adding all of the closing tags that XML requires
> > which were missing in my original document.
>
> HTML Tidy or TagSoup would probably have served you better, rather than
> groveling over the code bit by bit. (HTML Tidy can do more cleaning,
> but it sometimes loops or delivers garbage if the HTML is sufficiently
> broken. TagSoup never gives up and never loops, but doesn't know
> as much about HTML.)

Actually I did use Tidy too.

>
> --
> Said Agatha Christie / To E. Philips Oppenheim    John Cowan
> "Who is this Hemingway? / Who is this Proust?     [EMAIL PROTECTED]
> Who is this Vladimir / Whatchamacallum,           http://www.reutershealth.com
> This neopostrealist / Rabble?" she groused.       http://www.ccil.org/cowan
>         --author unknown to me; any suggestions?