On Tue, Dec 08, 2009 at 05:47:31PM +0200, Lia wrote: > Hi, > > I have a problem related to comments while using mod_proxy_html (version > 3.0.1) with libxml2 (version 2.7.6). > The problem consists in dropping some comments from HTML code while parsing > for URL rewrite. > Having a short look in code, I suspect the unexpected behaviour comes from > libxml, but I am not 100% sure, please forgive the out of topic comment, if > no so. > > When having an HTML document in form: > > <html xmlns="http://www.w3.org/1999/xhtml "> > <head> > <meta name="description" content="....."/>
here > .. That piece of non-space text ends the head and open the <body> > <!--[if lte IE 6]> > <link href="..." rel="StyleSheet" type="text/css" media="all" /> > <![endif]--> > <script type="text/javascript" src="..."></script> > <!--[if lte IE 6]> > <script type="text/javascript" src="..."></script> > <![endif]--> > </head> > ... > </html> > > After parsing with SAX, the first comment is dropped, only the comment found > as last child in the <head> element is preserved. > Does anyone know if dropping comments inside tags, is a problem related to > libxml? Except that the HTML prser seems to do its job paphio:~/tmp -> xmllint --html tst.html tst.html:12: HTML parser error : Unexpected end tag : head </head> ^ <!DOCTYPE html PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN" "http://www.w3.org/TR/REC-html40/loose.dtd"> <html xmlns="http://www.w3.org/1999/xhtml "> <head><meta name="description" content="....."></head> <body><p> .. <!--[if lte IE 6]> <link href="..." rel="StyleSheet" type="text/css" media="all" /> <![endif]--> <script type="text/javascript" src="..."></script><!--[if lte IE 6]> <script type="text/javascript" src="..."></script> <![endif]--> ... </p></body> </html> paphio:~/tmp -> oh and the extra space at the end of your xmlns definition makes your data not XHTML ! Daniel -- Daniel Veillard | libxml Gnome XML XSLT toolkit http://xmlsoft.org/ dan...@veillard.com | Rpmfind RPM search engine http://rpmfind.net/ http://veillard.com/ | virtualization library http://libvirt.org/ _______________________________________________ xml mailing list, project page http://xmlsoft.org/ xml@gnome.org http://mail.gnome.org/mailman/listinfo/xml