Hi, Karl Dubost wrote: > Nick Kew weighed in and proposed that we should target [6]libxml > which includes an HTML parser and is already supported by Apache > server and many other tools. > > [6] http://xmlsoft.org/html/libxml-HTMLparser.html > > From here it would be interesting to implement HTML 5 parsing > algorithm into libxml2. It would benefit the community as large.
Have you tried joining forces with the people who started the C implementation of html5lib? Maybe they have ideas to contribute or (partially) working code that you can look at. It may even happen that you get them convinced of the project. In any case, having working implementations in Python and Java should get you a lot closer to your goal by looking under the hood. Stefan _______________________________________________ xml mailing list, project page http://xmlsoft.org/ [email protected] http://mail.gnome.org/mailman/listinfo/xml
