Gude Reshma wrote: > how can the user the Xerces DOM Parser to parse an HTML document? an > HTML document is not well formed therfore i'm unable to parse it using > the Xerces parser. I tried to well form the document using HTML Tidy > tool given by W3C but this was not helpful. Any suggestions as to how > i should parse an HTML document using the DOM Parser?
See the mail I sent to the xerces-j-user mailing list on Monday titled "Can Xerces Parse HTML? Yes, It Can!". I attached a simple HTML parser for Xerces2. It's very basic but wanted to get some feedback from people before I put more work into it. -- Andy Clark * [EMAIL PROTECTED] --------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
