On Mon, 2013-02-25 at 23:37 -0500, Anil Jangam wrote: > Team, > > > I observed that HTML parser (hubbub-0.1.2) is breaking when it finds a > SEMICOLON in the text field. I am giving below an example of the text > string.
[...] > <meta http-equiv="content-type" content="text/html; charset=UTF-8" /> > When it finds the ';', it stops working. When I remove this ';' from > the string, it works fine. Can you please check, if this is an issue > with the parser or if I am missing anything? Can you explain what you mean by "stops working"? The output below is exactly what I would expect to see, given the input, above. > ELEMENT meta > ATTRIBUTE http-equiv > TEXT > content=content-type > ATTRIBUTE content > TEXT > content=text/html; charset=UTF-8 J.
