On 1 Sep 2007, at 10:59, David Chisnall wrote: > On 1 Sep 2007, at 04:56, Yen-Ju Chen wrote: > >> Currently TRXML will fails on something like this: >> <img width=64 /> because the value is not quated. >> What's the chance to fix it in TRXML without major changes ? >> Another way I can think is to pre-process such text before feeding >> into the parser. > > It probably wouldn't be too hard to do. XML requires attributes to > be quoted, but TRXML cheats a bit and allows them to be quoted by any > character, so it would try to parse this as 4 quoted by 6s, and fail > because it can't find the closing 6.
Apparently I already fixed this behaviour, and in XML mode it parses things correctly. > The fix would be (in SGML mode) > to check that the quote is a quote character, and skip to the next > space if it isn't. I'll have a look at doing that now. I have done this. It will now parse your example correctly, but with one caveat: Things like this: <img size=64> will still break. Getting it to detect a space, a / or a > as the end of the attribute would require some slightly larger changes to the parser. Feel free to have a play. David _______________________________________________ Etoile-dev mailing list [email protected] https://mail.gna.org/listinfo/etoile-dev
