Hi Sandy, I completely agree that typically during error recovery there is more than one ``Correction'' that can be applied to the stream so that the stream is parseable. Any parser can atmost take a best guess at which correction is to be applied to recover from the error.
To make the best guess is it possible to make the parser look at the DTD and eliminate some of the corrections? In the example I gave : <icon > <small-icon>stillTypingMygif <large-icon>/tmp/large.gif</large-icon> </icon> There are more than one Corrections with which the parser can recover. But the element declarations are: <!ELEMENT icon (small-icon?, large-icon?)> <!ELEMENT small-icon (#PCDATA)> <!ELEMENT large-icon (#PCDATA)> So the best guess in this case should have been to add </small-icon> before <large-icon> Do you think it is possible to improve the recovery by making the parser consider the content model during the error recovery phase? Could you give me some hints as to how I can go about doing this? Thanks! Rajiv --- Sandy Gao <[EMAIL PROTECTED]> wrote: > But how would the parser know the input is not > > <icon > > <small-icon>stillTypingMygif > <large-icon>/tmp/large.gif</large-icon> > </small-icon> > </icon> > > in which case adding </small-icon> before > <large-icon> is worse. > > In dealing with errors (especially well-formedness > ones), the best the > parser can do is to make a guess, which can't be > guaranteed to be the best > in all cases. > > Thanks, > Sandy Gao > Software Developer, IBM Canada > (1-905) 413-3255 > [EMAIL PROTECTED] __________________________________________________ Do you Yahoo!? Yahoo! Tax Center - File online, calculators, forms, and more http://tax.yahoo.com --------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]