Re: [xml] Recovering from errors in an XML "stream"

2019-09-24 Thread Webb Scales
On 9/24/19 5:49 PM, Liam R E Quin wrote: > This isn’t true in general in XML, so beware. That was why I was asking :-) (And, it's why I really want LibXML2 to do as much of the thinking here as possible!) Thanks, Webb -- Webb Scales Principal Software Architect

Re: [xml] Recovering from errors in an XML "stream"

2019-09-24 Thread Webb Scales
get something we read until the ending tag and pause for processing. Eric From: xml [mailto:xml-boun...@gnome.org] On Behalf Of Webb Scales Sent: Monday, September 09, 2019 9:30 PM To: Liam R E Quin; xml@gnome.org Subject: Re: [xml] Recovering from errors in an XML "stream"

Re: [xml] Recovering from errors in an XML "stream"

2019-09-24 Thread Eric Eberhard
, 2019 7:41 PM To: Liam R. E. Quin ; xml@gnome.org Subject: Re: [xml] Recovering from errors in an XML "stream" On 9/7/19 12:37 AM, Liam R. E. Quin wrote: On Fri, 2019-09-06 at 01:57 -0400, Webb Scales wrote: The first issue is that the XML parser seems to balk entirely at

Re: [xml] Recovering from errors in an XML "stream"

2019-09-24 Thread Eric Eberhard
byte blocking and as soon as we get something we read until the ending tag and pause for processing. Eric From: xml [mailto:xml-boun...@gnome.org] On Behalf Of Webb Scales Sent: Monday, September 09, 2019 9:30 PM To: Liam R E Quin; xml@gnome.org Subject: Re: [xml] Recovering from errors
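The "read until the ending tag and pause for processing" approach Eric describes might look roughly like the sketch below. It is written in Python with the standard library rather than libxml2, and the function name `split_documents`, the `</doc>` end tag, and the sample input are all assumptions made for illustration. It also assumes the end tag never appears inside a comment or CDATA section, which is not guaranteed in general XML.

```python
import io
import xml.etree.ElementTree as ET
from typing import Iterator, TextIO


def split_documents(stream: TextIO, end_tag: str,
                    block_size: int = 4096) -> Iterator[str]:
    """Yield one chunk of text per document, cutting after each end tag.

    Hypothetical helper: it does a plain substring search, so it is only
    safe when end_tag cannot occur inside comments or CDATA sections.
    """
    buffer = ""
    while True:
        block = stream.read(block_size)
        if not block:
            break  # end of stream; an incomplete trailing document is dropped
        buffer += block
        while True:
            pos = buffer.find(end_tag)
            if pos < 0:
                break  # no complete document buffered yet, read more
            cut = pos + len(end_tag)
            yield buffer[:cut]      # hand one document to the caller
            buffer = buffer[cut:]   # keep the remainder for the next one


# Two concatenated documents (made-up sample data), read in tiny blocks so
# that the end tag is split across reads.
stream = io.StringIO("<doc><n>1</n></doc>\n<doc><n>2</n></doc>\n")
values = [ET.fromstring(text).findtext("n")
          for text in split_documents(stream, "</doc>", block_size=8)]
print(values)
```

Because the generator only buffers up to the next end tag, memory use is bounded by the largest single document rather than by the whole stream. The repeated search from the start of the buffer is kept for simplicity and would want tightening for very large documents.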

Re: [xml] Recovering from errors in an XML "stream"

2019-09-09 Thread Liam R E Quin
On Tue, 2019-09-10 at 00:29 -0400, Webb Scales wrote: > > If the TextReader didn't insist upon reading beyond the root end-tag, All XML parsers do that, as the spec requires them to check if anything follows it and raise an error if so. Liam -- Liam Quin - web slave for
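The behaviour Liam describes can be seen with any conforming parser. The sketch below uses Python's standard-library parser (expat-based, not libxml2) as a stand-in; the helper name `parses_cleanly` and the sample strings are made up for illustration.

```python
import xml.etree.ElementTree as ET


def parses_cleanly(text: str) -> bool:
    """Return True if text is accepted as a single well-formed document."""
    try:
        ET.fromstring(text)
        return True
    except ET.ParseError:
        return False


single = "<doc><item/></doc>"
# A second root element after the first end tag is a well-formedness
# error, so the parser has to look past the end tag to detect it.
concatenated = "<doc><item/></doc><doc><item/></doc>"

print(parses_cleanly(single))
print(parses_cleanly(concatenated))
```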

Re: [xml] Recovering from errors in an XML "stream"

2019-09-09 Thread Webb Scales
I'm OK with making small on-the-fly "edits" to the input (such as removing the initial comment, or removing all comments), but trying to make my code discern the overall structure (such as picking out the boundaries between the documents) is starting to step over into actually parsing it,

Re: [xml] Recovering from errors in an XML "stream"

2019-09-09 Thread Liam R E Quin
On Mon, 2019-09-09 at 22:41 -0400, Webb Scales wrote: > the fact remains that I don't control the text that I'm trying to parse, and I still need to parse it, even though it's not "well-formed". You may need to write some form of pre-processor that fixes the problems. As you say, that may
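One small pre-processing step of the kind suggested here is dropping comments that precede the XML declaration, the problem raised earlier in the thread. The following is only a sketch in Python with the standard library, not libxml2; `strip_leading_comments` and the sample input are hypothetical, and in a streaming setting the function would be applied only to the head of the input, not to the whole stream.

```python
import re
import xml.etree.ElementTree as ET

# Whitespace and any number of comments at the very start of the text.
_LEADING_COMMENTS = re.compile(r"\A\s*(?:<!--.*?-->\s*)*", re.DOTALL)


def strip_leading_comments(text: str) -> str:
    """Remove comments and whitespace that precede the XML declaration."""
    return _LEADING_COMMENTS.sub("", text, count=1)


# Made-up input: a comment placed before the XML declaration, which a
# conforming parser rejects because the declaration must come first.
raw = "<!-- generated upstream -->\n<?xml version='1.0'?>\n<doc><item/></doc>"

cleaned = strip_leading_comments(raw)
root = ET.fromstring(cleaned)
print(root.tag)
```

This edits the input without trying to understand its overall structure, so it stays on the "small on-the-fly edits" side of the line drawn earlier in the thread.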

Re: [xml] Recovering from errors in an XML "stream"

2019-09-09 Thread Webb Scales
On 9/7/19 12:37 AM, Liam R. E. Quin wrote: On Fri, 2019-09-06 at 01:57 -0400, Webb Scales wrote: The first issue is that the XML parser seems to balk entirely at the fact that the document is preceded by a comment before the XML declaration. (I'm less than shocked, but it is kind of

[xml] Recovering from errors in an XML "stream"

2019-09-05 Thread Webb Scales
Greetings, all. My apologies if this has already been addressed; I had no luck searching the archive. My code is being presented with a stream of XML-like data which looks similar to this: [...] I cannot read it all into memory, because it might be "big" or even "infinite" in