Well, one obvious problem is that you have to read the whole document into memory first, which clearly isn't good enough for large documents.
I think that depends on the type of XML library we create. A SAX library doesn't require the whole document in memory; a DOM library typically does, since, from what I can tell, it builds an in-memory, tree-like representation. If you don't read the document into memory, I'm not sure how you could, for example, write XPath queries that access random, scattered nodes in a relatively efficient manner. I say relatively because, yes, the in-memory layout can be very scattered, but it's still better than having to perform random access from disk.
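To make the trade-off concrete, here's a minimal sketch using Python's stdlib as a stand-in (we're talking about a hypothetical D library, so treat this purely as an illustration of the two models): the SAX side streams events and keeps only a counter, while the DOM side holds the whole tree and gets cheap random access in return.

```python
import xml.sax
from xml.etree import ElementTree

doc = "<root><item id='1'>a</item><item id='2'>b</item></root>"

# SAX-style: react to parse events, keep nothing but what you need.
class ItemCounter(xml.sax.ContentHandler):
    def __init__(self):
        self.count = 0

    def startElement(self, name, attrs):
        if name == "item":
            self.count += 1

handler = ItemCounter()
xml.sax.parseString(doc.encode(), handler)
print(handler.count)  # 2

# DOM-style: the whole tree lives in memory, so querying scattered
# nodes is just pointer-chasing rather than re-reading from disk.
tree = ElementTree.fromstring(doc)
print([e.get("id") for e in tree.findall("item")])  # ['1', '2']
```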
I guess one question we need to ask is: what do we expect from this library? Do we want a full DOM implementation, or is a SAX parser good enough? Or do we need something in between? In PHP or Perl (perhaps both), I saw a library where an XML document was essentially transformed into nested associative arrays. It made it very easy to read data from the XML, though I don't know how closely it complied with the official standards.
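That associative-array style can be sketched in a few lines (again in Python for illustration; `to_nested` is a name of my own, and the simplifications — attributes dropped, repeated tags overwriting each other — are exactly the standards-compliance gaps such libraries tend to have):

```python
from xml.etree import ElementTree

def to_nested(elem):
    """Collapse an XML element into nested dicts (hypothetical helper).

    Leaf elements become their text; everything else becomes a dict
    keyed by child tag names. Attributes and duplicate tags are lost,
    which is the usual price of this convenience.
    """
    children = list(elem)
    if not children:
        return elem.text
    return {child.tag: to_nested(child) for child in children}

doc = "<config><db><host>localhost</host><port>5432</port></db></config>"
root = ElementTree.fromstring(doc)
print(to_nested(root))
# {'db': {'host': 'localhost', 'port': '5432'}}
```

Reading a value is then just `to_nested(root)["db"]["host"]` — no new API to learn, which is the whole appeal.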
The current std.xml looks like it tries to be both a DOM library and a SAX library. Personally, I'd rather break them up into two libraries, though it may make sense for the DOM library to leverage the SAX library to build up its objects.
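That "DOM on top of SAX" layering is straightforward: the DOM layer is just a SAX handler that pushes and pops a stack of nodes as elements open and close. A minimal sketch (Python stdlib again; the dict-based node shape is my own invention, not std.xml's):

```python
import xml.sax

class TreeBuilder(xml.sax.ContentHandler):
    """Builds a simple node tree from SAX events.

    A real DOM layer would produce proper node objects; dicts keep
    the sketch short.
    """
    def __init__(self):
        self.root = None
        self.stack = []  # path of open elements, innermost last

    def startElement(self, name, attrs):
        node = {"tag": name, "attrs": dict(attrs), "children": [], "text": ""}
        if self.stack:
            self.stack[-1]["children"].append(node)
        else:
            self.root = node
        self.stack.append(node)

    def endElement(self, name):
        self.stack.pop()

    def characters(self, content):
        if self.stack:
            self.stack[-1]["text"] += content

builder = TreeBuilder()
xml.sax.parseString(b"<a><b>hi</b></a>", builder)
print(builder.root["children"][0]["text"])  # hi
```

The nice property is that the SAX layer stays usable on its own for streaming, and the DOM layer is a thin client of it rather than a second parser.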
IMHO, I love a good SAX parser. I've used them in the past and they work great, so having one in D would be ideal, especially in those situations where the XML file is essentially read-only.
Do we need a DOM parser? I honestly don't know. Personally, I'd be happy with the associative array approach as it's simple. I don't need to learn a new API just to navigate through XML. Yes, I know there are advantages to using the DOM and XPath, which I also like, but for the most part, I don't need either.
Of course, I personally would love to just let XML die and use better data formats, but that's an unrealistic dream :)
Casey