Re: std.xml should just go

Michel Fortin Thu, 03 Feb 2011 21:06:12 -0800

On 2011-02-03 22:27:08 -0500, Andrei Alexandrescu<seewebsiteforem...@erdani.org> said:

On 2/3/11 9:11 PM, Walter Bright wrote:
Andrei Alexandrescu wrote:
Nobody that I know of. If you want to discuss design here while
working on it, that would be great. I could think of a few high-level
requirements:
* works with input ranges so we can plug it in with any source
The difficulty with that is if it's a pure input range, then the output
cannot be slices of the input.
In that case it's fair to require sliceable ranges of characters then,or strings outright. It all boils down to stating one's assumptions andchoices. Probably parameterizing on character width would berecommendable anyway.

The problem with parametrizing on the character width is that whether aparser parses a UTF-8 document or a UTF-16 document is determined atruntime by inspecting the document. How is the user of the parsersupposed to decide in advance which to instantiate? And how theapplication is supposed to handle slices of different string typescoming from those different parser instances?

The actual low-level parser could indeed use a different instancedepending on the text encoding as an optimization, but the end-user APIshould standardize on one string type. Unfortunately, if the XML fileis not using the same text encoding as your standard string type, thenyou can't use slicing and have to create copies for each and everystring...

Another option is to use a "smart" string type that can accept stringsslices of any encoding.


--
Michel Fortin
michel.for...@michelf.com
http://michelf.com/

Re: std.xml should just go

Reply via email to