Re: VLERange: a range in between BidirectionalRange and RandomAccessRange

Michel Fortin Mon, 17 Jan 2011 19:51:05 -0800

On 2011-01-17 17:54:04 -0500, Michel Fortin <michel.for...@michelf.com> said:

More seriously, you have four choices:

1. code unit
2. code point
3. grapheme
4. require the client to state explicitly which kind of 'character' he wants; 'character' being an overloaded word, it's reasonable to ask for disambiguation.

This makes me think of what I did with my XML parser after you made code points the element type for strings. Basically, the parser now uses 'front' and 'popFront' whenever it needs to get the next code point, but most of the time it uses 'frontUnit' and 'popFrontUnit' instead (which I had to add) when testing for or skipping an ASCII character is sufficient. This way I avoid a lot of unnecessary decoding of code points.

For this to work, the same range must let you skip either a unit or a code point. If I were using a separate range with a call to toDchar or toCodeUnit (or toGrapheme if I needed to check graphemes), it wouldn't have helped much because the new range would essentially become a new slice independent of the original, so you can't interleave "I want to advance by one unit" with "I want to advance by one code point".
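
To make the idea concrete, here is a minimal sketch of a range that can be advanced at either level. Only the names 'front', 'popFront', 'frontUnit' and 'popFrontUnit' come from the description above; the struct DualLevelRange, the helper skipSpaces, and the choice of a UTF-8 string as backing store are assumptions made for illustration, not the parser's actual code.

```d
import std.utf : decode, stride;

// Hypothetical wrapper over a UTF-8 string (illustration only).
struct DualLevelRange
{
    string data;

    @property bool empty() { return data.length == 0; }

    // Code point level: decodes one full UTF-8 sequence.
    @property dchar front()
    {
        size_t i = 0;
        return decode(data, i);
    }

    // Advances past every code unit of the first code point.
    void popFront() { data = data[stride(data, 0) .. $]; }

    // Code unit level: no decoding, enough for ASCII tests.
    @property char frontUnit() { return data[0]; }

    // Advances by exactly one code unit.
    void popFrontUnit() { data = data[1 .. $]; }
}

// Hypothetical helper: skips ASCII spaces one unit at a time
// and returns how many were skipped.
size_t skipSpaces(ref DualLevelRange r)
{
    size_t n;
    while (!r.empty && r.frontUnit == ' ')
    {
        r.popFrontUnit();
        ++n;
    }
    return n;
}

unittest
{
    // Both levels are interleaved on the same range.
    auto r = DualLevelRange("  \u00E9a");
    assert(skipSpaces(r) == 2);   // unit level
    assert(r.front == '\u00E9');  // code point level
    r.popFront();
    assert(r.frontUnit == 'a');
}
```

Because both sets of primitives shrink the same slice, the position is shared between the two levels, which is what a separate converted range would lose. Note that popFrontUnit is only safe when the caller knows the front unit is a complete code point (an ASCII character, for instance); calling it in the middle of a multi-byte sequence would leave the range on a continuation byte.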

So perhaps the best interface for strings would be to provide multiple range-like interfaces that you can use at the level you want.

I'm not sure if this is a good idea, but I thought I should at least share my experience.



--
Michel Fortin
michel.for...@michelf.com
http://michelf.com/
