Re: [review] new string type

Steven Schveighoffer Fri, 03 Dec 2010 12:30:22 -0800

On Fri, 03 Dec 2010 14:40:30 -0500, Jerry Quinn <[email protected]> wrote:

> I tend to do a lot of transforming strings, but I need to track offsets back to the original text to maintain alignment between the results and the input. For that, indexes are necessary and we use them a lot.

In my daily usage of strings, I generally use a string as a whole, not individual characters. But I do occasionally use indexing.

Let's also understand that indexing is still present; what is deactivated is the ability to index to arbitrary code units. It sounds to me like this new type would not affect your ability to store offsets (you can store an index, use it later when referring to the string, etc., just like you can now).
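To make the offset point concrete, here is a rough sketch in Python rather than D. It is not the proposed type; the helper names is_code_point_start and code_point_at are made up for illustration, and raising an error for a mid-character offset is only one possible reading of "deactivated". It shows a code-unit offset being stored and reused later, while an offset that lands inside a multi-byte sequence is rejected.

```python
# Illustrative sketch only: byte (code-unit) offsets into UTF-8 data.
# The function names are hypothetical, not part of any proposed D API.

def is_code_point_start(data: bytes, offset: int) -> bool:
    """True if offset is the first code unit of a code point, or the end of data."""
    if offset == len(data):
        return True
    # UTF-8 continuation bytes always have the bit pattern 10xxxxxx.
    return (data[offset] & 0xC0) != 0x80


def code_point_at(data: bytes, offset: int) -> str:
    """Decode the single code point that starts at a previously stored offset."""
    if offset >= len(data) or not is_code_point_start(data, offset):
        raise IndexError("offset is not the start of a code point")
    end = offset + 1
    # Extend over the continuation bytes that belong to this code point.
    while end < len(data) and (data[end] & 0xC0) == 0x80:
        end += 1
    return data[offset:end].decode("utf-8")


text = "naïve café".encode("utf-8")
saved = text.index("é".encode("utf-8"))  # store an offset now, use it later
```

Here code_point_at(text, saved) hands back the "é" the offset was recorded for, while is_code_point_start(text, 3) is false because offset 3 is the second byte of "ï".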

My string type does not allow for writeable strings. My plan was to allow you access to the underlying char[] and let you edit that way. Letting someone write a dchar into the middle of a utf-8 string could cause lots of problems, so I just disabled it by default.
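A small sketch of the kind of problem meant here, again in Python purely as an illustration. The helper overwrite_code_unit is hypothetical and deliberately naive: it replaces exactly one code unit with the full encoding of a character, which is roughly what an unchecked in-place write would amount to.

```python
# Illustrative sketch only: why writing one character into the middle of
# UTF-8 data is risky. Both helpers are hypothetical.

def overwrite_code_unit(data: bytes, offset: int, ch: str) -> bytes:
    """Naively replace ONE code unit at offset with the UTF-8 encoding of ch."""
    return data[:offset] + ch.encode("utf-8") + data[offset + 1:]


def is_valid_utf8(data: bytes) -> bool:
    """Check whether data decodes cleanly as UTF-8."""
    try:
        data.decode("utf-8")
        return True
    except UnicodeDecodeError:
        return False


# Case 1: a one-byte character is replaced by a two-byte one.
# The data stays valid, but it grows and every later offset shifts by one.
widened = overwrite_code_unit("abc".encode("utf-8"), 1, "é")

# Case 2: the first byte of a two-byte character is replaced by ASCII.
# The orphaned continuation byte makes the data invalid UTF-8.
corrupt = overwrite_code_unit("aéc".encode("utf-8"), 1, "x")
```

In the first case any offsets stored for text after the write are now off by one; in the second, is_valid_utf8(corrupt) is false.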

Not sure how that affects your 'transforming' work. Are you actually changing the data, or just lazily transforming? I'm interested to hear whether you think my string type would be a viable alternative.

> Probably the right thing to do in this case is just pay for the cost of using dchar everywhere, but if you're working with large enough quantities of data, storage efficiency matters.

The huge advantage of using utf-8 is backwards compatibility with ASCII for C functions.
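For a rough sense of both points (the storage cost of four-byte characters and the ASCII compatibility of utf-8), here is a Python sketch. It assumes UTF-32 as a stand-in for an array of dchar; storage_sizes is a hypothetical helper.

```python
# Illustrative sketch only: UTF-8 versus UTF-32 storage, and ASCII compatibility.
# UTF-32 stands in for an array of dchar; storage_sizes is a hypothetical helper.

def storage_sizes(s: str):
    """Return (size in bytes as UTF-8, size in bytes as UTF-32 without a BOM)."""
    return len(s.encode("utf-8")), len(s.encode("utf-32-le"))


ascii_text = "hello, world"
mixed_text = "naïve café"

ascii_sizes = storage_sizes(ascii_text)
mixed_sizes = storage_sizes(mixed_text)

# Pure ASCII text has the same bytes in ASCII and in UTF-8, which is why
# byte-oriented C functions can consume it unchanged.
same_bytes = ascii_text.encode("utf-8") == ascii_text.encode("ascii")
```

For mostly-ASCII text the four-byte representation costs close to four times the storage, and same_bytes is true for the ASCII sample.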


-Steve
