Re: standard ranges

Roman D. Boiko Thu, 28 Jun 2012 06:13:00 -0700

Timings should not be very different from random access in any UTF-32 string implementation, because of the design of these algorithms:

* only operations on 64-bit aligned words are performed (addition, multiplication, bitwise and shift operations)

* there is no branching except at the very top level for very large array sizes

* data is stored in a way that makes the algorithms cache-oblivious, IIRC. The authors claim that very few cache misses are necessary (1-2 per random access).

* after determining the code unit index for some code point index, further access is performed as usual inside an array, so in order to perform slicing it is only necessary to calculate the code unit indices for its start and end.

* original data arrays are not modified (unlike for compact representations of dstring, for example).
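
To illustrate the last two points, here is a minimal sketch in Python. It is not the succinct structure discussed in this thread: it uses a much simpler sampled index (the byte offset of every SAMPLE-th code point) plus a short scan inside a block, so it does branch and does not work on 64-bit words. All names (SAMPLE, build_index, code_unit_index, slice_code_points) are hypothetical. What it does show is that the original UTF-8 array is left untouched and that slicing by code point needs only two index translations.

```python
SAMPLE = 64  # hypothetical sampling rate: one stored offset per 64 code points


def build_index(data: bytes, sample: int = SAMPLE) -> list:
    """Record the byte offset of every `sample`-th code point in UTF-8 `data`."""
    offsets = []
    count = 0
    for pos, byte in enumerate(data):
        if (byte & 0xC0) != 0x80:  # not a continuation byte: a code point starts here
            if count % sample == 0:
                offsets.append(pos)
            count += 1
    return offsets


def code_unit_index(data: bytes, offsets: list, cp_index: int,
                    sample: int = SAMPLE) -> int:
    """Translate a code point index into a code unit (byte) index.

    `cp_index` may equal the number of code points, which yields len(data),
    so the result can be used as an exclusive slice end.
    """
    if cp_index < 0:
        raise IndexError("negative code point index")
    if not offsets:  # empty string: only index 0 is valid
        if cp_index == 0:
            return 0
        raise IndexError("code point index out of range")
    # Jump to the nearest sampled code point at or before cp_index.
    block = min(cp_index // sample, len(offsets) - 1)
    pos = offsets[block]
    remaining = cp_index - block * sample
    # Walk forward over the remaining code points inside the block.
    while remaining > 0:
        if pos >= len(data):
            raise IndexError("code point index out of range")
        pos += 1
        while pos < len(data) and (data[pos] & 0xC0) == 0x80:
            pos += 1  # skip continuation bytes
        remaining -= 1
    return pos


def slice_code_points(data: bytes, offsets: list, start: int, end: int,
                      sample: int = SAMPLE) -> bytes:
    """Slice by code point indices; only the two endpoints are translated."""
    lo = code_unit_index(data, offsets, start, sample)
    hi = code_unit_index(data, offsets, end, sample)
    return data[lo:hi]  # ordinary array slicing on the unmodified data
```

With this sketch, build_index is run once per string, and slice_code_points(data, offsets, 3, 20) then returns the bytes of code points 3 to 19. The real algorithms would replace the scan inside a block with a fixed number of word operations.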
