Re: standard ranges

Roman D. Boiko Thu, 28 Jun 2012 03:22:48 -0700

On Thursday, 28 June 2012 at 10:02:59 UTC, Roman D. Boiko wrote:

On Thursday, 28 June 2012 at 09:58:02 UTC, Roman D. Boiko wrote:
Pedantically speaking, it is possible to index a string with about 50-51% memory overhead to get random access in O(1) time. Best-performing algorithms can do random access in about 35-50 nanoseconds per operation for strings up to tens of megabytes. For bigger strings (tested up to 1 GB) or when some other memory-intensive calculations are performed simultaneously, random access takes up to 200 nanoseconds due to the memory-access resolution process.
Just a remark: indexing would take O(N) operations and N/B memory transfers, where N = str.length and B is the size of the cache buffer.

That being said, I would be against switching away from representing strings as arrays. Such a switch would hardly help us solve any problems of practical importance better (by a significant degree) than they can be solved with the current design.

However, a struct could be created for the indexing which I mentioned in the two previous posts, to give efficient random access for narrow strings (and arbitrary variable-length data stored consecutively in arrays) without any significant overhead.
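
To make the idea of such an indexing struct concrete, here is a minimal sketch in Python (not D, and not the Rank/Select-based structure discussed in this thread). It uses a simpler sampling scheme: it remembers the byte offset of every `step`-th code point of a UTF-8 byte array and scans at most `step - 1` code points forward from there. The name `CodePointIndex`, its fields and the default `step` are all hypothetical, and the input is assumed to be valid UTF-8.

```python
class CodePointIndex:
    """Sampled index over UTF-8 bytes (hypothetical layout, for illustration).

    Stores the byte offset of every `step`-th code point, so a lookup is one
    table read plus a bounded forward scan. Assumes `data` is valid UTF-8.
    """

    def __init__(self, data: bytes, step: int = 32):
        self.data = data
        self.step = step
        self.samples = []  # byte offsets of code points 0, step, 2*step, ...
        self.length = 0    # total number of code points
        # One pass over the bytes: O(N) work to build the index.
        for offset, byte in enumerate(data):
            if (byte & 0xC0) != 0x80:  # not a continuation byte: a code point starts here
                if self.length % step == 0:
                    self.samples.append(offset)
                self.length += 1

    def byte_offset(self, i: int) -> int:
        """Byte offset at which the i-th code point (0-based) starts."""
        if not 0 <= i < self.length:
            raise IndexError(i)
        offset = self.samples[i // self.step]
        remaining = i % self.step
        # Skip `remaining` code points by walking to the next start bytes.
        while remaining:
            offset += 1
            if (self.data[offset] & 0xC0) != 0x80:
                remaining -= 1
        return offset

    def __getitem__(self, i: int) -> str:
        """The i-th code point, decoded to a one-character string."""
        start = self.byte_offset(i)
        end = start + 1
        while end < len(self.data) and (self.data[end] & 0xC0) == 0x80:
            end += 1
        return self.data[start:end].decode("utf-8")
```

With a fixed `step` the scan length is bounded by a constant, and the extra memory is one stored offset per `step` code points, so `step` trades memory for lookup time. The underlying array stays untouched, which is the point of keeping the index in a separate struct.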

The respective algorithms are called Rank and Select, and there exist many variations of them (with different trade-offs, but some of them are arguably better than others).
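
For readers unfamiliar with the two operations, the following is a rough Python sketch of what they compute, under my own assumptions: a bit vector with a 1 at every byte that starts a code point, `rank1(pos)` counting the 1-bits before `pos`, and `select1(i)` returning the position of the i-th 1-bit. The class name `RankSelect`, the helper `utf8_start_bits` and the block size are hypothetical. This naive version uses per-block counts plus a binary search, so it is not one of the constant-time, low-overhead variants referred to above.

```python
from bisect import bisect_right


def utf8_start_bits(data: bytes):
    """1 for every byte that starts a code point, 0 for continuation bytes."""
    return [1 if (b & 0xC0) != 0x80 else 0 for b in data]


class RankSelect:
    """Naive rank/select over a list of 0/1 values (illustrative only)."""

    def __init__(self, bits, block: int = 64):
        self.bits = list(bits)
        self.block = block
        # cumulative[k] = number of 1-bits in bits[0 : k * block]
        self.cumulative = [0]
        for start in range(0, len(self.bits), block):
            ones = sum(self.bits[start:start + block])
            self.cumulative.append(self.cumulative[-1] + ones)

    def rank1(self, pos: int) -> int:
        """Number of 1-bits in bits[0:pos]."""
        if not 0 <= pos <= len(self.bits):
            raise IndexError(pos)
        k = pos // self.block
        return self.cumulative[k] + sum(self.bits[k * self.block:pos])

    def select1(self, i: int) -> int:
        """Position of the i-th 1-bit (0-based)."""
        if not 0 <= i < self.cumulative[-1]:
            raise IndexError(i)
        # Last block k with cumulative[k] <= i contains the wanted bit.
        k = bisect_right(self.cumulative, i) - 1
        seen = self.cumulative[k]
        pos = k * self.block
        while True:
            if self.bits[pos]:
                if seen == i:
                    return pos
                seen += 1
            pos += 1
```

Applied to a narrow string, `RankSelect(utf8_start_bits(data)).select1(i)` gives the byte offset of the i-th code point, and `rank1(p)` gives the number of code points that start before byte `p`.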

I have investigated this question quite deeply in the last two weeks, because similar algorithms would be useful in my DCT project. If nobody else implements them before me, I will eventually do that myself. It is just a matter of finding some free time, likely a week or two.
