On Friday, 18 May 2012 at 07:52:57 UTC, Mehrdad wrote:
On Thursday, 17 May 2012 at 14:02:09 UTC, Steven Schveighoffer
wrote:
2. I realized that a buffering input stream of type T is actually
an input range of type T[].
The trouble is, why a slice? Why not an std.array.Array? Why
not some other data source?
(Chicken/egg problem....)
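For what it's worth, Phobos already has one concrete instance of "a buffering input stream of T is an input range of T[]": std.stdio.File.byChunk, whose front is one buffered ubyte[] at a time. A minimal sketch (the file name here is made up for the example):

```d
import std.file : write, remove;
import std.stdio : File;

void main()
{
    // Create a small throwaway file so the example is self-contained.
    write("tmp_chunks.bin", "hello world payload");
    scope(exit) remove("tmp_chunks.bin");

    // byChunk is an input range whose element type is ubyte[]:
    // each front() is one buffer-sized chunk of the stream.
    size_t total;
    foreach (ubyte[] chunk; File("tmp_chunks.bin").byChunk(5))
        total += chunk.length;

    assert(total == "hello world payload".length);
}
```

Note that each chunk is a slice of byChunk's internal buffer, which answers "why a slice" at least for this case: it is the cheapest view Phobos can hand out without copying.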
Another problem I've noticed is the following:
Say you're tokenizing some input range, and it happens to just
be a huge, gigantic string.
It *should* be possible to turn it into tokens with slices
referring to the ORIGINAL string, which is VERY efficient
because it doesn't require *any* heap allocations whatsoever.
(You just tokenize with opApply() as you go, without ever
requiring a heap allocation...)
However, this is *only* possible if you don't use the concept
of an input range!
Since you can't slice an input range, you'd be forced to use
the front() and popFront() properties. But, as soon as you do
that, you're gonna have to store the data somewhere... so your
next-best option is to append it to some new gigantic array
(instead of a bunch of small arrays, which require a lot of
heap allocations), but even then, it's not as efficient as
possible, because there's O(n) extra memory involved -- which
defeats the whole purpose of working on small chunks at a time
with no heap allocations.
(If you're going to do that, after all, you might as well read
the entire thing into a giant string at the beginning, and work
with an array anyway, discarding the whole idea of a range
while doing your tokenization.)
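To make the zero-allocation case concrete, here is a minimal sketch of the opApply() approach described above (a hypothetical whitespace tokenizer, not code from the thread): every token it yields is a slice of the original string, so no heap allocation happens per token.

```d
// Hypothetical tokenizer: yields slices of the original string via
// opApply(), so no per-token heap allocation is needed.
struct SliceTokenizer
{
    string src;

    int opApply(scope int delegate(string token) dg)
    {
        size_t i = 0;
        while (i < src.length)
        {
            while (i < src.length && src[i] == ' ') ++i;   // skip spaces
            immutable start = i;
            while (i < src.length && src[i] != ' ') ++i;   // scan token
            if (start < i)
            {
                // The token is a slice aliasing src -- no copy.
                if (auto r = dg(src[start .. i]))
                    return r;
            }
        }
        return 0;
    }
}

void main()
{
    string text = "lex this without allocating";
    size_t n;
    foreach (tok; SliceTokenizer(text))
    {
        // Each token points into the original string.
        assert(text.ptr <= tok.ptr
            && tok.ptr + tok.length <= text.ptr + text.length);
        ++n;
    }
    assert(n == 4);
}
```

This is exactly what a generic input range interface loses: once all you have is front()/popFront(), the tokenizer cannot hand back views into the source and has to copy.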
Any ideas on how to solve this problem?
Provide slicing if the underlying data source is compatible.
I have the same need in my DCT, and so far I've gone with a custom
implementation (not on GitHub yet), but I plan to reuse std.io as
soon as it is more or less stable and usable.
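The "provide slicing when the source supports it" idea can be sketched with std.range's hasSlicing trait: take the zero-copy path when the range can be sliced, and fall back to copying via front()/popFront() otherwise. (firstToken is a made-up helper for illustration.)

```d
import std.range : ElementType, hasSlicing;

// Hypothetical helper: return the first `len` elements, slicing when
// the range supports it and copying only as a fallback.
auto firstToken(R)(R input, size_t len)
{
    static if (hasSlicing!R)
    {
        return input[0 .. len];          // zero-copy: aliases the source
    }
    else
    {
        // Fallback for pure input ranges: copy elements out one by one.
        ElementType!R[] buf;
        foreach (i; 0 .. len)
        {
            buf ~= input.front;
            input.popFront();
        }
        return buf;
    }
}

void main()
{
    import std.algorithm : filter;

    int[] arr = [1, 2, 3, 4];
    assert(firstToken(arr, 2) == [1, 2]);     // arrays slice: zero-copy path

    auto lazy_ = arr.filter!(x => true);      // filter result has no slicing
    assert(firstToken(lazy_, 2) == [1, 2]);   // copying fallback path
}
```

The nice property is that the decision is made at compile time, so callers who hand in a sliceable source pay nothing for the generic fallback.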