Re: Time for 2.067

via Digitalmars-d Thu, 05 Feb 2015 08:36:35 -0800

On Thursday, 5 February 2015 at 03:00:53 UTC, Andrei Alexandrescuwrote:

On 2/2/15 2:42 PM, "Ulrich =?UTF-8?B?S8O8dHRsZXIi?=<kuett...@gmail.com>" wrote:
On Friday, 30 January 2015 at 23:17:09 UTC, AndreiAlexandrescu wrote:
Sorry, I thought that was in the bag. Keep current semantics,call itchunkBy. Add the key to each group when the predicate isunary. Make
sure aggregate() works nice with chunkBy().
I might miss some information on this, so please forgive mynaive
question. Your requirements seem to be contradictory to me.

1. aggregate expects a range of ranges
Probably we need to change that because aggregate shouldintegrate seamlessly with chunkBy.
2. you ask chunkBy to return something that is not a range ofranges
Yah.
3. you ask chunkBy to play along nicely with aggregate
Yah.
There are certainly ways to make this work. Adding a specialversion ofaggregate comes to mind. However, I fail to see the rationalbehind this.
Rationale as discussed is that the key value for each group isuseful information. Returning a range of ranges would wastethat information forcing e.g. its recomputation.

I understand and agree. My suggestion aims to avoid thisparticular waste. See below.

To me the beauty of range is the composibility of "simple"constructs tocreate complex behavior. The current chunkBy does not need tobe changed
to "add the key to each group when the predicate is unary":

 r.map!(pred, "a")
  .chunkBy!("a[0]")
  .map!(inner => tuple(inner.front[0], inner.map!"a[1]"));
So I'd like to know why the above is inferior to a rework ofthe
chunkBy's implementation. Maybe this is a question for D.learn.
Wouldn't that force recomputation if a more complex expressionreplaced a[0]?

I do not think you ever want to replace a[0] here. In the codeabove the (original) predicate to chunkBy is pred. The idea is toevaluate the predicate outside of chunkBy. Create a range oftuples from the original range, chunk the range of tuples andconstruct the desired result from the chunked range of tuples.


// create a range of `tuple(pred(a), a)`
r.map!(pred, "a")

// chunk the range of tuples based of the first tuple element
// this results in a range of ranges of tuples
   .chunkBy!("a[0]")

// convert the inner ranges of tuples to a tuple of the predicateapplied and the appropriate range

   .map!(inner => tuple(inner.front[0], inner.map!"a[1]"));

The construction of a range of tuples is not for free. On thebright side:


* you only do it when you need it

* if your predicate is that heavy, you might want to precomputeit anyway* a modified chunkBy is not exactly free either (and you pay theprice even if you do not need the key value)

Now I learned that map is very lazy and applies the functioninside front(). Thus, the above might actually result in multipleevaluations of the predicate. Luckily, there is the new cachefunction:


auto chunkByStar(alias pred, Range)(Range r)
{
return r.map!(pred, "a")
   .cache
   .chunkBy!("a[0]")
   .map!(inner => tuple(inner.front[0], inner.map!"a[1]"));
}

My point here is, we can construct a version of chunkBy that doesnot waste the key value with modest means. With great power comesgreat flexibility. I wanted to sneak this in as an example,because it is not clear what eventual users might actually need.

On the other hand there is no limit to the special cases we couldadd. aggregate might not be the only function to work withchunkBy. And even an aggregate function that takes a tuple of arange and something else and only uses the range seems wrong tome, given expressive the power D has. The transformation of therange is just on map away:


chunkByStar!(...)(r).map!"a[1]".aggregate!max

Then again, I might be missing something huge here.

Re: Time for 2.067

Reply via email to