Andrei Alexandrescu wrote:
This is somewhat OT but I think it's an interesting problem. Consider the following data:

double[][] a =
[
    [ 1, 4, 7, 8 ],
    [ 1, 7 ],
    [ 1, 7, 8 ],
    [ 4 ],
    [ 7 ],
];

We want to compute an n-way union, i.e., efficiently span all elements in all arrays in a, in sorted order. You can assume that each individual array in a is sorted. The output of n-way union should be:

auto witness = [
    1, 1, 1, 4, 4, 7, 7, 7, 7, 8, 8
];
assert(equal(nWayUnion(a), witness[]));

The STL and std.algorithm have set_union, which does this for two sets: for example, set_union(a[0], a[1]) outputs [ 1, 1, 4, 7, 7, 8 ]. But an n-way union poses additional challenges. What would be a fast algorithm? (Imagine a could be a very large range of ranges).
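One common approach (my sketch, not something proposed in the post) is to keep a min-heap over the current front of each non-empty array: each step pops the smallest head and pushes that array's next element, giving O(total log n) comparisons for n arrays. In C++, which the post's STL reference suggests, that might look like:

```cpp
#include <cassert>
#include <cstddef>
#include <functional>
#include <queue>
#include <utility>
#include <vector>

// n-way union via a min-heap of (front value, source-array index) pairs.
// Assumes each input vector is individually sorted. Each of the `total`
// elements is pushed and popped once: O(total * log n) comparisons.
std::vector<double> nWayUnion(const std::vector<std::vector<double>>& a) {
    using Head = std::pair<double, std::size_t>;
    // std::greater turns the default max-heap into a min-heap.
    std::priority_queue<Head, std::vector<Head>, std::greater<Head>> heap;
    std::vector<std::size_t> pos(a.size(), 0);  // cursor into each array
    for (std::size_t i = 0; i < a.size(); ++i)
        if (!a[i].empty()) heap.push({a[i][0], i});

    std::vector<double> out;
    while (!heap.empty()) {
        auto [v, i] = heap.top();  // smallest remaining front
        heap.pop();
        out.push_back(v);
        if (++pos[i] < a[i].size())      // advance the originating array
            heap.push({a[i][pos[i]], i});  // and re-offer its new front
    }
    return out;
}
```

This is an eager version that materializes the whole result; the lazy range discussed below would instead keep the heap as its state and do one pop/push per popFront().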

It seems like there are two basic options: either merge the arrays (as
per merge sort) and deal with multiple passes across the same data
elements, or insert the elements from each array into a single
destination array and deal with a bunch of memmove operations.

The merge option is kind of interesting because it could benefit from a parallel range of sorts. Each front() op could return a range containing the front element of each non-empty range. Pass this to a min() op that accepts a range and drop the min element into the destination. The tricky part would be arranging things so that the originating range can have popFront() called once the insertion has occurred.
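The scheme above (look at every front, take the min, pop it from the range it came from) can be sketched eagerly by tracking which array holds the minimum; the name mergeByMinFront is illustrative, not from any library. Compared to a heap it costs O(total * n) comparisons, since every output element rescans all n fronts:

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Linear-scan variant of the merge: each step inspects the front of
// every non-empty range, copies the smallest front into the destination,
// and pops it from the originating range. O(total * n) comparisons.
std::vector<double> mergeByMinFront(std::vector<std::vector<double>> a) {
    std::vector<std::size_t> pos(a.size(), 0);  // front index per range
    std::vector<double> out;
    for (;;) {
        std::size_t best = a.size();  // index of the range with the min front
        for (std::size_t i = 0; i < a.size(); ++i) {
            if (pos[i] == a[i].size()) continue;  // range exhausted
            if (best == a.size() || a[i][pos[i]] < a[best][pos[best]])
                best = i;
        }
        if (best == a.size()) break;  // every range is empty: done
        out.push_back(a[best][pos[best]]);
        ++pos[best];  // the popFront() on the originating range
    }
    return out;
}
```

Remembering the index of the minimum, rather than just its value, is what resolves the tricky part: the caller knows exactly which range to pop after the insertion.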

Needless to say, nWayUnion is a range :o).

Finally, why would anyone care for something like this?

Other than mergeSort? I'd think that being able to perform union of N sets in unison would be a nice way to eliminate arbitrary restrictions.
