Re: [OT] n-way union

Andrei Alexandrescu Mon, 25 May 2009 11:55:12 -0700

Georg Wrede wrote:

Andrei Alexandrescu wrote:
This is somewhat OT but I think it's an interesting problem. Considerthe following data:
double[][] a =
[
    [ 1, 4, 7, 8 ],
    [ 1, 7 ],
    [ 1, 7, 8],
    [ 4 ],
    [ 7 ],
];
We want to compute an n-way union, i.e., efficiently span all elementsin all arrays in a, in sorted order. You can assume that eachindividual array in a is sorted. The output of n-way union should be:
auto witness = [
    1, 1, 1, 4, 4, 7, 7, 7, 7, 8, 8
];
assert(equal(nWayUnion(a), witness[]));
The STL and std.algorithm have set_union that does that for two sets:for example, set_union(a[0], a[1]) outputs [ 1, 1, 4, 7, 7, 8 ]. Butn-way unions poses additional challenges. What would be a fastalgorithm? (Imagine a could be a very large range of ranges).
Needless to say, nWayUnion is a range :o).
If we'd know anything about the data, such as, the max value is alwayssmaller than the total number of elements in the subarrays, then we'dprobably more easily invent a decent algorithm.
But the totally general algorithm has to be more inefficient. Andconstructing (not worst-case, but) tough-case data is trivial. Forexample, take a thousand subarrays, each a thousand elements long,containing random uints from the inclusive range 0..uint.max.


You can assume that each array is sorted.

Andrei

Re: [OT] n-way union

Reply via email to