Re: Is anybody working on the globally "order by" of hive ?

Jeff Hammerbacher Fri, 11 Jun 2010 23:02:55 -0700

See https://issues.apache.org/jira/browse/HIVE-1402.


On Fri, Jun 11, 2010 at 1:22 PM, John Sichi <[email protected]> wrote:

> If someone is interested in adding parallel ORDER BY to Hive (using
> TotalOrderPartitioner), here's a good starting point:
>
> http://wiki.apache.org/hadoop/Hive/HBaseBulkLoad
>
> The goal would be to take that manual two-step sample-then-sort process and
> turn it into an automatic plan within Hive.  I have a better example for the
> sampling query which I haven't published yet.
>
> We would also need to name the final output files in such a way that the
> total order could be iterated via the filenames.
>
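
For readers unfamiliar with the sample-then-sort idea described in the quoted message, here is a minimal sketch in plain Python. It is not Hive or Hadoop code and does not use TotalOrderPartitioner itself. The function names, the `part-NNNNN` naming scheme, and the sample size are assumptions made for illustration, and the input is assumed to be a non-empty list of comparable keys.

```python
import bisect
import random


def sample_split_points(keys, num_partitions, sample_size, seed=0):
    """Step 1 (sample): pick num_partitions - 1 boundary keys from a random sample."""
    rng = random.Random(seed)
    sample = sorted(rng.sample(keys, min(sample_size, len(keys))))
    # Take evenly spaced keys from the sorted sample as partition boundaries.
    return [sample[i * len(sample) // num_partitions] for i in range(1, num_partitions)]


def total_order_sort(keys, split_points):
    """Step 2 (sort): range-partition by the boundaries, then sort each partition."""
    partitions = [[] for _ in range(len(split_points) + 1)]
    for key in keys:
        # bisect_right sends a key equal to a boundary to the upper partition.
        partitions[bisect.bisect_right(split_points, key)].append(key)
    # Zero-padded names (hypothetical scheme) so that lexicographic filename
    # order matches key order across partitions.
    return {f"part-{i:05d}": sorted(p) for i, p in enumerate(partitions)}


if __name__ == "__main__":
    rng = random.Random(42)
    keys = [rng.randrange(10000) for _ in range(500)]
    output = total_order_sort(keys, sample_split_points(keys, 4, 50))
    # Reading the "files" in filename order yields the global order.
    merged = [k for name in sorted(output) for k in output[name]]
    print(merged == sorted(keys))
```

Each partition can be sorted independently (in Hadoop terms, by a separate reducer), and because the partitions cover disjoint key ranges, iterating the outputs by filename gives the total order without a final merge.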
