See https://issues.apache.org/jira/browse/HIVE-1402.
On Fri, Jun 11, 2010 at 1:22 PM, John Sichi <jsi...@facebook.com> wrote:
> If someone is interested in adding parallel ORDER BY to Hive (using
> TotalOrderPartitioner), here's a good starting point:
>
> http://wiki.apache.org/hadoop/Hive/HBaseBulkLoad
>
> The goal would be to take that manual two-step sample-then-sort process and
> turn it into an automatic plan within Hive. I have a better example for the
> sampling query which I haven't published yet.
>
> We would also need to name the final output files in such a way that the
> total order could be iterated via the filenames.
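
For anyone picking this up, here is a rough Python sketch of the sample-then-sort idea described in the quoted message. It is not Hive or Hadoop code, only an in-memory illustration of range partitioning on sampled split points, which is the approach TotalOrderPartitioner relies on. The function names (choose_split_points, partition_index, total_order_sort), the sample size, and the part-NNNNN file naming are all assumptions made up for the example.

```python
import bisect
import random


def choose_split_points(sample, num_partitions):
    """Pick num_partitions - 1 cut points from a sample of the keys."""
    ordered = sorted(sample)
    if not ordered:
        return []
    # Evenly spaced positions in the sorted sample approximate the quantiles.
    return [ordered[len(ordered) * i // num_partitions]
            for i in range(1, num_partitions)]


def partition_index(key, split_points):
    """Route a key to the range partition it belongs to."""
    return bisect.bisect_right(split_points, key)


def total_order_sort(rows, num_partitions, sample_size=100, seed=0):
    """Step 1: sample keys and derive split points.
    Step 2: range-partition the rows, then sort each partition on its own.
    Returns a dict mapping a hypothetical output file name to its sorted keys.
    """
    rng = random.Random(seed)
    sample = rng.sample(rows, min(sample_size, len(rows)))
    split_points = choose_split_points(sample, num_partitions)

    buckets = [[] for _ in range(num_partitions)]
    for key in rows:
        buckets[partition_index(key, split_points)].append(key)

    # Zero-padded names sort lexicographically in partition order, so reading
    # the files in filename order walks the data in total order.
    return {"part-%05d" % i: sorted(b) for i, b in enumerate(buckets)}


if __name__ == "__main__":
    data = list(range(1000))
    random.Random(1).shuffle(data)
    files = total_order_sort(data, num_partitions=4)
    for name in sorted(files):
        print(name, len(files[name]))
```

Each partition can be sorted independently (in a real job, by a separate reducer), and because every key in one partition is no larger than every key in the next, concatenating the outputs in filename order gives the full ordering. How evenly the partitions are balanced depends on the quality of the sample.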