Gunther Hagleitner created HIVE-6098:
----------------------------------------

             Summary: Merge Tez branch into trunk
                 Key: HIVE-6098
                 URL: https://issues.apache.org/jira/browse/HIVE-6098
             Project: Hive
          Issue Type: New Feature
    Affects Versions: 0.12.0
            Reporter: Gunther Hagleitner
            Assignee: Gunther Hagleitner


I think the Tez branch is at a point where we can consider merging it back into 
trunk after review. 

Tez itself has had its first release, most hive features are available on Tez 
and the test coverage is decent. There are a few known limitations, all of 
which can be handled in trunk as far as I can tell (i.e.: None of them are 
large disruptive changes that still require a branch.)

Limitations:
- Union all is not yet supported on Tez
- SMB is not yet supported on Tez
- Bucketed map-join is executed as broadcast join (bucketing is ignored)

Since the user is free to toggle hive.optimize.tez, it's obviously possible to 
just run these on MR.

I am hoping to follow the approach that was taken with vectorization and shoot 
for a merge instead of single commit. This would retain history of the branch. 
Also in vectorization we required at least three +1s before merge, I'm hoping 
to go with that as well.

I will add a combined patch to this ticket for review purposes (not for 
commit). I'll also attach instructions to run on a cluster if anyone wants to 
try.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

Reply via email to