RE: tez + union stmt

2017-01-11 Thread Bikas Saha
between hive-tez and hive-mr. Bikas From: Rohini Palaniswamy [mailto:rohini.adi...@gmail.com] Sent: Tuesday, January 10, 2017 2:55 PM To: user@hive.apache.org; u...@tez.apache.org Subject: Re: tez + union stmt The implementation in hive does look wrong. The concept of VertexGroups was

Re: tez + union stmt

2017-01-11 Thread Elliot West
Thank you. On Wed, 11 Jan 2017 at 07:21, Chris Drome wrote: > Elliot, > > Mithun already created the following ticket to track the issue: > > https://issues.apache.org/jira/browse/HIVE-15575 > > chris > > > On Tuesday, January 10, 2017 11:05 PM, Elliot West > wrote: > > > Thanks Rohini, > > Thi

Re: tez + union stmt

2017-01-10 Thread Chris Drome
Elliot, Mithun already created the following ticket to track the issue: https://issues.apache.org/jira/browse/HIVE-15575 chris On Tuesday, January 10, 2017 11:05 PM, Elliot West wrote: Thanks Rohini, This is good to know. Could you perhaps raise an issue in the Hive JIRA? Thanks, Ellio

Re: tez + union stmt

2017-01-10 Thread Elliot West
Thanks Rohini, This is good to know. Could you perhaps raise an issue in the Hive JIRA? Thanks, Elliot. On Tue, 10 Jan 2017 at 22:55, Rohini Palaniswamy wrote: > The implementation in hive does look wrong. The concept of VertexGroups > was added in Tez specifically for the case of union to su

Re: tez + union stmt

2017-01-10 Thread Rohini Palaniswamy
The implementation in hive does look wrong. The concept of VertexGroups was added in Tez specifically for the case of union to support writing to same directory from different vertices. Sub-directories should not be required as a workaround. Regards, Rohini On Sun, Dec 25, 2016 at 10:58 AM, Steph

Re: tez + union stmt

2016-12-25 Thread Stephen Sprague
Thanks Elliot. Nice christmas present. Those settings in that stackoverflow link look to me to be exactly what i need to set for MR jobs to pick that data up that Tez created. Cheers, Stephen. On Sun, Dec 25, 2016 at 2:45 AM, Elliot West wrote: > I believe that tez will generate subfolders f

Re: tez + union stmt

2016-12-25 Thread Elliot West
I believe that tez will generate subfolders for unioned data. As far as I know, this is the expected behaviour and there is no alternative. Presumably this is to prevent multiple tasks from attempting to write the same file? We've experienced issues when switching from mr to tez; downstream jobs w

Re: tez + union stmt

2016-12-24 Thread Rajesh Balamohan
Are there any exceptions in hive.log?. Is tmp_pv_v4* table part of the select query? Assuming you are creating the table in staging.db, it would have created the table location as staging.db/foo (as you have not specified the location). Adding user@hive.apache.org as this is hive related. ~Raje