[jira] [Updated] (HIVE-3485) Skewed DDL doesn't support skewed value with string quote

2012-09-18 Thread Gang Tim Liu (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-3485?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Gang Tim Liu updated HIVE-3485:
---

Attachment: hive-3485.patch.1

 Skewed DDL doesn't support skewed value with string quote
 -

 Key: HIVE-3485
 URL: https://issues.apache.org/jira/browse/HIVE-3485
 Project: Hive
  Issue Type: Bug
Reporter: Gang Tim Liu
Assignee: Gang Tim Liu
Priority: Minor
 Attachments: hive-3485.patch.1


 CREATE TABLE list_bucket_single (key STRING, value STRING) SKEWED BY (key) ON 
 ('1','5','6')
 Save '1' as in map instead 1 should

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (HIVE-3485) Skewed DDL doesn't support skewed value with string quote

2012-09-18 Thread Gang Tim Liu (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-3485?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13458366#comment-13458366
 ] 

Gang Tim Liu commented on HIVE-3485:


ready for review https://reviews.facebook.net/D5547

 Skewed DDL doesn't support skewed value with string quote
 -

 Key: HIVE-3485
 URL: https://issues.apache.org/jira/browse/HIVE-3485
 Project: Hive
  Issue Type: Bug
Reporter: Gang Tim Liu
Assignee: Gang Tim Liu
Priority: Minor
 Attachments: hive-3485.patch.1


 CREATE TABLE list_bucket_single (key STRING, value STRING) SKEWED BY (key) ON 
 ('1','5','6')
 Save '1' as in map instead 1 should

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


Re: run a test case for hadoop 23

2012-09-14 Thread Gang Tim Liu
Zhenxiao,

Thanks. It also works.

Sorry for late reply.

Tim

On 9/11/12 5:08 PM, Zhenxiao Luo zhenx...@cloudera.com wrote:

Hi,

The following seems working to build/test hadoop23:

$ant very-clean package -Dhadoop.version=0.23.1
-Dhadoop-0.23.version=0.23.1 -Dhadoop.mr.rev=23
$ant test -Dhadoop.version=0.23.1 -Dhadoop-0.23.version=0.23.1
-Dhadoop.mr.rev=23

Thanks,
Zhenxiao

On Tue, Sep 11, 2012 at 4:51 PM, Gang Tim Liu g...@fb.com wrote:
 This one will work and it doesn't need change in build.properties.
Details
 in HIVE-3389.

 ant test -Dhadoop.mr.rev=23 -Dtest.print.classpath=true
 -Dhadoop.version=2.0.0-alpha -Dhadoop.security.version=2.0.0-alpha
 -Dtestcase=TestCliDriver -Dqfile=...q

 I will update hive wiki so that people can save efforts.

 Thanks

 Tim

 On 9/11/12 4:14 PM, Gang Tim Liu g...@fb.com wrote:

I am running test in eclipse and it is printing

INCLUDE list does not contain Hadoop Version 0.20. SkippingÅ 

It's from QTestUtil.java due to hadoopversion 0.20


Thanks

On 9/11/12 4:04 PM, Chris Drome cdr...@yahoo-inc.com wrote:

You can try cleaning and rebuilding:

ant clean package

Then you can try running your tests:

ant test

How do you know that it is using hadoop 20?

chris

On 9/11/12 3:57 PM, Gang Tim Liu g...@fb.com wrote:

Not working.

I must miss something.

What command do you use after changing it?

thanks

On 9/11/12 2:31 PM, Chris Drome cdr...@yahoo-inc.com wrote:

You might try modifying the build.properties file as follows:

hadoop.version=${hadoop-0.23.version}
hadoop.security.version=${hadoop-0.23.version}
hadoop.mr.rev=23

This works for me.

chris



On 9/11/12 1:59 PM, Gang Tim Liu g...@fb.com wrote:

Hello all,

I am looking for help and thank first.

How to run a test case for hadoop 23 only? Exact command?

I tried the following and it didn't work:

ant ... -Dhadoop.version=0.23.1 -Dhadoop-0.23.version=0.23.1
-Dhadoop.mr.rev=23


It still gets 0.20 as hadoop version.

Thanks

Tim









run a test case for hadoop 23

2012-09-11 Thread Gang Tim Liu
Hello all,

I am looking for help and thank first.

How to run a test case for hadoop 23 only? Exact command?

I tried the following and it didn't work:

ant ... -Dhadoop.version=0.23.1 -Dhadoop-0.23.version=0.23.1
-Dhadoop.mr.rev=23


It still gets 0.20 as hadoop version.

Thanks

Tim



Re: run a test case for hadoop 23

2012-09-11 Thread Gang Tim Liu
Not working.

I must miss something.

What command do you use after changing it?

thanks

On 9/11/12 2:31 PM, Chris Drome cdr...@yahoo-inc.com wrote:

You might try modifying the build.properties file as follows:

hadoop.version=${hadoop-0.23.version}
hadoop.security.version=${hadoop-0.23.version}
hadoop.mr.rev=23

This works for me.

chris



On 9/11/12 1:59 PM, Gang Tim Liu g...@fb.com wrote:

Hello all,

I am looking for help and thank first.

How to run a test case for hadoop 23 only? Exact command?

I tried the following and it didn't work:

ant ... -Dhadoop.version=0.23.1 -Dhadoop-0.23.version=0.23.1
-Dhadoop.mr.rev=23


It still gets 0.20 as hadoop version.

Thanks

Tim





Re: run a test case for hadoop 23

2012-09-11 Thread Gang Tim Liu
I am running test in eclipse and it is printing

INCLUDE list does not contain Hadoop Version 0.20. SkippingÅ 

It's from QTestUtil.java due to hadoopversion 0.20


Thanks

On 9/11/12 4:04 PM, Chris Drome cdr...@yahoo-inc.com wrote:

You can try cleaning and rebuilding:

ant clean package

Then you can try running your tests:

ant test

How do you know that it is using hadoop 20?

chris

On 9/11/12 3:57 PM, Gang Tim Liu g...@fb.com wrote:

Not working.

I must miss something.

What command do you use after changing it?

thanks

On 9/11/12 2:31 PM, Chris Drome cdr...@yahoo-inc.com wrote:

You might try modifying the build.properties file as follows:

hadoop.version=${hadoop-0.23.version}
hadoop.security.version=${hadoop-0.23.version}
hadoop.mr.rev=23

This works for me.

chris



On 9/11/12 1:59 PM, Gang Tim Liu g...@fb.com wrote:

Hello all,

I am looking for help and thank first.

How to run a test case for hadoop 23 only? Exact command?

I tried the following and it didn't work:

ant ... -Dhadoop.version=0.23.1 -Dhadoop-0.23.version=0.23.1
-Dhadoop.mr.rev=23


It still gets 0.20 as hadoop version.

Thanks

Tim







Re: run a test case for hadoop 23

2012-09-11 Thread Gang Tim Liu
This one will work and it doesn't need change in build.properties. Details
in HIVE-3389.

ant test -Dhadoop.mr.rev=23 -Dtest.print.classpath=true
-Dhadoop.version=2.0.0-alpha -Dhadoop.security.version=2.0.0-alpha
-Dtestcase=TestCliDriver -Dqfile=...q

I will update hive wiki so that people can save efforts.

Thanks

Tim

On 9/11/12 4:14 PM, Gang Tim Liu g...@fb.com wrote:

I am running test in eclipse and it is printing

INCLUDE list does not contain Hadoop Version 0.20. SkippingÅ 

It's from QTestUtil.java due to hadoopversion 0.20


Thanks

On 9/11/12 4:04 PM, Chris Drome cdr...@yahoo-inc.com wrote:

You can try cleaning and rebuilding:

ant clean package

Then you can try running your tests:

ant test

How do you know that it is using hadoop 20?

chris

On 9/11/12 3:57 PM, Gang Tim Liu g...@fb.com wrote:

Not working.

I must miss something.

What command do you use after changing it?

thanks

On 9/11/12 2:31 PM, Chris Drome cdr...@yahoo-inc.com wrote:

You might try modifying the build.properties file as follows:

hadoop.version=${hadoop-0.23.version}
hadoop.security.version=${hadoop-0.23.version}
hadoop.mr.rev=23

This works for me.

chris



On 9/11/12 1:59 PM, Gang Tim Liu g...@fb.com wrote:

Hello all,

I am looking for help and thank first.

How to run a test case for hadoop 23 only? Exact command?

I tried the following and it didn't work:

ant ... -Dhadoop.version=0.23.1 -Dhadoop-0.23.version=0.23.1
-Dhadoop.mr.rev=23


It still gets 0.20 as hadoop version.

Thanks

Tim








[jira] [Updated] (HIVE-3072) Hive List Bucketing - DDL support

2012-08-24 Thread Gang Tim Liu (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-3072?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Gang Tim Liu updated HIVE-3072:
---

Status: Patch Available  (was: In Progress)

Patch is available.

 Hive List Bucketing - DDL support
 -

 Key: HIVE-3072
 URL: https://issues.apache.org/jira/browse/HIVE-3072
 Project: Hive
  Issue Type: New Feature
  Components: SQL
Reporter: Gang Tim Liu
Assignee: Gang Tim Liu
 Attachments: HIVE-3072.patch, HIVE-3072.patch.1, HIVE-3072.patch.2, 
 HIVE-3072.patch.3, HIVE-3072.patch.4, HIVE-3072.patch.5, HIVE-3072.patch.6, 
 HIVE-3072.patch.7


 If a hive table column has skewed keys, query performance on non-skewed key 
 is always impacted. Hive List Bucketing feature will address it:
 https://cwiki.apache.org/Hive/listbucketing.html
 This jira issue will track DDL change for the feature. It's for both single 
 skewed column and multiple columns.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (HIVE-3072) Hive List Bucketing - DDL support

2012-08-24 Thread Gang Tim Liu (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-3072?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Gang Tim Liu updated HIVE-3072:
---

Attachment: HIVE-3072.patch.7

 Hive List Bucketing - DDL support
 -

 Key: HIVE-3072
 URL: https://issues.apache.org/jira/browse/HIVE-3072
 Project: Hive
  Issue Type: New Feature
  Components: SQL
Reporter: Gang Tim Liu
Assignee: Gang Tim Liu
 Attachments: HIVE-3072.patch, HIVE-3072.patch.1, HIVE-3072.patch.2, 
 HIVE-3072.patch.3, HIVE-3072.patch.4, HIVE-3072.patch.5, HIVE-3072.patch.6, 
 HIVE-3072.patch.7


 If a hive table column has skewed keys, query performance on non-skewed key 
 is always impacted. Hive List Bucketing feature will address it:
 https://cwiki.apache.org/Hive/listbucketing.html
 This jira issue will track DDL change for the feature. It's for both single 
 skewed column and multiple columns.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Work started] (HIVE-3072) Hive List Bucketing - DDL support

2012-08-23 Thread Gang Tim Liu (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-3072?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Work on HIVE-3072 started by Gang Tim Liu.

 Hive List Bucketing - DDL support
 -

 Key: HIVE-3072
 URL: https://issues.apache.org/jira/browse/HIVE-3072
 Project: Hive
  Issue Type: New Feature
  Components: SQL
Reporter: Gang Tim Liu
Assignee: Gang Tim Liu
 Attachments: HIVE-3072.patch, HIVE-3072.patch.1, HIVE-3072.patch.2, 
 HIVE-3072.patch.3, HIVE-3072.patch.4


 If a hive table column has skewed keys, query performance on non-skewed key 
 is always impacted. Hive List Bucketing feature will address it:
 https://cwiki.apache.org/Hive/listbucketing.html
 This jira issue will track DDL change for the feature. It's for both single 
 skewed column and multiple columns.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (HIVE-3072) Hive List Bucketing - DDL support

2012-08-23 Thread Gang Tim Liu (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-3072?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Gang Tim Liu updated HIVE-3072:
---

Attachment: HIVE-3072.patch.5

 Hive List Bucketing - DDL support
 -

 Key: HIVE-3072
 URL: https://issues.apache.org/jira/browse/HIVE-3072
 Project: Hive
  Issue Type: New Feature
  Components: SQL
Reporter: Gang Tim Liu
Assignee: Gang Tim Liu
 Attachments: HIVE-3072.patch, HIVE-3072.patch.1, HIVE-3072.patch.2, 
 HIVE-3072.patch.3, HIVE-3072.patch.4, HIVE-3072.patch.5


 If a hive table column has skewed keys, query performance on non-skewed key 
 is always impacted. Hive List Bucketing feature will address it:
 https://cwiki.apache.org/Hive/listbucketing.html
 This jira issue will track DDL change for the feature. It's for both single 
 skewed column and multiple columns.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (HIVE-3072) Hive List Bucketing - DDL support

2012-08-23 Thread Gang Tim Liu (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-3072?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Gang Tim Liu updated HIVE-3072:
---

Status: Patch Available  (was: In Progress)

Patch is ready for review. It's in both jira and phabricator.

 Hive List Bucketing - DDL support
 -

 Key: HIVE-3072
 URL: https://issues.apache.org/jira/browse/HIVE-3072
 Project: Hive
  Issue Type: New Feature
  Components: SQL
Reporter: Gang Tim Liu
Assignee: Gang Tim Liu
 Attachments: HIVE-3072.patch, HIVE-3072.patch.1, HIVE-3072.patch.2, 
 HIVE-3072.patch.3, HIVE-3072.patch.4, HIVE-3072.patch.5


 If a hive table column has skewed keys, query performance on non-skewed key 
 is always impacted. Hive List Bucketing feature will address it:
 https://cwiki.apache.org/Hive/listbucketing.html
 This jira issue will track DDL change for the feature. It's for both single 
 skewed column and multiple columns.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Work started] (HIVE-3072) Hive List Bucketing - DDL support

2012-08-23 Thread Gang Tim Liu (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-3072?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Work on HIVE-3072 started by Gang Tim Liu.

 Hive List Bucketing - DDL support
 -

 Key: HIVE-3072
 URL: https://issues.apache.org/jira/browse/HIVE-3072
 Project: Hive
  Issue Type: New Feature
  Components: SQL
Reporter: Gang Tim Liu
Assignee: Gang Tim Liu
 Attachments: HIVE-3072.patch, HIVE-3072.patch.1, HIVE-3072.patch.2, 
 HIVE-3072.patch.3, HIVE-3072.patch.4, HIVE-3072.patch.5, HIVE-3072.patch.6


 If a hive table column has skewed keys, query performance on non-skewed key 
 is always impacted. Hive List Bucketing feature will address it:
 https://cwiki.apache.org/Hive/listbucketing.html
 This jira issue will track DDL change for the feature. It's for both single 
 skewed column and multiple columns.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HIVE-3072) Hive List Bucketing - DDL support

2012-08-23 Thread Gang Tim Liu (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-3072?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13440845#comment-13440845
 ] 

Gang Tim Liu commented on HIVE-3072:


Patch is available on both jara and phabricator.

 Hive List Bucketing - DDL support
 -

 Key: HIVE-3072
 URL: https://issues.apache.org/jira/browse/HIVE-3072
 Project: Hive
  Issue Type: New Feature
  Components: SQL
Reporter: Gang Tim Liu
Assignee: Gang Tim Liu
 Attachments: HIVE-3072.patch, HIVE-3072.patch.1, HIVE-3072.patch.2, 
 HIVE-3072.patch.3, HIVE-3072.patch.4, HIVE-3072.patch.5, HIVE-3072.patch.6


 If a hive table column has skewed keys, query performance on non-skewed key 
 is always impacted. Hive List Bucketing feature will address it:
 https://cwiki.apache.org/Hive/listbucketing.html
 This jira issue will track DDL change for the feature. It's for both single 
 skewed column and multiple columns.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (HIVE-3072) Hive List Bucketing - DDL support

2012-08-23 Thread Gang Tim Liu (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-3072?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Gang Tim Liu updated HIVE-3072:
---

Attachment: HIVE-3072.patch.6

 Hive List Bucketing - DDL support
 -

 Key: HIVE-3072
 URL: https://issues.apache.org/jira/browse/HIVE-3072
 Project: Hive
  Issue Type: New Feature
  Components: SQL
Reporter: Gang Tim Liu
Assignee: Gang Tim Liu
 Attachments: HIVE-3072.patch, HIVE-3072.patch.1, HIVE-3072.patch.2, 
 HIVE-3072.patch.3, HIVE-3072.patch.4, HIVE-3072.patch.5, HIVE-3072.patch.6


 If a hive table column has skewed keys, query performance on non-skewed key 
 is always impacted. Hive List Bucketing feature will address it:
 https://cwiki.apache.org/Hive/listbucketing.html
 This jira issue will track DDL change for the feature. It's for both single 
 skewed column and multiple columns.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HIVE-3072) Hive List Bucketing - DDL support

2012-08-23 Thread Gang Tim Liu (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-3072?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13440881#comment-13440881
 ] 

Gang Tim Liu commented on HIVE-3072:


@Carl, yes , will code after dinner. Thanks a lot

 Hive List Bucketing - DDL support
 -

 Key: HIVE-3072
 URL: https://issues.apache.org/jira/browse/HIVE-3072
 Project: Hive
  Issue Type: New Feature
  Components: SQL
Reporter: Gang Tim Liu
Assignee: Gang Tim Liu
 Attachments: HIVE-3072.patch, HIVE-3072.patch.1, HIVE-3072.patch.2, 
 HIVE-3072.patch.3, HIVE-3072.patch.4, HIVE-3072.patch.5, HIVE-3072.patch.6


 If a hive table column has skewed keys, query performance on non-skewed key 
 is always impacted. Hive List Bucketing feature will address it:
 https://cwiki.apache.org/Hive/listbucketing.html
 This jira issue will track DDL change for the feature. It's for both single 
 skewed column and multiple columns.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Work started] (HIVE-3072) Hive List Bucketing - DDL support

2012-08-21 Thread Gang Tim Liu (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-3072?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Work on HIVE-3072 started by Gang Tim Liu.

 Hive List Bucketing - DDL support
 -

 Key: HIVE-3072
 URL: https://issues.apache.org/jira/browse/HIVE-3072
 Project: Hive
  Issue Type: New Feature
  Components: SQL
Reporter: Gang Tim Liu
Assignee: Gang Tim Liu
 Attachments: HIVE-3072.patch, HIVE-3072.patch.1, HIVE-3072.patch.2, 
 HIVE-3072.patch.3


 If a hive table column has skewed keys, query performance on non-skewed key 
 is always impacted. Hive List Bucketing feature will address it:
 https://cwiki.apache.org/Hive/listbucketing.html
 This jira issue will track DDL change for the feature. It's for both single 
 skewed column and multiple columns.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (HIVE-3072) Hive List Bucketing - DDL support

2012-08-21 Thread Gang Tim Liu (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-3072?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Gang Tim Liu updated HIVE-3072:
---

Attachment: HIVE-3072.patch.4

 Hive List Bucketing - DDL support
 -

 Key: HIVE-3072
 URL: https://issues.apache.org/jira/browse/HIVE-3072
 Project: Hive
  Issue Type: New Feature
  Components: SQL
Reporter: Gang Tim Liu
Assignee: Gang Tim Liu
 Attachments: HIVE-3072.patch, HIVE-3072.patch.1, HIVE-3072.patch.2, 
 HIVE-3072.patch.3, HIVE-3072.patch.4


 If a hive table column has skewed keys, query performance on non-skewed key 
 is always impacted. Hive List Bucketing feature will address it:
 https://cwiki.apache.org/Hive/listbucketing.html
 This jira issue will track DDL change for the feature. It's for both single 
 skewed column and multiple columns.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (HIVE-3072) Hive List Bucketing - DDL support

2012-08-21 Thread Gang Tim Liu (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-3072?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Gang Tim Liu updated HIVE-3072:
---

Status: Patch Available  (was: In Progress)

patch is available on both jira and phabricator. thanks

 Hive List Bucketing - DDL support
 -

 Key: HIVE-3072
 URL: https://issues.apache.org/jira/browse/HIVE-3072
 Project: Hive
  Issue Type: New Feature
  Components: SQL
Reporter: Gang Tim Liu
Assignee: Gang Tim Liu
 Attachments: HIVE-3072.patch, HIVE-3072.patch.1, HIVE-3072.patch.2, 
 HIVE-3072.patch.3, HIVE-3072.patch.4


 If a hive table column has skewed keys, query performance on non-skewed key 
 is always impacted. Hive List Bucketing feature will address it:
 https://cwiki.apache.org/Hive/listbucketing.html
 This jira issue will track DDL change for the feature. It's for both single 
 skewed column and multiple columns.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HIVE-3394) Refactor a few classes' constructors with creational patterns

2012-08-20 Thread Gang Tim Liu (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-3394?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13438007#comment-13438007
 ] 

Gang Tim Liu commented on HIVE-3394:


@Carl, great! will address them and have a patch ready before noon. thanks

 Refactor a few classes' constructors with creational patterns
 -

 Key: HIVE-3394
 URL: https://issues.apache.org/jira/browse/HIVE-3394
 Project: Hive
  Issue Type: Bug
Reporter: Gang Tim Liu
Assignee: Gang Tim Liu
Priority: Minor
 Attachments: HIVE-3394.patch.1


 It's good to refactor the following classes' constructors with 
 builder/factory pattern. This should be done before HIVE-3072 so that 
 HIVE-3072 can refresh according and extend it to skewed use case:
 1. MStorageDescriptor.java
 2. ColumnInfo.java
 3. ParseContext.java
 4. CreateTableDesc.java
 5. ExprNodeColumnDesc.java

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Work started] (HIVE-3394) Refactor a few classes' constructors with creational patterns

2012-08-20 Thread Gang Tim Liu (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-3394?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Work on HIVE-3394 started by Gang Tim Liu.

 Refactor a few classes' constructors with creational patterns
 -

 Key: HIVE-3394
 URL: https://issues.apache.org/jira/browse/HIVE-3394
 Project: Hive
  Issue Type: Bug
Reporter: Gang Tim Liu
Assignee: Gang Tim Liu
Priority: Minor
 Attachments: HIVE-3394.patch.1


 It's good to refactor the following classes' constructors with 
 builder/factory pattern. This should be done before HIVE-3072 so that 
 HIVE-3072 can refresh according and extend it to skewed use case:
 1. MStorageDescriptor.java
 2. ColumnInfo.java
 3. ParseContext.java
 4. CreateTableDesc.java
 5. ExprNodeColumnDesc.java

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (HIVE-3394) Refactor a few classes' constructors with creational patterns

2012-08-20 Thread Gang Tim Liu (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-3394?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Gang Tim Liu updated HIVE-3394:
---

Status: Patch Available  (was: In Progress)

Patch is available in both jira and phabricator. Some explannations in 
phabricator.

 Refactor a few classes' constructors with creational patterns
 -

 Key: HIVE-3394
 URL: https://issues.apache.org/jira/browse/HIVE-3394
 Project: Hive
  Issue Type: Bug
Reporter: Gang Tim Liu
Assignee: Gang Tim Liu
Priority: Minor
 Attachments: HIVE-3394.patch.1, HIVE-3394.patch.2


 It's good to refactor the following classes' constructors with 
 builder/factory pattern. This should be done before HIVE-3072 so that 
 HIVE-3072 can refresh according and extend it to skewed use case:
 1. MStorageDescriptor.java
 2. ColumnInfo.java
 3. ParseContext.java
 4. CreateTableDesc.java
 5. ExprNodeColumnDesc.java

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (HIVE-3394) Refactor a few classes' constructors with creational patterns

2012-08-20 Thread Gang Tim Liu (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-3394?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Gang Tim Liu updated HIVE-3394:
---

Attachment: HIVE-3394.patch.2

 Refactor a few classes' constructors with creational patterns
 -

 Key: HIVE-3394
 URL: https://issues.apache.org/jira/browse/HIVE-3394
 Project: Hive
  Issue Type: Bug
Reporter: Gang Tim Liu
Assignee: Gang Tim Liu
Priority: Minor
 Attachments: HIVE-3394.patch.1, HIVE-3394.patch.2


 It's good to refactor the following classes' constructors with 
 builder/factory pattern. This should be done before HIVE-3072 so that 
 HIVE-3072 can refresh according and extend it to skewed use case:
 1. MStorageDescriptor.java
 2. ColumnInfo.java
 3. ParseContext.java
 4. CreateTableDesc.java
 5. ExprNodeColumnDesc.java

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Work started] (HIVE-3072) Hive List Bucketing - DDL support

2012-08-20 Thread Gang Tim Liu (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-3072?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Work on HIVE-3072 started by Gang Tim Liu.

 Hive List Bucketing - DDL support
 -

 Key: HIVE-3072
 URL: https://issues.apache.org/jira/browse/HIVE-3072
 Project: Hive
  Issue Type: New Feature
  Components: SQL
Reporter: Gang Tim Liu
Assignee: Gang Tim Liu
 Attachments: HIVE-3072.patch, HIVE-3072.patch.1, HIVE-3072.patch.2, 
 HIVE-3072.patch.3


 If a hive table column has skewed keys, query performance on non-skewed key 
 is always impacted. Hive List Bucketing feature will address it:
 https://cwiki.apache.org/Hive/listbucketing.html
 This jira issue will track DDL change for the feature. It's for both single 
 skewed column and multiple columns.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (HIVE-3072) Hive List Bucketing - DDL support

2012-08-20 Thread Gang Tim Liu (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-3072?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Gang Tim Liu updated HIVE-3072:
---

Attachment: HIVE-3072.patch.3

 Hive List Bucketing - DDL support
 -

 Key: HIVE-3072
 URL: https://issues.apache.org/jira/browse/HIVE-3072
 Project: Hive
  Issue Type: New Feature
  Components: SQL
Reporter: Gang Tim Liu
Assignee: Gang Tim Liu
 Attachments: HIVE-3072.patch, HIVE-3072.patch.1, HIVE-3072.patch.2, 
 HIVE-3072.patch.3


 If a hive table column has skewed keys, query performance on non-skewed key 
 is always impacted. Hive List Bucketing feature will address it:
 https://cwiki.apache.org/Hive/listbucketing.html
 This jira issue will track DDL change for the feature. It's for both single 
 skewed column and multiple columns.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (HIVE-3072) Hive List Bucketing - DDL support

2012-08-20 Thread Gang Tim Liu (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-3072?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Gang Tim Liu updated HIVE-3072:
---

Status: Patch Available  (was: In Progress)

Patch is available on both jira and phabricator.

 Hive List Bucketing - DDL support
 -

 Key: HIVE-3072
 URL: https://issues.apache.org/jira/browse/HIVE-3072
 Project: Hive
  Issue Type: New Feature
  Components: SQL
Reporter: Gang Tim Liu
Assignee: Gang Tim Liu
 Attachments: HIVE-3072.patch, HIVE-3072.patch.1, HIVE-3072.patch.2, 
 HIVE-3072.patch.3


 If a hive table column has skewed keys, query performance on non-skewed key 
 is always impacted. Hive List Bucketing feature will address it:
 https://cwiki.apache.org/Hive/listbucketing.html
 This jira issue will track DDL change for the feature. It's for both single 
 skewed column and multiple columns.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HIVE-3072) Hive List Bucketing - DDL support

2012-08-20 Thread Gang Tim Liu (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-3072?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13438451#comment-13438451
 ] 

Gang Tim Liu commented on HIVE-3072:


@Carl, thank you very much for quick review. will address them right now and 
get a patch tonight. thanks

 Hive List Bucketing - DDL support
 -

 Key: HIVE-3072
 URL: https://issues.apache.org/jira/browse/HIVE-3072
 Project: Hive
  Issue Type: New Feature
  Components: SQL
Reporter: Gang Tim Liu
Assignee: Gang Tim Liu
 Attachments: HIVE-3072.patch, HIVE-3072.patch.1, HIVE-3072.patch.2, 
 HIVE-3072.patch.3


 If a hive table column has skewed keys, query performance on non-skewed key 
 is always impacted. Hive List Bucketing feature will address it:
 https://cwiki.apache.org/Hive/listbucketing.html
 This jira issue will track DDL change for the feature. It's for both single 
 skewed column and multiple columns.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HIVE-3394) Refactor a few classes' constructors with creational patterns

2012-08-18 Thread Gang Tim Liu (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-3394?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13437265#comment-13437265
 ] 

Gang Tim Liu commented on HIVE-3394:


@Carl: thank you for giving me the opportunities to work on it. Initially have 
natural resistance due to work plan but quickly realize it is good thing to do 
and will make HIVE better which is my goal. I enjoy this refactor with them in 
mind. have a good weekend.

 Refactor a few classes' constructors with creational patterns
 -

 Key: HIVE-3394
 URL: https://issues.apache.org/jira/browse/HIVE-3394
 Project: Hive
  Issue Type: Bug
Reporter: Gang Tim Liu
Assignee: Gang Tim Liu
Priority: Minor
 Attachments: HIVE-3394.patch.1


 It's good to refactor the following classes' constructors with 
 builder/factory pattern. This should be done before HIVE-3072 so that 
 HIVE-3072 can refresh according and extend it to skewed use case:
 1. MStorageDescriptor.java
 2. ColumnInfo.java
 3. ParseContext.java
 4. CreateTableDesc.java
 5. ExprNodeColumnDesc.java

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (HIVE-3394) Refractor a few classes' constructor with builder pattern

2012-08-17 Thread Gang Tim Liu (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-3394?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Gang Tim Liu updated HIVE-3394:
---

Description: 
It's good to refactor the following classes' constructors with builder/factory 
pattern. This should be done before HIVE-3072 so that HIVE-3072 can refresh 
according and extend it to skewed use case:

1. MStorageDescriptor.java
2. ColumnInfo.java
3. ParseContext.java
4. CreateTableDesc.java
5. ExprNodeColumnDesc.java

Environment: (was: It's good to refactor the following classes' 
constructors with builder/factory pattern. This should be done before HIVE-3072 
so that HIVE-3072 can refresh according and extend it to skewed use case:

1. MStorageDescriptor.java
2. ColumnInfo.java
3. ParseContext.java
4. CreateTableDesc.java
5. ExprNodeColumnDesc.java)

 Refractor a few classes' constructor with builder pattern
 -

 Key: HIVE-3394
 URL: https://issues.apache.org/jira/browse/HIVE-3394
 Project: Hive
  Issue Type: Bug
Reporter: Gang Tim Liu
Assignee: Gang Tim Liu
Priority: Minor

 It's good to refactor the following classes' constructors with 
 builder/factory pattern. This should be done before HIVE-3072 so that 
 HIVE-3072 can refresh according and extend it to skewed use case:
 1. MStorageDescriptor.java
 2. ColumnInfo.java
 3. ParseContext.java
 4. CreateTableDesc.java
 5. ExprNodeColumnDesc.java

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Created] (HIVE-3394) Refractor a few classes' constructor with builder pattern

2012-08-17 Thread Gang Tim Liu (JIRA)
Gang Tim Liu created HIVE-3394:
--

 Summary: Refractor a few classes' constructor with builder pattern
 Key: HIVE-3394
 URL: https://issues.apache.org/jira/browse/HIVE-3394
 Project: Hive
  Issue Type: Bug
 Environment: It's good to refactor the following classes' constructors 
with builder/factory pattern. This should be done before HIVE-3072 so that 
HIVE-3072 can refresh according and extend it to skewed use case:

1. MStorageDescriptor.java
2. ColumnInfo.java
3. ParseContext.java
4. CreateTableDesc.java
5. ExprNodeColumnDesc.java
Reporter: Gang Tim Liu
Assignee: Gang Tim Liu
Priority: Minor




--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (HIVE-3072) Hive List Bucketing - DDL support

2012-08-17 Thread Gang Tim Liu (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-3072?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Gang Tim Liu updated HIVE-3072:
---

Attachment: HIVE-3072.patch.2

 Hive List Bucketing - DDL support
 -

 Key: HIVE-3072
 URL: https://issues.apache.org/jira/browse/HIVE-3072
 Project: Hive
  Issue Type: New Feature
  Components: SQL
Reporter: Gang Tim Liu
Assignee: Gang Tim Liu
 Attachments: HIVE-3072.patch, HIVE-3072.patch.1, HIVE-3072.patch.2


 If a hive table column has skewed keys, query performance on non-skewed key 
 is always impacted. Hive List Bucketing feature will address it:
 https://cwiki.apache.org/Hive/listbucketing.html
 This jira issue will track DDL change for the feature. It's for both single 
 skewed column and multiple columns.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Work started] (HIVE-3072) Hive List Bucketing - DDL support

2012-08-17 Thread Gang Tim Liu (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-3072?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Work on HIVE-3072 started by Gang Tim Liu.

 Hive List Bucketing - DDL support
 -

 Key: HIVE-3072
 URL: https://issues.apache.org/jira/browse/HIVE-3072
 Project: Hive
  Issue Type: New Feature
  Components: SQL
Reporter: Gang Tim Liu
Assignee: Gang Tim Liu
 Attachments: HIVE-3072.patch, HIVE-3072.patch.1, HIVE-3072.patch.2


 If a hive table column has skewed keys, query performance on non-skewed key 
 is always impacted. Hive List Bucketing feature will address it:
 https://cwiki.apache.org/Hive/listbucketing.html
 This jira issue will track DDL change for the feature. It's for both single 
 skewed column and multiple columns.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (HIVE-3072) Hive List Bucketing - DDL support

2012-08-17 Thread Gang Tim Liu (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-3072?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Gang Tim Liu updated HIVE-3072:
---

Status: Patch Available  (was: In Progress)

Patch available in phabricator and jira.

 Hive List Bucketing - DDL support
 -

 Key: HIVE-3072
 URL: https://issues.apache.org/jira/browse/HIVE-3072
 Project: Hive
  Issue Type: New Feature
  Components: SQL
Reporter: Gang Tim Liu
Assignee: Gang Tim Liu
 Attachments: HIVE-3072.patch, HIVE-3072.patch.1, HIVE-3072.patch.2


 If a hive table column has skewed keys, query performance on non-skewed key 
 is always impacted. Hive List Bucketing feature will address it:
 https://cwiki.apache.org/Hive/listbucketing.html
 This jira issue will track DDL change for the feature. It's for both single 
 skewed column and multiple columns.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (HIVE-3394) Refactor a few classes' constructors with creational patterns

2012-08-17 Thread Gang Tim Liu (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-3394?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Gang Tim Liu updated HIVE-3394:
---

Summary: Refactor a few classes' constructors with creational patterns  
(was: Refractor a few classes' constructor with builder pattern)

 Refactor a few classes' constructors with creational patterns
 -

 Key: HIVE-3394
 URL: https://issues.apache.org/jira/browse/HIVE-3394
 Project: Hive
  Issue Type: Bug
Reporter: Gang Tim Liu
Assignee: Gang Tim Liu
Priority: Minor

 It's good to refactor the following classes' constructors with 
 builder/factory pattern. This should be done before HIVE-3072 so that 
 HIVE-3072 can refresh according and extend it to skewed use case:
 1. MStorageDescriptor.java
 2. ColumnInfo.java
 3. ParseContext.java
 4. CreateTableDesc.java
 5. ExprNodeColumnDesc.java

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HIVE-3394) Refactor a few classes' constructors with creational patterns

2012-08-17 Thread Gang Tim Liu (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-3394?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13436969#comment-13436969
 ] 

Gang Tim Liu commented on HIVE-3394:


Problem
===
Several classes have many parameters in constructor and it doesn't scale well.

Solution

After reviewing design patterns (factory method/abstract factory/builder/static 
factory) in GOF and builder pattern in Joshua's Effective in JAVA (Item #2), 
feel builder pattern in effective in Java is a good fit since it is designed to 
solve our problem.

Details on each instance

1. MStorageDescriptor.java
   use builder pattern
2. ColumnInfo.java
   # of parameter is not big yet but the number of references to the 
constructors is huge: 40+. risk is high to change them all.
   solution is use javabean pattern (setter/getter) for new parameters in hive 
3072
3. ParseContext.java
   use builder pattern
4. CreateTableDesc.java
   use builder pattern
5. ExprNodeColumnDesc.java
   same as #2

 Refactor a few classes' constructors with creational patterns
 -

 Key: HIVE-3394
 URL: https://issues.apache.org/jira/browse/HIVE-3394
 Project: Hive
  Issue Type: Bug
Reporter: Gang Tim Liu
Assignee: Gang Tim Liu
Priority: Minor

 It's good to refactor the following classes' constructors with 
 builder/factory pattern. This should be done before HIVE-3072 so that 
 HIVE-3072 can refresh according and extend it to skewed use case:
 1. MStorageDescriptor.java
 2. ColumnInfo.java
 3. ParseContext.java
 4. CreateTableDesc.java
 5. ExprNodeColumnDesc.java

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HIVE-3394) Refactor a few classes' constructors with creational patterns

2012-08-17 Thread Gang Tim Liu (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-3394?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13436996#comment-13436996
 ] 

Gang Tim Liu commented on HIVE-3394:


second thought on MStorageDescriptor.java. It's model class for JDO and let's 
keep it simple. we can use setter/getter here.

 Refactor a few classes' constructors with creational patterns
 -

 Key: HIVE-3394
 URL: https://issues.apache.org/jira/browse/HIVE-3394
 Project: Hive
  Issue Type: Bug
Reporter: Gang Tim Liu
Assignee: Gang Tim Liu
Priority: Minor

 It's good to refactor the following classes' constructors with 
 builder/factory pattern. This should be done before HIVE-3072 so that 
 HIVE-3072 can refresh according and extend it to skewed use case:
 1. MStorageDescriptor.java
 2. ColumnInfo.java
 3. ParseContext.java
 4. CreateTableDesc.java
 5. ExprNodeColumnDesc.java

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HIVE-3394) Refactor a few classes' constructors with creational patterns

2012-08-17 Thread Gang Tim Liu (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-3394?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13437036#comment-13437036
 ] 

Gang Tim Liu commented on HIVE-3394:


https://reviews.facebook.net/D4719

 Refactor a few classes' constructors with creational patterns
 -

 Key: HIVE-3394
 URL: https://issues.apache.org/jira/browse/HIVE-3394
 Project: Hive
  Issue Type: Bug
Reporter: Gang Tim Liu
Assignee: Gang Tim Liu
Priority: Minor
 Attachments: HIVE-3394.patch.1


 It's good to refactor the following classes' constructors with 
 builder/factory pattern. This should be done before HIVE-3072 so that 
 HIVE-3072 can refresh according and extend it to skewed use case:
 1. MStorageDescriptor.java
 2. ColumnInfo.java
 3. ParseContext.java
 4. CreateTableDesc.java
 5. ExprNodeColumnDesc.java

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Work started] (HIVE-3394) Refactor a few classes' constructors with creational patterns

2012-08-17 Thread Gang Tim Liu (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-3394?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Work on HIVE-3394 started by Gang Tim Liu.

 Refactor a few classes' constructors with creational patterns
 -

 Key: HIVE-3394
 URL: https://issues.apache.org/jira/browse/HIVE-3394
 Project: Hive
  Issue Type: Bug
Reporter: Gang Tim Liu
Assignee: Gang Tim Liu
Priority: Minor
 Attachments: HIVE-3394.patch.1


 It's good to refactor the following classes' constructors with 
 builder/factory pattern. This should be done before HIVE-3072 so that 
 HIVE-3072 can refresh according and extend it to skewed use case:
 1. MStorageDescriptor.java
 2. ColumnInfo.java
 3. ParseContext.java
 4. CreateTableDesc.java
 5. ExprNodeColumnDesc.java

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (HIVE-3394) Refactor a few classes' constructors with creational patterns

2012-08-17 Thread Gang Tim Liu (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-3394?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Gang Tim Liu updated HIVE-3394:
---

Attachment: HIVE-3394.patch.1

 Refactor a few classes' constructors with creational patterns
 -

 Key: HIVE-3394
 URL: https://issues.apache.org/jira/browse/HIVE-3394
 Project: Hive
  Issue Type: Bug
Reporter: Gang Tim Liu
Assignee: Gang Tim Liu
Priority: Minor
 Attachments: HIVE-3394.patch.1


 It's good to refactor the following classes' constructors with 
 builder/factory pattern. This should be done before HIVE-3072 so that 
 HIVE-3072 can refresh according and extend it to skewed use case:
 1. MStorageDescriptor.java
 2. ColumnInfo.java
 3. ParseContext.java
 4. CreateTableDesc.java
 5. ExprNodeColumnDesc.java

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (HIVE-3394) Refactor a few classes' constructors with creational patterns

2012-08-17 Thread Gang Tim Liu (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-3394?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Gang Tim Liu updated HIVE-3394:
---

Status: Patch Available  (was: In Progress)

Patch is ready for review in both jira and phabricator.

 Refactor a few classes' constructors with creational patterns
 -

 Key: HIVE-3394
 URL: https://issues.apache.org/jira/browse/HIVE-3394
 Project: Hive
  Issue Type: Bug
Reporter: Gang Tim Liu
Assignee: Gang Tim Liu
Priority: Minor
 Attachments: HIVE-3394.patch.1


 It's good to refactor the following classes' constructors with 
 builder/factory pattern. This should be done before HIVE-3072 so that 
 HIVE-3072 can refresh according and extend it to skewed use case:
 1. MStorageDescriptor.java
 2. ColumnInfo.java
 3. ParseContext.java
 4. CreateTableDesc.java
 5. ExprNodeColumnDesc.java

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Created] (HIVE-3390) Hive List Bucketing - DDL support - DB upgrade script for Derby, Postgres, and Oracle

2012-08-16 Thread Gang Tim Liu (JIRA)
Gang Tim Liu created HIVE-3390:
--

 Summary: Hive List Bucketing - DDL support - DB upgrade script for 
Derby, Postgres, and Oracle
 Key: HIVE-3390
 URL: https://issues.apache.org/jira/browse/HIVE-3390
 Project: Hive
  Issue Type: New Feature
Reporter: Gang Tim Liu


This is a follow-up for HIVE-3072.

We need upgrade scripts for Derby, Postgres, and Oracle.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HIVE-3390) Hive List Bucketing - DDL support - DB upgrade script for Derby, Postgres, and Oracle

2012-08-16 Thread Gang Tim Liu (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-3390?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13436246#comment-13436246
 ] 

Gang Tim Liu commented on HIVE-3390:


@Carl, got you. do you have some instructions to generate upgrade script for 
derby, postgres and Orable? Are you using SchemaTool? If you have instructions, 
it will be big help. thanks

 Hive List Bucketing - DDL support - DB upgrade script for Derby, Postgres, 
 and Oracle
 -

 Key: HIVE-3390
 URL: https://issues.apache.org/jira/browse/HIVE-3390
 Project: Hive
  Issue Type: New Feature
Reporter: Gang Tim Liu

 This is a follow-up for HIVE-3072.
 We need upgrade scripts for Derby, Postgres, and Oracle.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HIVE-3390) Hive List Bucketing - DDL support - DB upgrade script for Derby, Postgres, and Oracle

2012-08-16 Thread Gang Tim Liu (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-3390?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13436253#comment-13436253
 ] 

Gang Tim Liu commented on HIVE-3390:


I see. will do it last.

I am making changes and target to get you a patch to review today.

thanks

 Hive List Bucketing - DDL support - DB upgrade script for Derby, Postgres, 
 and Oracle
 -

 Key: HIVE-3390
 URL: https://issues.apache.org/jira/browse/HIVE-3390
 Project: Hive
  Issue Type: New Feature
Reporter: Gang Tim Liu

 This is a follow-up for HIVE-3072.
 We need upgrade scripts for Derby, Postgres, and Oracle.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Work started] (HIVE-3072) Hive List Bucketing - DDL support

2012-08-15 Thread Gang Tim Liu (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-3072?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Work on HIVE-3072 started by Gang Tim Liu.

 Hive List Bucketing - DDL support
 -

 Key: HIVE-3072
 URL: https://issues.apache.org/jira/browse/HIVE-3072
 Project: Hive
  Issue Type: New Feature
  Components: SQL
Reporter: Gang Tim Liu
Assignee: Gang Tim Liu
 Attachments: HIVE-3072.patch


 If a hive table column has skewed keys, query performance on non-skewed key 
 is always impacted. Hive List Bucketing feature will address it:
 https://cwiki.apache.org/Hive/listbucketing.html
 This jira issue will track DDL change for the feature. It's for both single 
 skewed column and multiple columns.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (HIVE-3072) Hive List Bucketing - DDL support

2012-08-15 Thread Gang Tim Liu (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-3072?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Gang Tim Liu updated HIVE-3072:
---

Attachment: HIVE-3072.patch.1

 Hive List Bucketing - DDL support
 -

 Key: HIVE-3072
 URL: https://issues.apache.org/jira/browse/HIVE-3072
 Project: Hive
  Issue Type: New Feature
  Components: SQL
Reporter: Gang Tim Liu
Assignee: Gang Tim Liu
 Attachments: HIVE-3072.patch, HIVE-3072.patch.1


 If a hive table column has skewed keys, query performance on non-skewed key 
 is always impacted. Hive List Bucketing feature will address it:
 https://cwiki.apache.org/Hive/listbucketing.html
 This jira issue will track DDL change for the feature. It's for both single 
 skewed column and multiple columns.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Work started] (HIVE-3072) Hive List Bucketing - DDL support

2012-08-14 Thread Gang Tim Liu (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-3072?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Work on HIVE-3072 started by Gang Tim Liu.

 Hive List Bucketing - DDL support
 -

 Key: HIVE-3072
 URL: https://issues.apache.org/jira/browse/HIVE-3072
 Project: Hive
  Issue Type: New Feature
  Components: SQL
Reporter: Gang Tim Liu
Assignee: Gang Tim Liu

 If a hive table column has skewed keys, query performance on non-skewed key 
 is always impacted. Hive List Bucketing feature will address it:
 https://cwiki.apache.org/Hive/listbucketing.html
 This jira issue will track DDL change for the feature. It's for both single 
 skewed column and multiple columns.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (HIVE-3072) Hive List Bucketing - DDL support

2012-08-14 Thread Gang Tim Liu (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-3072?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Gang Tim Liu updated HIVE-3072:
---

Attachment: HIVE-3072.patch

 Hive List Bucketing - DDL support
 -

 Key: HIVE-3072
 URL: https://issues.apache.org/jira/browse/HIVE-3072
 Project: Hive
  Issue Type: New Feature
  Components: SQL
Reporter: Gang Tim Liu
Assignee: Gang Tim Liu
 Attachments: HIVE-3072.patch


 If a hive table column has skewed keys, query performance on non-skewed key 
 is always impacted. Hive List Bucketing feature will address it:
 https://cwiki.apache.org/Hive/listbucketing.html
 This jira issue will track DDL change for the feature. It's for both single 
 skewed column and multiple columns.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (HIVE-3072) Hive List Bucketing - DDL support

2012-08-14 Thread Gang Tim Liu (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-3072?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Gang Tim Liu updated HIVE-3072:
---

Status: Patch Available  (was: In Progress)

Patch is available for review.

 Hive List Bucketing - DDL support
 -

 Key: HIVE-3072
 URL: https://issues.apache.org/jira/browse/HIVE-3072
 Project: Hive
  Issue Type: New Feature
  Components: SQL
Reporter: Gang Tim Liu
Assignee: Gang Tim Liu
 Attachments: HIVE-3072.patch


 If a hive table column has skewed keys, query performance on non-skewed key 
 is always impacted. Hive List Bucketing feature will address it:
 https://cwiki.apache.org/Hive/listbucketing.html
 This jira issue will track DDL change for the feature. It's for both single 
 skewed column and multiple columns.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (HIVE-3372) Local mode errors point to the wrong file

2012-08-13 Thread Gang Tim Liu (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-3372?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Gang Tim Liu updated HIVE-3372:
---

  Priority: Trivial  (was: Major)
Issue Type: Task  (was: Bug)

Atlassian JIRA doesn't create the jira issue correctly:
1. no action to submit patch
2. no action to resolve issue etc

so far, no way to clean them up. temporarily move to task.

 Local mode errors point to the wrong file
 -

 Key: HIVE-3372
 URL: https://issues.apache.org/jira/browse/HIVE-3372
 Project: Hive
  Issue Type: Task
  Components: CLI
Reporter: Gang Tim Liu
Assignee: Gang Tim Liu
Priority: Trivial

 This is a classic example of a query that fails due to a bad udf in local 
 mode:
 Ended Job = job_local_0001 with errors
 Error during job, obtaining debugging information...
 Execution failed with exit status: 2
 Obtaining error information
 Task failed!
 Task ID:
 Stage-1
 Logs:
 /tmp/gang/hive.log
 FAILED: Execution Error, return code 2 from 
 org.apache.hadoop.hive.ql.exec.MapRedTask
 But actually local mr job in 
 /tmp/gang/gang_20120808181010_b716b5fb-5b8f-49d2-94e9-95bb8f465342.log

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (HIVE-3371) Local mode errors point to the wrong file

2012-08-13 Thread Gang Tim Liu (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-3371?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Gang Tim Liu updated HIVE-3371:
---

  Priority: Trivial  (was: Minor)
Issue Type: Task  (was: Bug)

Atlassian JIRA doesn't create the jira issue correctly:
1. no action to submit patch
2. no action to resolve issue etc

so far, no way to clean them up. temporarily move to task.

 Local mode errors point to the wrong file
 -

 Key: HIVE-3371
 URL: https://issues.apache.org/jira/browse/HIVE-3371
 Project: Hive
  Issue Type: Task
  Components: CLI
Reporter: Gang Tim Liu
Assignee: Gang Tim Liu
Priority: Trivial
 Attachments: hive-3371

  Time Spent: 48h
  Remaining Estimate: 0h

  This is a classic example of a query that fails due to a bad udf in local 
 mode:
 Ended Job = job_local_0001 with errors
 Error during job, obtaining debugging information...
 Execution failed with exit status: 2
 Obtaining error information
 Task failed!
 Task ID:
   Stage-1
 Logs:
 /tmp/gang/hive.log
 FAILED: Execution Error, return code 2 from 
 org.apache.hadoop.hive.ql.exec.MapRedTask
 But actually local mr job in 
 /tmp/gang/gang_20120808181010_b716b5fb-5b8f-49d2-94e9-95bb8f465342.log

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (HIVE-3347) Local mode errors point to the wrong file

2012-08-13 Thread Gang Tim Liu (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-3347?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Gang Tim Liu updated HIVE-3347:
---

  Priority: Trivial  (was: Major)
Issue Type: Task  (was: Bug)

Atlassian JIRA doesn't create the jira issue correctly:
1. no action to submit patch
2. no action to resolve issue etc

so far, no way to clean them up. temporarily move to task.

 Local mode errors point to the wrong file
 -

 Key: HIVE-3347
 URL: https://issues.apache.org/jira/browse/HIVE-3347
 Project: Hive
  Issue Type: Task
Reporter: Gang Tim Liu
Assignee: Gang Tim Liu
Priority: Trivial

 This is a classic example of a query that fails due to a bad udf in local 
 mode:
 Ended Job = job_local_0001 with errors
 Error during job, obtaining debugging information...
 Execution failed with exit status: 2
 Obtaining error information
 Task failed!
 Task ID:
   Stage-1
 Logs:
 /tmp/gang/hive.log
 FAILED: Execution Error, return code 2 from 
 org.apache.hadoop.hive.ql.exec.MapRedTask
 But actually local mr job in 
 /tmp/gang/gang_20120808181010_b716b5fb-5b8f-49d2-94e9-95bb8f465342.log

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (HIVE-3346) Local mode errors point to the wrong file

2012-08-13 Thread Gang Tim Liu (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-3346?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Gang Tim Liu updated HIVE-3346:
---

  Priority: Trivial  (was: Major)
Issue Type: Task  (was: Bug)

Atlassian JIRA doesn't create the jira issue correctly:
1. no action to submit patch
2. no action to resolve issue etc

so far, no way to clean them up. temporarily move to task.

 Local mode errors point to the wrong file
 -

 Key: HIVE-3346
 URL: https://issues.apache.org/jira/browse/HIVE-3346
 Project: Hive
  Issue Type: Task
Reporter: Gang Tim Liu
Assignee: Gang Tim Liu
Priority: Trivial

 This is a classic example of a query that fails due to a bad udf in local 
 mode:
 Ended Job = job_local_0001 with errors
 Error during job, obtaining debugging information...
 Execution failed with exit status: 2
 Obtaining error information
 Task failed!
 Task ID:
   Stage-1
 Logs:
 /tmp/gang/hive.log
 FAILED: Execution Error, return code 2 from 
 org.apache.hadoop.hive.ql.exec.MapRedTask
 But actually local mr job in 
 /tmp/gang/gang_20120808181010_b716b5fb-5b8f-49d2-94e9-95bb8f465342.log

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (HIVE-3370) Local mode errors point to the wrong file

2012-08-13 Thread Gang Tim Liu (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-3370?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Gang Tim Liu updated HIVE-3370:
---

  Priority: Trivial  (was: Major)
Issue Type: Task  (was: Bug)

Atlassian JIRA doesn't create the jira issue correctly:
1. no action to submit patch
2. no action to resolve issue etc

so far, no way to clean them up. temporarily move to task.

 Local mode errors point to the wrong file
 -

 Key: HIVE-3370
 URL: https://issues.apache.org/jira/browse/HIVE-3370
 Project: Hive
  Issue Type: Task
Reporter: Gang Tim Liu
Assignee: Gang Tim Liu
Priority: Trivial

  This is a classic example of a query that fails due to a bad udf in local 
 mode:
 Ended Job = job_local_0001 with errors
 Error during job, obtaining debugging information...
 Execution failed with exit status: 2
 Obtaining error information
 Task failed!
 Task ID:
   Stage-1
 Logs:
 /tmp/gang/hive.log
 FAILED: Execution Error, return code 2 from 
 org.apache.hadoop.hive.ql.exec.MapRedTask
 But actually local mr job in 
 /tmp/gang/gang_20120808181010_b716b5fb-5b8f-49d2-94e9-95bb8f465342.log

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Created] (HIVE-3376) Local mode errors point to the wrong file

2012-08-13 Thread Gang Tim Liu (JIRA)
Gang Tim Liu created HIVE-3376:
--

 Summary: Local mode errors point to the wrong file
 Key: HIVE-3376
 URL: https://issues.apache.org/jira/browse/HIVE-3376
 Project: Hive
  Issue Type: Bug
Reporter: Gang Tim Liu




This is a classic example of a query that fails due to a bad udf in local mode:

Ended Job = job_local_0001 with errors
Error during job, obtaining debugging information...
Execution failed with exit status: 2
Obtaining error information

Task failed!
Task ID:
Stage-1

Logs:

/tmp/gang/hive.log
FAILED: Execution Error, return code 2 from 
org.apache.hadoop.hive.ql.exec.MapRedTask

But actually local mr job in 
/tmp/gang/gang_20120808181010_b716b5fb-5b8f-49d2-94e9-95bb8f465342.log


--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Assigned] (HIVE-3376) Local mode errors point to the wrong file

2012-08-13 Thread Gang Tim Liu (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-3376?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Gang Tim Liu reassigned HIVE-3376:
--

Assignee: Gang Tim Liu

 Local mode errors point to the wrong file
 -

 Key: HIVE-3376
 URL: https://issues.apache.org/jira/browse/HIVE-3376
 Project: Hive
  Issue Type: Bug
Reporter: Gang Tim Liu
Assignee: Gang Tim Liu
 Attachments: hive-3371


 This is a classic example of a query that fails due to a bad udf in local 
 mode:
 Ended Job = job_local_0001 with errors
 Error during job, obtaining debugging information...
 Execution failed with exit status: 2
 Obtaining error information
 Task failed!
 Task ID:
 Stage-1
 Logs:
 /tmp/gang/hive.log
 FAILED: Execution Error, return code 2 from 
 org.apache.hadoop.hive.ql.exec.MapRedTask
 But actually local mr job in 
 /tmp/gang/gang_20120808181010_b716b5fb-5b8f-49d2-94e9-95bb8f465342.log

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Work started] (HIVE-3376) Local mode errors point to the wrong file

2012-08-13 Thread Gang Tim Liu (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-3376?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Work on HIVE-3376 started by Gang Tim Liu.

 Local mode errors point to the wrong file
 -

 Key: HIVE-3376
 URL: https://issues.apache.org/jira/browse/HIVE-3376
 Project: Hive
  Issue Type: Bug
Reporter: Gang Tim Liu
Assignee: Gang Tim Liu
 Attachments: hive-3371


 This is a classic example of a query that fails due to a bad udf in local 
 mode:
 Ended Job = job_local_0001 with errors
 Error during job, obtaining debugging information...
 Execution failed with exit status: 2
 Obtaining error information
 Task failed!
 Task ID:
 Stage-1
 Logs:
 /tmp/gang/hive.log
 FAILED: Execution Error, return code 2 from 
 org.apache.hadoop.hive.ql.exec.MapRedTask
 But actually local mr job in 
 /tmp/gang/gang_20120808181010_b716b5fb-5b8f-49d2-94e9-95bb8f465342.log

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HIVE-3376) Local mode errors point to the wrong file

2012-08-13 Thread Gang Tim Liu (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-3376?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13433316#comment-13433316
 ] 

Gang Tim Liu commented on HIVE-3376:


https://reviews.facebook.net/D4575

 Local mode errors point to the wrong file
 -

 Key: HIVE-3376
 URL: https://issues.apache.org/jira/browse/HIVE-3376
 Project: Hive
  Issue Type: Bug
Reporter: Gang Tim Liu
Assignee: Gang Tim Liu
 Attachments: hive-3371


 This is a classic example of a query that fails due to a bad udf in local 
 mode:
 Ended Job = job_local_0001 with errors
 Error during job, obtaining debugging information...
 Execution failed with exit status: 2
 Obtaining error information
 Task failed!
 Task ID:
 Stage-1
 Logs:
 /tmp/gang/hive.log
 FAILED: Execution Error, return code 2 from 
 org.apache.hadoop.hive.ql.exec.MapRedTask
 But actually local mr job in 
 /tmp/gang/gang_20120808181010_b716b5fb-5b8f-49d2-94e9-95bb8f465342.log

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (HIVE-3376) Local mode errors point to the wrong file

2012-08-13 Thread Gang Tim Liu (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-3376?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Gang Tim Liu updated HIVE-3376:
---

Attachment: hive-3371

 Local mode errors point to the wrong file
 -

 Key: HIVE-3376
 URL: https://issues.apache.org/jira/browse/HIVE-3376
 Project: Hive
  Issue Type: Bug
Reporter: Gang Tim Liu
Assignee: Gang Tim Liu
 Attachments: hive-3371


 This is a classic example of a query that fails due to a bad udf in local 
 mode:
 Ended Job = job_local_0001 with errors
 Error during job, obtaining debugging information...
 Execution failed with exit status: 2
 Obtaining error information
 Task failed!
 Task ID:
 Stage-1
 Logs:
 /tmp/gang/hive.log
 FAILED: Execution Error, return code 2 from 
 org.apache.hadoop.hive.ql.exec.MapRedTask
 But actually local mr job in 
 /tmp/gang/gang_20120808181010_b716b5fb-5b8f-49d2-94e9-95bb8f465342.log

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (HIVE-3376) Local mode errors point to the wrong file

2012-08-13 Thread Gang Tim Liu (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-3376?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Gang Tim Liu updated HIVE-3376:
---

Status: Patch Available  (was: In Progress)

 Local mode errors point to the wrong file
 -

 Key: HIVE-3376
 URL: https://issues.apache.org/jira/browse/HIVE-3376
 Project: Hive
  Issue Type: Bug
Reporter: Gang Tim Liu
Assignee: Gang Tim Liu
 Attachments: hive-3371


 This is a classic example of a query that fails due to a bad udf in local 
 mode:
 Ended Job = job_local_0001 with errors
 Error during job, obtaining debugging information...
 Execution failed with exit status: 2
 Obtaining error information
 Task failed!
 Task ID:
 Stage-1
 Logs:
 /tmp/gang/hive.log
 FAILED: Execution Error, return code 2 from 
 org.apache.hadoop.hive.ql.exec.MapRedTask
 But actually local mr job in 
 /tmp/gang/gang_20120808181010_b716b5fb-5b8f-49d2-94e9-95bb8f465342.log

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HIVE-3376) Local mode errors point to the wrong file

2012-08-13 Thread Gang Tim Liu (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-3376?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13433325#comment-13433325
 ] 

Gang Tim Liu commented on HIVE-3376:


While executing in local mode, the process to collect failed job information is 
a different process from running local MR job. Each process reads a different 
log4j property file. One points to hive.log and another is MR job log.
No communication on 2 processes yet on job log. There are some ways like 
external files etc.
Consider the bug impact itself (log file location), easy way is to pass MR job 
stream and pick up log file location and output in CLI stream.

 Local mode errors point to the wrong file
 -

 Key: HIVE-3376
 URL: https://issues.apache.org/jira/browse/HIVE-3376
 Project: Hive
  Issue Type: Bug
Reporter: Gang Tim Liu
Assignee: Gang Tim Liu
 Attachments: hive-3371


 This is a classic example of a query that fails due to a bad udf in local 
 mode:
 Ended Job = job_local_0001 with errors
 Error during job, obtaining debugging information...
 Execution failed with exit status: 2
 Obtaining error information
 Task failed!
 Task ID:
 Stage-1
 Logs:
 /tmp/gang/hive.log
 FAILED: Execution Error, return code 2 from 
 org.apache.hadoop.hive.ql.exec.MapRedTask
 But actually local mr job in 
 /tmp/gang/gang_20120808181010_b716b5fb-5b8f-49d2-94e9-95bb8f465342.log

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HIVE-3072) Hive List Bucketing - DDL support

2012-08-13 Thread Gang Tim Liu (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-3072?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13433326#comment-13433326
 ] 

Gang Tim Liu commented on HIVE-3072:


https://reviews.facebook.net/D4599

 Hive List Bucketing - DDL support
 -

 Key: HIVE-3072
 URL: https://issues.apache.org/jira/browse/HIVE-3072
 Project: Hive
  Issue Type: New Feature
  Components: SQL
Reporter: Gang Tim Liu
Assignee: Gang Tim Liu

 If a hive table column has skewed keys, query performance on non-skewed key 
 is always impacted. Hive List Bucketing feature will address it:
 https://cwiki.apache.org/Hive/listbucketing.html
 This jira issue will track DDL change for the feature. It's for both single 
 skewed column and multiple columns.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HIVE-3072) Hive List Bucketing - DDL support

2012-08-13 Thread Gang Tim Liu (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-3072?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13433328#comment-13433328
 ] 

Gang Tim Liu commented on HIVE-3072:


There are a few reasons to release a DDL as a patch:
1. the content is getting bigger. might be better to get it review so that we 
can capture issues earlier.
2. the skewed grammar will not only benefit list bucketing feature but also 
other feature like skewed join. releasing grammar patch will unblock others' 
development.

 Hive List Bucketing - DDL support
 -

 Key: HIVE-3072
 URL: https://issues.apache.org/jira/browse/HIVE-3072
 Project: Hive
  Issue Type: New Feature
  Components: SQL
Reporter: Gang Tim Liu
Assignee: Gang Tim Liu

 If a hive table column has skewed keys, query performance on non-skewed key 
 is always impacted. Hive List Bucketing feature will address it:
 https://cwiki.apache.org/Hive/listbucketing.html
 This jira issue will track DDL change for the feature. It's for both single 
 skewed column and multiple columns.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HIVE-3343) Hive: Query misaligned result for Group by followed by Join with filter and skip a group-by result

2012-08-13 Thread Gang Tim Liu (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-3343?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13433892#comment-13433892
 ] 

Gang Tim Liu commented on HIVE-3343:


thanks Namit

 Hive: Query misaligned result for Group by followed by Join with filter and 
 skip a group-by result
 --

 Key: HIVE-3343
 URL: https://issues.apache.org/jira/browse/HIVE-3343
 Project: Hive
  Issue Type: Bug
  Components: Query Processor
Reporter: Gang Tim Liu
Assignee: Gang Tim Liu
 Attachments: hive-3343.patch


 This simple Hive query would generate wrong result:
 select a.key, b.k2, b.k3
 from src a
 join (
   select key, 
  min(key) as k,
  min(key)+1 as k1,
  min(key)+2 as k2,
  min(key)+3 as k3
   from src
   group by key
 ) b
 on a.key=b.key and b.k1  5;
 0   3.0 1.0
 0   3.0 1.0
 0   3.0 1.0
 2   5.0 3.0
 The right result is:
 0 2.0 3.0
 0 2.0 3.0
 0 2.0 3.0
 2 4.0 5.0

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HIVE-3072) Hive List Bucketing - DDL support

2012-08-13 Thread Gang Tim Liu (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-3072?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13433904#comment-13433904
 ] 

Gang Tim Liu commented on HIVE-3072:


yes, I am.

 Hive List Bucketing - DDL support
 -

 Key: HIVE-3072
 URL: https://issues.apache.org/jira/browse/HIVE-3072
 Project: Hive
  Issue Type: New Feature
  Components: SQL
Reporter: Gang Tim Liu
Assignee: Gang Tim Liu

 If a hive table column has skewed keys, query performance on non-skewed key 
 is always impacted. Hive List Bucketing feature will address it:
 https://cwiki.apache.org/Hive/listbucketing.html
 This jira issue will track DDL change for the feature. It's for both single 
 skewed column and multiple columns.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HIVE-3371) Local mode errors point to the wrong file

2012-08-09 Thread Gang Tim Liu (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-3371?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13432283#comment-13432283
 ] 

Gang Tim Liu commented on HIVE-3371:


https://reviews.facebook.net/D4575

 Local mode errors point to the wrong file
 -

 Key: HIVE-3371
 URL: https://issues.apache.org/jira/browse/HIVE-3371
 Project: Hive
  Issue Type: Bug
  Components: CLI
Reporter: Gang Tim Liu
Assignee: Gang Tim Liu
Priority: Minor

  This is a classic example of a query that fails due to a bad udf in local 
 mode:
 Ended Job = job_local_0001 with errors
 Error during job, obtaining debugging information...
 Execution failed with exit status: 2
 Obtaining error information
 Task failed!
 Task ID:
   Stage-1
 Logs:
 /tmp/gang/hive.log
 FAILED: Execution Error, return code 2 from 
 org.apache.hadoop.hive.ql.exec.MapRedTask
 But actually local mr job in 
 /tmp/gang/gang_20120808181010_b716b5fb-5b8f-49d2-94e9-95bb8f465342.log

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (HIVE-3371) Local mode errors point to the wrong file

2012-08-09 Thread Gang Tim Liu (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-3371?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Gang Tim Liu updated HIVE-3371:
---

Attachment: hive-3371

 Local mode errors point to the wrong file
 -

 Key: HIVE-3371
 URL: https://issues.apache.org/jira/browse/HIVE-3371
 Project: Hive
  Issue Type: Bug
  Components: CLI
Reporter: Gang Tim Liu
Assignee: Gang Tim Liu
Priority: Minor
 Attachments: hive-3371


  This is a classic example of a query that fails due to a bad udf in local 
 mode:
 Ended Job = job_local_0001 with errors
 Error during job, obtaining debugging information...
 Execution failed with exit status: 2
 Obtaining error information
 Task failed!
 Task ID:
   Stage-1
 Logs:
 /tmp/gang/hive.log
 FAILED: Execution Error, return code 2 from 
 org.apache.hadoop.hive.ql.exec.MapRedTask
 But actually local mr job in 
 /tmp/gang/gang_20120808181010_b716b5fb-5b8f-49d2-94e9-95bb8f465342.log

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Work logged] (HIVE-3371) Local mode errors point to the wrong file

2012-08-09 Thread Gang Tim Liu (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-3371?page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel\#worklog-{worklog.getId()}
 ]

Gang Tim Liu logged work on HIVE-3371:
--

Author: Gang Tim Liu
Created on: 09/Aug/12 23:08
Start Date: 09/Aug/12 23:08
Worklog Time Spent: 48h 

Issue Time Tracking
---

Worklog Id: (was: 13834)
Time Spent: 48h
Remaining Estimate: 0h

 Local mode errors point to the wrong file
 -

 Key: HIVE-3371
 URL: https://issues.apache.org/jira/browse/HIVE-3371
 Project: Hive
  Issue Type: Bug
  Components: CLI
Reporter: Gang Tim Liu
Assignee: Gang Tim Liu
Priority: Minor
 Attachments: hive-3371

  Time Spent: 48h
  Remaining Estimate: 0h

  This is a classic example of a query that fails due to a bad udf in local 
 mode:
 Ended Job = job_local_0001 with errors
 Error during job, obtaining debugging information...
 Execution failed with exit status: 2
 Obtaining error information
 Task failed!
 Task ID:
   Stage-1
 Logs:
 /tmp/gang/hive.log
 FAILED: Execution Error, return code 2 from 
 org.apache.hadoop.hive.ql.exec.MapRedTask
 But actually local mr job in 
 /tmp/gang/gang_20120808181010_b716b5fb-5b8f-49d2-94e9-95bb8f465342.log

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




Atlassian JIRA Issue.

2012-08-09 Thread Gang Tim Liu
Dear all,

Have you seen problem while creating jira issue today?

I tried to create jira issue and it showed exception in window but
actually issue is created. The created issues are abnormal: it missed
actions like submit patch.

It also doesn't allow me to close/resolve them.

So, you will see a few of them for the same issue:

HIVE-3372 https://issues.apache.org/jira/browse/HIVE-3372
 HIVE-3371 https://issues.apache.org/jira/browse/HIVE-3371
 HIVE-3370 https://issues.apache.org/jira/browse/HIVE-3370
 HIVE-3369 https://issues.apache.org/jira/browse/HIVE-3369



Thanks

Tim

On 8/9/12 4:09 PM, Gang Tim Liu (JIRA) j...@apache.org wrote:


 [ 
https://issues.apache.org/jira/browse/HIVE-3371?page=com.atlassian.jira.pl
ugin.system.issuetabpanels:worklog-tabpanel\#worklog-{worklog.getId()} ]

Gang Tim Liu logged work on HIVE-3371:
--

Author: Gang Tim Liu
Created on: 09/Aug/12 23:08
Start Date: 09/Aug/12 23:08
Worklog Time Spent: 48h

Issue Time Tracking
---

Worklog Id: (was: 13834)
Time Spent: 48h
Remaining Estimate: 0h

 Local mode errors point to the wrong file
 -

 Key: HIVE-3371
 URL: https://issues.apache.org/jira/browse/HIVE-3371
 Project: Hive
  Issue Type: Bug
  Components: CLI
Reporter: Gang Tim Liu
Assignee: Gang Tim Liu
Priority: Minor
 Attachments: hive-3371

  Time Spent: 48h
  Remaining Estimate: 0h

  This is a classic example of a query that fails due to a bad udf in
local mode:
 Ended Job = job_local_0001 with errors
 Error during job, obtaining debugging information...
 Execution failed with exit status: 2
 Obtaining error information
 Task failed!
 Task ID:
   Stage-1
 Logs:
 /tmp/gang/hive.log
 FAILED: Execution Error, return code 2 from
org.apache.hadoop.hive.ql.exec.MapRedTask
 But actually local mr job in
/tmp/gang/gang_20120808181010_b716b5fb-5b8f-49d2-94e9-95bb8f465342.log

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA
administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira





[jira] [Updated] (HIVE-3343) Hive: Query misaligned result for Group by followed by Join with filter and skip a group-by result

2012-08-08 Thread Gang Tim Liu (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-3343?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Gang Tim Liu updated HIVE-3343:
---

Attachment: hive-3343.patch

 Hive: Query misaligned result for Group by followed by Join with filter and 
 skip a group-by result
 --

 Key: HIVE-3343
 URL: https://issues.apache.org/jira/browse/HIVE-3343
 Project: Hive
  Issue Type: Bug
  Components: Query Processor
Reporter: Gang Tim Liu
Assignee: Gang Tim Liu
 Attachments: hive-3343.patch


 This simple Hive query would generate wrong result:
 select a.key, b.k2, b.k3
 from src a
 join (
   select key, 
  min(key) as k,
  min(key)+1 as k1,
  min(key)+2 as k2,
  min(key)+3 as k3
   from src
   group by key
 ) b
 on a.key=b.key and b.k1  5;
 0   3.0 1.0
 0   3.0 1.0
 0   3.0 1.0
 2   5.0 3.0
 The right result is:
 0 2.0 3.0
 0 2.0 3.0
 0 2.0 3.0
 2 4.0 5.0

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (HIVE-3343) Hive: Query misaligned result for Group by followed by Join with filter and skip a group-by result

2012-08-08 Thread Gang Tim Liu (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-3343?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Gang Tim Liu updated HIVE-3343:
---

Status: Patch Available  (was: Open)

patch is submitted.

 Hive: Query misaligned result for Group by followed by Join with filter and 
 skip a group-by result
 --

 Key: HIVE-3343
 URL: https://issues.apache.org/jira/browse/HIVE-3343
 Project: Hive
  Issue Type: Bug
  Components: Query Processor
Reporter: Gang Tim Liu
Assignee: Gang Tim Liu
 Attachments: hive-3343.patch


 This simple Hive query would generate wrong result:
 select a.key, b.k2, b.k3
 from src a
 join (
   select key, 
  min(key) as k,
  min(key)+1 as k1,
  min(key)+2 as k2,
  min(key)+3 as k3
   from src
   group by key
 ) b
 on a.key=b.key and b.k1  5;
 0   3.0 1.0
 0   3.0 1.0
 0   3.0 1.0
 2   5.0 3.0
 The right result is:
 0 2.0 3.0
 0 2.0 3.0
 0 2.0 3.0
 2 4.0 5.0

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Created] (HIVE-3342) Hive: Query misaligned result for Group by followed by Join with filter and skip a group-by result

2012-08-07 Thread Gang Tim Liu (JIRA)
Gang Tim Liu created HIVE-3342:
--

 Summary: Hive: Query misaligned result for Group by followed by 
Join with filter and skip a group-by result
 Key: HIVE-3342
 URL: https://issues.apache.org/jira/browse/HIVE-3342
 Project: Hive
  Issue Type: Bug
  Components: Query Processor
Reporter: Gang Tim Liu


This simple Hive query would generate wrong result:

select a.key, b.k2, b.k3
from src a
join (
  select key, 
 min(key) as k,
 min(key)+1 as k1,
 min(key)+2 as k2,
 min(key)+3 as k3
  from src
  group by key
) b
on a.key=b.key and b.k1  5;


0   3.0 1.0
0   3.0 1.0
0   3.0 1.0
2   5.0 3.0

The right result is:
0   2.0 3.0
0   2.0 3.0
0   2.0 3.0
2   4.0 5.0

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Created] (HIVE-3343) Hive: Query misaligned result for Group by followed by Join with filter and skip a group-by result

2012-08-07 Thread Gang Tim Liu (JIRA)
Gang Tim Liu created HIVE-3343:
--

 Summary: Hive: Query misaligned result for Group by followed by 
Join with filter and skip a group-by result
 Key: HIVE-3343
 URL: https://issues.apache.org/jira/browse/HIVE-3343
 Project: Hive
  Issue Type: Bug
  Components: Query Processor
Reporter: Gang Tim Liu
Assignee: Gang Tim Liu


This simple Hive query would generate wrong result:

select a.key, b.k2, b.k3
from src a
join (
  select key, 
 min(key) as k,
 min(key)+1 as k1,
 min(key)+2 as k2,
 min(key)+3 as k3
  from src
  group by key
) b
on a.key=b.key and b.k1  5;


0   3.0 1.0
0   3.0 1.0
0   3.0 1.0
2   5.0 3.0

The right result is:
0   2.0 3.0
0   2.0 3.0
0   2.0 3.0
2   4.0 5.0

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HIVE-3343) Hive: Query misaligned result for Group by followed by Join with filter and skip a group-by result

2012-08-07 Thread Gang Tim Liu (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-3343?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13430757#comment-13430757
 ] 

Gang Tim Liu commented on HIVE-3343:


If we remove b.k1  5, it will generate right result.

explain extended shows Select Operator has different order of columns (_col0, 
_col3, _col4, _col2) from filter operator's (_col0,_col2,_col3,_col4). Here is 
snapshot:

Reduce Operator Tree:
Group By Operator
  ...
  outputColumnNames: _col0, _col1
  Select Operator
...
outputColumnNames: _col0, _col3, _col4, _col2
Filter Operator
  ...
  predicate:
  expr: (_col2  5.0)
  type: boolean
  File Output Operator
...
table:
...
properties:
  columns _col0,_col2,_col3,_col4
  columns.types string,double,double,double
  escape.delim \
...

 Hive: Query misaligned result for Group by followed by Join with filter and 
 skip a group-by result
 --

 Key: HIVE-3343
 URL: https://issues.apache.org/jira/browse/HIVE-3343
 Project: Hive
  Issue Type: Bug
  Components: Query Processor
Reporter: Gang Tim Liu
Assignee: Gang Tim Liu

 This simple Hive query would generate wrong result:
 select a.key, b.k2, b.k3
 from src a
 join (
   select key, 
  min(key) as k,
  min(key)+1 as k1,
  min(key)+2 as k2,
  min(key)+3 as k3
   from src
   group by key
 ) b
 on a.key=b.key and b.k1  5;
 0   3.0 1.0
 0   3.0 1.0
 0   3.0 1.0
 2   5.0 3.0
 The right result is:
 0 2.0 3.0
 0 2.0 3.0
 0 2.0 3.0
 2 4.0 5.0

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HIVE-3343) Hive: Query misaligned result for Group by followed by Join with filter and skip a group-by result

2012-08-07 Thread Gang Tim Liu (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-3343?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13430761#comment-13430761
 ] 

Gang Tim Liu commented on HIVE-3343:


root-cause is ColumnPrunerFilterProc doesn't perserve order of prunned column 
list (filterOpPrunedColLists). It causes

outputColumnNames: _col0, _col3, _col4, _col2

It should be outputColumnNames: _col0, _col2, _col3, _col4


 Hive: Query misaligned result for Group by followed by Join with filter and 
 skip a group-by result
 --

 Key: HIVE-3343
 URL: https://issues.apache.org/jira/browse/HIVE-3343
 Project: Hive
  Issue Type: Bug
  Components: Query Processor
Reporter: Gang Tim Liu
Assignee: Gang Tim Liu

 This simple Hive query would generate wrong result:
 select a.key, b.k2, b.k3
 from src a
 join (
   select key, 
  min(key) as k,
  min(key)+1 as k1,
  min(key)+2 as k2,
  min(key)+3 as k3
   from src
   group by key
 ) b
 on a.key=b.key and b.k1  5;
 0   3.0 1.0
 0   3.0 1.0
 0   3.0 1.0
 2   5.0 3.0
 The right result is:
 0 2.0 3.0
 0 2.0 3.0
 0 2.0 3.0
 2 4.0 5.0

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Assigned] (HIVE-3026) List Bucketing in Hive

2012-07-30 Thread Gang Tim Liu (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-3026?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Gang Tim Liu reassigned HIVE-3026:
--

Assignee: Gang Tim Liu

 List Bucketing in Hive
 --

 Key: HIVE-3026
 URL: https://issues.apache.org/jira/browse/HIVE-3026
 Project: Hive
  Issue Type: New Feature
Reporter: Namit Jain
Assignee: Gang Tim Liu

 Details are at:
 https://cwiki.apache.org/confluence/display/Hive/ListBucketing
 Please comment

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HIVE-3026) List Bucketing in Hive

2012-07-30 Thread Gang Tim Liu (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-3026?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13424988#comment-13424988
 ] 

Gang Tim Liu commented on HIVE-3026:


I am working on it and will release the patch soon.

 List Bucketing in Hive
 --

 Key: HIVE-3026
 URL: https://issues.apache.org/jira/browse/HIVE-3026
 Project: Hive
  Issue Type: New Feature
Reporter: Namit Jain
Assignee: Gang Tim Liu

 Details are at:
 https://cwiki.apache.org/confluence/display/Hive/ListBucketing
 Please comment

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




Re: Lisk Bucketing DDL Patch

2012-07-27 Thread Gang Tim Liu
Yes, Namit has a great summary. thanks

On 7/27/12 9:09 PM, Namit Jain nj...@fb.com wrote:

Yes, that patch will become quiet big to be done a single shot.

Moreover, the skew information can be used by a variety of use-cases.

1. List Bucketing
2. Skew Joins: https://cwiki.apache.org/Hive/skewed-join-optimization.html
3. Another variant of skew joins:
https://issues.apache.org/jira/browse/HIVE-3286

So, the skew may not be limited to be used for list bucketing only.

So, it might be simpler to split into DDL and DML support.

DDL will be common to all the use-cases who want to use/store skew
information. 

Each use-case can implement the DML/Query separately.


Thanks,
-namit


On 7/28/12 7:07 AM, Carl Steinbach c...@cloudera.com wrote:

 Since we are close to release the first patch DDL.

In a comment on the design doc you said that the first phase would
involve
implementing this feature for a single-column end-to-end (DML+DDL). Has
that plan changed?

Thanks.

Carl

On Wed, Jul 25, 2012 at 12:31 AM, Gang Tim Liu g...@fb.com wrote:

 Dear all hive developers,

 Please review the documentation:

 https://cwiki.apache.org/confluence/display/Hive/ListBucketing

 Since we are close to release the first patch DDL.

 We will continue to update the wiki about new information and in the
 meanwhile want to collect your feedback.

 Thanks

 Tim






Lisk Bucketing DDL Patch

2012-07-25 Thread Gang Tim Liu
Dear all hive developers,

Please review the documentation:

https://cwiki.apache.org/confluence/display/Hive/ListBucketing

Since we are close to release the first patch DDL.

We will continue to update the wiki about new information and in the
meanwhile want to collect your feedback.

Thanks

Tim



[jira] [Commented] (HIVE-3251) Hive doesn't remove scrach directories while killing running MR job

2012-07-19 Thread Gang Tim Liu (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-3251?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13418515#comment-13418515
 ] 

Gang Tim Liu commented on HIVE-3251:


Namit, thank you very much for review and commit.

 Hive doesn't remove scrach directories while killing running MR job
 ---

 Key: HIVE-3251
 URL: https://issues.apache.org/jira/browse/HIVE-3251
 Project: Hive
  Issue Type: Bug
  Components: Server Infrastructure
Reporter: Gang Tim Liu
Assignee: Gang Tim Liu
 Attachments: HIVE-3151.patch


 While killing running MR job, hive doesn't clean up scratch directory 
 (mapred.cache.files). So that, afterwards, scratch directory is left there in 
 hdfs. HDFS name node doesn't know it and try to do lease recovery. while such 
 instances happen more, it will eventually crash namenode.
 The fix is to leverage hdfs clean up functionality. While creating scratch 
 dirs, hive registers it to hdfs cleanup hook. While killing happens, hdfs 
 will clean them up.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Created] (HIVE-3251) Hive doesn't remove scrach directories while killing running MR job

2012-07-11 Thread Gang Tim Liu (JIRA)
Gang Tim Liu created HIVE-3251:
--

 Summary: Hive doesn't remove scrach directories while killing 
running MR job
 Key: HIVE-3251
 URL: https://issues.apache.org/jira/browse/HIVE-3251
 Project: Hive
  Issue Type: Bug
  Components: Server Infrastructure
Reporter: Gang Tim Liu
Assignee: Gang Tim Liu


While killing running MR job, hive doesn't clean up scratch directory 
(mapred.cache.files). So that, afterwards, scratch directory is left there in 
hdfs. HDFS name node doesn't know it and try to do lease recovery. while such 
instances happen more, it will eventually crash namenode.

The fix is to leverage hdfs clean up functionality. While creating scratch 
dirs, hive registers it to hdfs cleanup hook. While killing happens, hdfs will 
clean them up.



--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Work started] (HIVE-3251) Hive doesn't remove scrach directories while killing running MR job

2012-07-11 Thread Gang Tim Liu (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-3251?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Work on HIVE-3251 started by Gang Tim Liu.

 Hive doesn't remove scrach directories while killing running MR job
 ---

 Key: HIVE-3251
 URL: https://issues.apache.org/jira/browse/HIVE-3251
 Project: Hive
  Issue Type: Bug
  Components: Server Infrastructure
Reporter: Gang Tim Liu
Assignee: Gang Tim Liu

 While killing running MR job, hive doesn't clean up scratch directory 
 (mapred.cache.files). So that, afterwards, scratch directory is left there in 
 hdfs. HDFS name node doesn't know it and try to do lease recovery. while such 
 instances happen more, it will eventually crash namenode.
 The fix is to leverage hdfs clean up functionality. While creating scratch 
 dirs, hive registers it to hdfs cleanup hook. While killing happens, hdfs 
 will clean them up.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HIVE-3251) Hive doesn't remove scrach directories while killing running MR job

2012-07-11 Thread Gang Tim Liu (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-3251?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13412280#comment-13412280
 ] 

Gang Tim Liu commented on HIVE-3251:


diff is ready to review https://reviews.facebook.net/D4089

 Hive doesn't remove scrach directories while killing running MR job
 ---

 Key: HIVE-3251
 URL: https://issues.apache.org/jira/browse/HIVE-3251
 Project: Hive
  Issue Type: Bug
  Components: Server Infrastructure
Reporter: Gang Tim Liu
Assignee: Gang Tim Liu

 While killing running MR job, hive doesn't clean up scratch directory 
 (mapred.cache.files). So that, afterwards, scratch directory is left there in 
 hdfs. HDFS name node doesn't know it and try to do lease recovery. while such 
 instances happen more, it will eventually crash namenode.
 The fix is to leverage hdfs clean up functionality. While creating scratch 
 dirs, hive registers it to hdfs cleanup hook. While killing happens, hdfs 
 will clean them up.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (HIVE-3251) Hive doesn't remove scrach directories while killing running MR job

2012-07-11 Thread Gang Tim Liu (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-3251?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Gang Tim Liu updated HIVE-3251:
---

Status: Patch Available  (was: In Progress)

https://reviews.facebook.net/D4089

 Hive doesn't remove scrach directories while killing running MR job
 ---

 Key: HIVE-3251
 URL: https://issues.apache.org/jira/browse/HIVE-3251
 Project: Hive
  Issue Type: Bug
  Components: Server Infrastructure
Reporter: Gang Tim Liu
Assignee: Gang Tim Liu

 While killing running MR job, hive doesn't clean up scratch directory 
 (mapred.cache.files). So that, afterwards, scratch directory is left there in 
 hdfs. HDFS name node doesn't know it and try to do lease recovery. while such 
 instances happen more, it will eventually crash namenode.
 The fix is to leverage hdfs clean up functionality. While creating scratch 
 dirs, hive registers it to hdfs cleanup hook. While killing happens, hdfs 
 will clean them up.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (HIVE-3251) Hive doesn't remove scrach directories while killing running MR job

2012-07-11 Thread Gang Tim Liu (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-3251?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Gang Tim Liu updated HIVE-3251:
---

Attachment: HIVE-3151.patch

 Hive doesn't remove scrach directories while killing running MR job
 ---

 Key: HIVE-3251
 URL: https://issues.apache.org/jira/browse/HIVE-3251
 Project: Hive
  Issue Type: Bug
  Components: Server Infrastructure
Reporter: Gang Tim Liu
Assignee: Gang Tim Liu
 Attachments: HIVE-3151.patch


 While killing running MR job, hive doesn't clean up scratch directory 
 (mapred.cache.files). So that, afterwards, scratch directory is left there in 
 hdfs. HDFS name node doesn't know it and try to do lease recovery. while such 
 instances happen more, it will eventually crash namenode.
 The fix is to leverage hdfs clean up functionality. While creating scratch 
 dirs, hive registers it to hdfs cleanup hook. While killing happens, hdfs 
 will clean them up.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HIVE-3165) Hive thrift code doesnt generate quality hashCode()

2012-06-25 Thread Gang Tim Liu (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-3165?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13400820#comment-13400820
 ] 

Gang Tim Liu commented on HIVE-3165:


@Carl, good Monday. How tests go? what I can do for it. thanks a lot. Tim

 Hive thrift code doesnt generate quality hashCode()
 ---

 Key: HIVE-3165
 URL: https://issues.apache.org/jira/browse/HIVE-3165
 Project: Hive
  Issue Type: Bug
  Components: Thrift API
 Environment: Hive thrift code doesnt generate quality hashCode(). 
 Right now, it all has
 @Override
 public int hashCode() {
 return 0;
 }
 It will cause all entries go to the same bucket if it's used as hash key.
Reporter: Gang Tim Liu
Assignee: Gang Tim Liu
Priority: Minor
 Attachments: Hive-3165.patch, Hive-3165.patch-2




--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HIVE-3165) Hive thrift code doesnt generate quality hashCode()

2012-06-25 Thread Gang Tim Liu (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-3165?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13400844#comment-13400844
 ] 

Gang Tim Liu commented on HIVE-3165:


@Carl, thank you:)

 Hive thrift code doesnt generate quality hashCode()
 ---

 Key: HIVE-3165
 URL: https://issues.apache.org/jira/browse/HIVE-3165
 Project: Hive
  Issue Type: Bug
  Components: Thrift API
 Environment: Hive thrift code doesnt generate quality hashCode(). 
 Right now, it all has
 @Override
 public int hashCode() {
 return 0;
 }
 It will cause all entries go to the same bucket if it's used as hash key.
Reporter: Gang Tim Liu
Assignee: Gang Tim Liu
Priority: Minor
 Fix For: 0.10.0

 Attachments: Hive-3165.patch, Hive-3165.patch-2




--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Work started] (HIVE-3165) Hive thrift code doesnt generate quality hashCode()

2012-06-21 Thread Gang Tim Liu (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-3165?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Work on HIVE-3165 started by Gang Tim Liu.

 Hive thrift code doesnt generate quality hashCode()
 ---

 Key: HIVE-3165
 URL: https://issues.apache.org/jira/browse/HIVE-3165
 Project: Hive
  Issue Type: Bug
  Components: Thrift API
 Environment: Hive thrift code doesnt generate quality hashCode(). 
 Right now, it all has
 @Override
 public int hashCode() {
 return 0;
 }
 It will cause all entries go to the same bucket if it's used as hash key.
Reporter: Gang Tim Liu
Assignee: Gang Tim Liu
Priority: Minor



--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (HIVE-3165) Hive thrift code doesnt generate quality hashCode()

2012-06-21 Thread Gang Tim Liu (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-3165?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Gang Tim Liu updated HIVE-3165:
---

Release Note: Patch has been submitted https://reviews.facebook.net/D3759
  Status: Patch Available  (was: In Progress)

 Hive thrift code doesnt generate quality hashCode()
 ---

 Key: HIVE-3165
 URL: https://issues.apache.org/jira/browse/HIVE-3165
 Project: Hive
  Issue Type: Bug
  Components: Thrift API
 Environment: Hive thrift code doesnt generate quality hashCode(). 
 Right now, it all has
 @Override
 public int hashCode() {
 return 0;
 }
 It will cause all entries go to the same bucket if it's used as hash key.
Reporter: Gang Tim Liu
Assignee: Gang Tim Liu
Priority: Minor



--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HIVE-3165) Hive thrift code doesnt generate quality hashCode()

2012-06-21 Thread Gang Tim Liu (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-3165?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13398233#comment-13398233
 ] 

Gang Tim Liu commented on HIVE-3165:


patch is available for review https://reviews.facebook.net/D3759

 Hive thrift code doesnt generate quality hashCode()
 ---

 Key: HIVE-3165
 URL: https://issues.apache.org/jira/browse/HIVE-3165
 Project: Hive
  Issue Type: Bug
  Components: Thrift API
 Environment: Hive thrift code doesnt generate quality hashCode(). 
 Right now, it all has
 @Override
 public int hashCode() {
 return 0;
 }
 It will cause all entries go to the same bucket if it's used as hash key.
Reporter: Gang Tim Liu
Assignee: Gang Tim Liu
Priority: Minor



--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (HIVE-3165) Hive thrift code doesnt generate quality hashCode()

2012-06-21 Thread Gang Tim Liu (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-3165?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Gang Tim Liu updated HIVE-3165:
---

Attachment: Hive-3165.patch

 Hive thrift code doesnt generate quality hashCode()
 ---

 Key: HIVE-3165
 URL: https://issues.apache.org/jira/browse/HIVE-3165
 Project: Hive
  Issue Type: Bug
  Components: Thrift API
 Environment: Hive thrift code doesnt generate quality hashCode(). 
 Right now, it all has
 @Override
 public int hashCode() {
 return 0;
 }
 It will cause all entries go to the same bucket if it's used as hash key.
Reporter: Gang Tim Liu
Assignee: Gang Tim Liu
Priority: Minor
 Attachments: Hive-3165.patch




--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HIVE-3165) Hive thrift code doesnt generate quality hashCode()

2012-06-21 Thread Gang Tim Liu (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-3165?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13398238#comment-13398238
 ] 

Gang Tim Liu commented on HIVE-3165:


Patch is attached to here also.

 Hive thrift code doesnt generate quality hashCode()
 ---

 Key: HIVE-3165
 URL: https://issues.apache.org/jira/browse/HIVE-3165
 Project: Hive
  Issue Type: Bug
  Components: Thrift API
 Environment: Hive thrift code doesnt generate quality hashCode(). 
 Right now, it all has
 @Override
 public int hashCode() {
 return 0;
 }
 It will cause all entries go to the same bucket if it's used as hash key.
Reporter: Gang Tim Liu
Assignee: Gang Tim Liu
Priority: Minor
 Attachments: Hive-3165.patch




--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HIVE-3165) Hive thrift code doesnt generate quality hashCode()

2012-06-21 Thread Gang Tim Liu (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-3165?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13398254#comment-13398254
 ] 

Gang Tim Liu commented on HIVE-3165:


@Carl, let me check. thanks a lot

 Hive thrift code doesnt generate quality hashCode()
 ---

 Key: HIVE-3165
 URL: https://issues.apache.org/jira/browse/HIVE-3165
 Project: Hive
  Issue Type: Bug
  Components: Thrift API
 Environment: Hive thrift code doesnt generate quality hashCode(). 
 Right now, it all has
 @Override
 public int hashCode() {
 return 0;
 }
 It will cause all entries go to the same bucket if it's used as hash key.
Reporter: Gang Tim Liu
Assignee: Gang Tim Liu
Priority: Minor
 Attachments: Hive-3165.patch




--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HIVE-3165) Hive thrift code doesnt generate quality hashCode()

2012-06-21 Thread Gang Tim Liu (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-3165?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13398579#comment-13398579
 ] 

Gang Tim Liu commented on HIVE-3165:


Test passed. another batch is submitted to https://reviews.facebook.net/D3759. 
thanks

 Hive thrift code doesnt generate quality hashCode()
 ---

 Key: HIVE-3165
 URL: https://issues.apache.org/jira/browse/HIVE-3165
 Project: Hive
  Issue Type: Bug
  Components: Thrift API
 Environment: Hive thrift code doesnt generate quality hashCode(). 
 Right now, it all has
 @Override
 public int hashCode() {
 return 0;
 }
 It will cause all entries go to the same bucket if it's used as hash key.
Reporter: Gang Tim Liu
Assignee: Gang Tim Liu
Priority: Minor
 Attachments: Hive-3165.patch




--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (HIVE-3165) Hive thrift code doesnt generate quality hashCode()

2012-06-21 Thread Gang Tim Liu (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-3165?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Gang Tim Liu updated HIVE-3165:
---

Attachment: Hive-3165.patch-2

 Hive thrift code doesnt generate quality hashCode()
 ---

 Key: HIVE-3165
 URL: https://issues.apache.org/jira/browse/HIVE-3165
 Project: Hive
  Issue Type: Bug
  Components: Thrift API
 Environment: Hive thrift code doesnt generate quality hashCode(). 
 Right now, it all has
 @Override
 public int hashCode() {
 return 0;
 }
 It will cause all entries go to the same bucket if it's used as hash key.
Reporter: Gang Tim Liu
Assignee: Gang Tim Liu
Priority: Minor
 Attachments: Hive-3165.patch, Hive-3165.patch-2




--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (HIVE-3165) Hive thrift code doesnt generate quality hashCode()

2012-06-21 Thread Gang Tim Liu (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-3165?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Gang Tim Liu updated HIVE-3165:
---

Status: Patch Available  (was: Open)

A new patch is submitted.

 Hive thrift code doesnt generate quality hashCode()
 ---

 Key: HIVE-3165
 URL: https://issues.apache.org/jira/browse/HIVE-3165
 Project: Hive
  Issue Type: Bug
  Components: Thrift API
 Environment: Hive thrift code doesnt generate quality hashCode(). 
 Right now, it all has
 @Override
 public int hashCode() {
 return 0;
 }
 It will cause all entries go to the same bucket if it's used as hash key.
Reporter: Gang Tim Liu
Assignee: Gang Tim Liu
Priority: Minor
 Attachments: Hive-3165.patch, Hive-3165.patch-2




--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HIVE-3165) Hive thrift code doesnt generate quality hashCode()

2012-06-21 Thread Gang Tim Liu (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-3165?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13398959#comment-13398959
 ] 

Gang Tim Liu commented on HIVE-3165:


@Carl, thanks

@Namit, thanks

 Hive thrift code doesnt generate quality hashCode()
 ---

 Key: HIVE-3165
 URL: https://issues.apache.org/jira/browse/HIVE-3165
 Project: Hive
  Issue Type: Bug
  Components: Thrift API
 Environment: Hive thrift code doesnt generate quality hashCode(). 
 Right now, it all has
 @Override
 public int hashCode() {
 return 0;
 }
 It will cause all entries go to the same bucket if it's used as hash key.
Reporter: Gang Tim Liu
Assignee: Gang Tim Liu
Priority: Minor
 Attachments: Hive-3165.patch, Hive-3165.patch-2




--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Created] (HIVE-3165) Hive thrift code doesnt generate quality hashCode()

2012-06-19 Thread Gang Tim Liu (JIRA)
Gang Tim Liu created HIVE-3165:
--

 Summary: Hive thrift code doesnt generate quality hashCode()
 Key: HIVE-3165
 URL: https://issues.apache.org/jira/browse/HIVE-3165
 Project: Hive
  Issue Type: Bug
  Components: Thrift API
 Environment: Hive thrift code doesnt generate quality hashCode(). 
Right now, it all has
@Override
public int hashCode() {
return 0;
}

It will cause all entries go to the same bucket if it's used as hash key.
Reporter: Gang Tim Liu
Assignee: Gang Tim Liu
Priority: Minor




--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (HIVE-3072) Hive List Bucketing - DDL support

2012-06-11 Thread Gang Tim Liu (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-3072?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Gang Tim Liu updated HIVE-3072:
---

Summary: Hive List Bucketing - DDL support  (was: Hive List Bucketing - DDL 
support (single column))

 Hive List Bucketing - DDL support
 -

 Key: HIVE-3072
 URL: https://issues.apache.org/jira/browse/HIVE-3072
 Project: Hive
  Issue Type: New Feature
  Components: SQL
Reporter: Gang Tim Liu
Assignee: Gang Tim Liu

 If a hive table column has skewed keys, query performance on non-skewed key 
 is always impacted. Hive List Bucketing feature will address it:
 https://cwiki.apache.org/Hive/listbucketing.html
 This jira issue will track DDL change for the feature. It's for single skewed 
 column.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (HIVE-3072) Hive List Bucketing - DDL support

2012-06-11 Thread Gang Tim Liu (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-3072?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Gang Tim Liu updated HIVE-3072:
---

Description: 
If a hive table column has skewed keys, query performance on non-skewed key is 
always impacted. Hive List Bucketing feature will address it:

https://cwiki.apache.org/Hive/listbucketing.html

This jira issue will track DDL change for the feature. It's for both single 
skewed column and multiple columns.

  was:
If a hive table column has skewed keys, query performance on non-skewed key is 
always impacted. Hive List Bucketing feature will address it:

https://cwiki.apache.org/Hive/listbucketing.html

This jira issue will track DDL change for the feature. It's for single skewed 
column.


 Hive List Bucketing - DDL support
 -

 Key: HIVE-3072
 URL: https://issues.apache.org/jira/browse/HIVE-3072
 Project: Hive
  Issue Type: New Feature
  Components: SQL
Reporter: Gang Tim Liu
Assignee: Gang Tim Liu

 If a hive table column has skewed keys, query performance on non-skewed key 
 is always impacted. Hive List Bucketing feature will address it:
 https://cwiki.apache.org/Hive/listbucketing.html
 This jira issue will track DDL change for the feature. It's for both single 
 skewed column and multiple columns.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HIVE-3072) Hive List Bucketing - DDL support

2012-06-11 Thread Gang Tim Liu (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-3072?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13292978#comment-13292978
 ] 

Gang Tim Liu commented on HIVE-3072:


making progress on DML. The following syntax started to work: 
create table T (c1 string, c2 string) list bucketed by (c1) with skew 
('x1');
create table T (c1 string, c2 string, c3 string) list bucketed by (c1, 
c2) with skew (('x1', 'x2'), ('y1', 'y2'));


 Hive List Bucketing - DDL support
 -

 Key: HIVE-3072
 URL: https://issues.apache.org/jira/browse/HIVE-3072
 Project: Hive
  Issue Type: New Feature
  Components: SQL
Reporter: Gang Tim Liu
Assignee: Gang Tim Liu

 If a hive table column has skewed keys, query performance on non-skewed key 
 is always impacted. Hive List Bucketing feature will address it:
 https://cwiki.apache.org/Hive/listbucketing.html
 This jira issue will track DDL change for the feature. It's for both single 
 skewed column and multiple columns.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




<    1   2   3   4   5   6   7   >