[jira] [Updated] (HIVE-3485) Skewed DDL doesn't support skewed value with string quote
[ https://issues.apache.org/jira/browse/HIVE-3485?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gang Tim Liu updated HIVE-3485: --- Attachment: hive-3485.patch.1 Skewed DDL doesn't support skewed value with string quote - Key: HIVE-3485 URL: https://issues.apache.org/jira/browse/HIVE-3485 Project: Hive Issue Type: Bug Reporter: Gang Tim Liu Assignee: Gang Tim Liu Priority: Minor Attachments: hive-3485.patch.1 CREATE TABLE list_bucket_single (key STRING, value STRING) SKEWED BY (key) ON ('1','5','6') Save '1' as in map instead 1 should -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HIVE-3485) Skewed DDL doesn't support skewed value with string quote
[ https://issues.apache.org/jira/browse/HIVE-3485?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13458366#comment-13458366 ] Gang Tim Liu commented on HIVE-3485: ready for review https://reviews.facebook.net/D5547 Skewed DDL doesn't support skewed value with string quote - Key: HIVE-3485 URL: https://issues.apache.org/jira/browse/HIVE-3485 Project: Hive Issue Type: Bug Reporter: Gang Tim Liu Assignee: Gang Tim Liu Priority: Minor Attachments: hive-3485.patch.1 CREATE TABLE list_bucket_single (key STRING, value STRING) SKEWED BY (key) ON ('1','5','6') Save '1' as in map instead 1 should -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
Re: run a test case for hadoop 23
Zhenxiao, Thanks. It also works. Sorry for late reply. Tim On 9/11/12 5:08 PM, Zhenxiao Luo zhenx...@cloudera.com wrote: Hi, The following seems working to build/test hadoop23: $ant very-clean package -Dhadoop.version=0.23.1 -Dhadoop-0.23.version=0.23.1 -Dhadoop.mr.rev=23 $ant test -Dhadoop.version=0.23.1 -Dhadoop-0.23.version=0.23.1 -Dhadoop.mr.rev=23 Thanks, Zhenxiao On Tue, Sep 11, 2012 at 4:51 PM, Gang Tim Liu g...@fb.com wrote: This one will work and it doesn't need change in build.properties. Details in HIVE-3389. ant test -Dhadoop.mr.rev=23 -Dtest.print.classpath=true -Dhadoop.version=2.0.0-alpha -Dhadoop.security.version=2.0.0-alpha -Dtestcase=TestCliDriver -Dqfile=...q I will update hive wiki so that people can save efforts. Thanks Tim On 9/11/12 4:14 PM, Gang Tim Liu g...@fb.com wrote: I am running test in eclipse and it is printing INCLUDE list does not contain Hadoop Version 0.20. SkippingÅ It's from QTestUtil.java due to hadoopversion 0.20 Thanks On 9/11/12 4:04 PM, Chris Drome cdr...@yahoo-inc.com wrote: You can try cleaning and rebuilding: ant clean package Then you can try running your tests: ant test How do you know that it is using hadoop 20? chris On 9/11/12 3:57 PM, Gang Tim Liu g...@fb.com wrote: Not working. I must miss something. What command do you use after changing it? thanks On 9/11/12 2:31 PM, Chris Drome cdr...@yahoo-inc.com wrote: You might try modifying the build.properties file as follows: hadoop.version=${hadoop-0.23.version} hadoop.security.version=${hadoop-0.23.version} hadoop.mr.rev=23 This works for me. chris On 9/11/12 1:59 PM, Gang Tim Liu g...@fb.com wrote: Hello all, I am looking for help and thank first. How to run a test case for hadoop 23 only? Exact command? I tried the following and it didn't work: ant ... -Dhadoop.version=0.23.1 -Dhadoop-0.23.version=0.23.1 -Dhadoop.mr.rev=23 It still gets 0.20 as hadoop version. Thanks Tim
run a test case for hadoop 23
Hello all, I am looking for help and thank first. How to run a test case for hadoop 23 only? Exact command? I tried the following and it didn't work: ant ... -Dhadoop.version=0.23.1 -Dhadoop-0.23.version=0.23.1 -Dhadoop.mr.rev=23 It still gets 0.20 as hadoop version. Thanks Tim
Re: run a test case for hadoop 23
Not working. I must miss something. What command do you use after changing it? thanks On 9/11/12 2:31 PM, Chris Drome cdr...@yahoo-inc.com wrote: You might try modifying the build.properties file as follows: hadoop.version=${hadoop-0.23.version} hadoop.security.version=${hadoop-0.23.version} hadoop.mr.rev=23 This works for me. chris On 9/11/12 1:59 PM, Gang Tim Liu g...@fb.com wrote: Hello all, I am looking for help and thank first. How to run a test case for hadoop 23 only? Exact command? I tried the following and it didn't work: ant ... -Dhadoop.version=0.23.1 -Dhadoop-0.23.version=0.23.1 -Dhadoop.mr.rev=23 It still gets 0.20 as hadoop version. Thanks Tim
Re: run a test case for hadoop 23
I am running test in eclipse and it is printing INCLUDE list does not contain Hadoop Version 0.20. SkippingÅ It's from QTestUtil.java due to hadoopversion 0.20 Thanks On 9/11/12 4:04 PM, Chris Drome cdr...@yahoo-inc.com wrote: You can try cleaning and rebuilding: ant clean package Then you can try running your tests: ant test How do you know that it is using hadoop 20? chris On 9/11/12 3:57 PM, Gang Tim Liu g...@fb.com wrote: Not working. I must miss something. What command do you use after changing it? thanks On 9/11/12 2:31 PM, Chris Drome cdr...@yahoo-inc.com wrote: You might try modifying the build.properties file as follows: hadoop.version=${hadoop-0.23.version} hadoop.security.version=${hadoop-0.23.version} hadoop.mr.rev=23 This works for me. chris On 9/11/12 1:59 PM, Gang Tim Liu g...@fb.com wrote: Hello all, I am looking for help and thank first. How to run a test case for hadoop 23 only? Exact command? I tried the following and it didn't work: ant ... -Dhadoop.version=0.23.1 -Dhadoop-0.23.version=0.23.1 -Dhadoop.mr.rev=23 It still gets 0.20 as hadoop version. Thanks Tim
Re: run a test case for hadoop 23
This one will work and it doesn't need change in build.properties. Details in HIVE-3389. ant test -Dhadoop.mr.rev=23 -Dtest.print.classpath=true -Dhadoop.version=2.0.0-alpha -Dhadoop.security.version=2.0.0-alpha -Dtestcase=TestCliDriver -Dqfile=...q I will update hive wiki so that people can save efforts. Thanks Tim On 9/11/12 4:14 PM, Gang Tim Liu g...@fb.com wrote: I am running test in eclipse and it is printing INCLUDE list does not contain Hadoop Version 0.20. SkippingÅ It's from QTestUtil.java due to hadoopversion 0.20 Thanks On 9/11/12 4:04 PM, Chris Drome cdr...@yahoo-inc.com wrote: You can try cleaning and rebuilding: ant clean package Then you can try running your tests: ant test How do you know that it is using hadoop 20? chris On 9/11/12 3:57 PM, Gang Tim Liu g...@fb.com wrote: Not working. I must miss something. What command do you use after changing it? thanks On 9/11/12 2:31 PM, Chris Drome cdr...@yahoo-inc.com wrote: You might try modifying the build.properties file as follows: hadoop.version=${hadoop-0.23.version} hadoop.security.version=${hadoop-0.23.version} hadoop.mr.rev=23 This works for me. chris On 9/11/12 1:59 PM, Gang Tim Liu g...@fb.com wrote: Hello all, I am looking for help and thank first. How to run a test case for hadoop 23 only? Exact command? I tried the following and it didn't work: ant ... -Dhadoop.version=0.23.1 -Dhadoop-0.23.version=0.23.1 -Dhadoop.mr.rev=23 It still gets 0.20 as hadoop version. Thanks Tim
[jira] [Updated] (HIVE-3072) Hive List Bucketing - DDL support
[ https://issues.apache.org/jira/browse/HIVE-3072?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gang Tim Liu updated HIVE-3072: --- Status: Patch Available (was: In Progress) Patch is available. Hive List Bucketing - DDL support - Key: HIVE-3072 URL: https://issues.apache.org/jira/browse/HIVE-3072 Project: Hive Issue Type: New Feature Components: SQL Reporter: Gang Tim Liu Assignee: Gang Tim Liu Attachments: HIVE-3072.patch, HIVE-3072.patch.1, HIVE-3072.patch.2, HIVE-3072.patch.3, HIVE-3072.patch.4, HIVE-3072.patch.5, HIVE-3072.patch.6, HIVE-3072.patch.7 If a hive table column has skewed keys, query performance on non-skewed key is always impacted. Hive List Bucketing feature will address it: https://cwiki.apache.org/Hive/listbucketing.html This jira issue will track DDL change for the feature. It's for both single skewed column and multiple columns. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-3072) Hive List Bucketing - DDL support
[ https://issues.apache.org/jira/browse/HIVE-3072?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gang Tim Liu updated HIVE-3072: --- Attachment: HIVE-3072.patch.7 Hive List Bucketing - DDL support - Key: HIVE-3072 URL: https://issues.apache.org/jira/browse/HIVE-3072 Project: Hive Issue Type: New Feature Components: SQL Reporter: Gang Tim Liu Assignee: Gang Tim Liu Attachments: HIVE-3072.patch, HIVE-3072.patch.1, HIVE-3072.patch.2, HIVE-3072.patch.3, HIVE-3072.patch.4, HIVE-3072.patch.5, HIVE-3072.patch.6, HIVE-3072.patch.7 If a hive table column has skewed keys, query performance on non-skewed key is always impacted. Hive List Bucketing feature will address it: https://cwiki.apache.org/Hive/listbucketing.html This jira issue will track DDL change for the feature. It's for both single skewed column and multiple columns. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Work started] (HIVE-3072) Hive List Bucketing - DDL support
[ https://issues.apache.org/jira/browse/HIVE-3072?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Work on HIVE-3072 started by Gang Tim Liu. Hive List Bucketing - DDL support - Key: HIVE-3072 URL: https://issues.apache.org/jira/browse/HIVE-3072 Project: Hive Issue Type: New Feature Components: SQL Reporter: Gang Tim Liu Assignee: Gang Tim Liu Attachments: HIVE-3072.patch, HIVE-3072.patch.1, HIVE-3072.patch.2, HIVE-3072.patch.3, HIVE-3072.patch.4 If a hive table column has skewed keys, query performance on non-skewed key is always impacted. Hive List Bucketing feature will address it: https://cwiki.apache.org/Hive/listbucketing.html This jira issue will track DDL change for the feature. It's for both single skewed column and multiple columns. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-3072) Hive List Bucketing - DDL support
[ https://issues.apache.org/jira/browse/HIVE-3072?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gang Tim Liu updated HIVE-3072: --- Attachment: HIVE-3072.patch.5 Hive List Bucketing - DDL support - Key: HIVE-3072 URL: https://issues.apache.org/jira/browse/HIVE-3072 Project: Hive Issue Type: New Feature Components: SQL Reporter: Gang Tim Liu Assignee: Gang Tim Liu Attachments: HIVE-3072.patch, HIVE-3072.patch.1, HIVE-3072.patch.2, HIVE-3072.patch.3, HIVE-3072.patch.4, HIVE-3072.patch.5 If a hive table column has skewed keys, query performance on non-skewed key is always impacted. Hive List Bucketing feature will address it: https://cwiki.apache.org/Hive/listbucketing.html This jira issue will track DDL change for the feature. It's for both single skewed column and multiple columns. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-3072) Hive List Bucketing - DDL support
[ https://issues.apache.org/jira/browse/HIVE-3072?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gang Tim Liu updated HIVE-3072: --- Status: Patch Available (was: In Progress) Patch is ready for review. It's in both jira and phabricator. Hive List Bucketing - DDL support - Key: HIVE-3072 URL: https://issues.apache.org/jira/browse/HIVE-3072 Project: Hive Issue Type: New Feature Components: SQL Reporter: Gang Tim Liu Assignee: Gang Tim Liu Attachments: HIVE-3072.patch, HIVE-3072.patch.1, HIVE-3072.patch.2, HIVE-3072.patch.3, HIVE-3072.patch.4, HIVE-3072.patch.5 If a hive table column has skewed keys, query performance on non-skewed key is always impacted. Hive List Bucketing feature will address it: https://cwiki.apache.org/Hive/listbucketing.html This jira issue will track DDL change for the feature. It's for both single skewed column and multiple columns. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Work started] (HIVE-3072) Hive List Bucketing - DDL support
[ https://issues.apache.org/jira/browse/HIVE-3072?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Work on HIVE-3072 started by Gang Tim Liu. Hive List Bucketing - DDL support - Key: HIVE-3072 URL: https://issues.apache.org/jira/browse/HIVE-3072 Project: Hive Issue Type: New Feature Components: SQL Reporter: Gang Tim Liu Assignee: Gang Tim Liu Attachments: HIVE-3072.patch, HIVE-3072.patch.1, HIVE-3072.patch.2, HIVE-3072.patch.3, HIVE-3072.patch.4, HIVE-3072.patch.5, HIVE-3072.patch.6 If a hive table column has skewed keys, query performance on non-skewed key is always impacted. Hive List Bucketing feature will address it: https://cwiki.apache.org/Hive/listbucketing.html This jira issue will track DDL change for the feature. It's for both single skewed column and multiple columns. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HIVE-3072) Hive List Bucketing - DDL support
[ https://issues.apache.org/jira/browse/HIVE-3072?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13440845#comment-13440845 ] Gang Tim Liu commented on HIVE-3072: Patch is available on both jara and phabricator. Hive List Bucketing - DDL support - Key: HIVE-3072 URL: https://issues.apache.org/jira/browse/HIVE-3072 Project: Hive Issue Type: New Feature Components: SQL Reporter: Gang Tim Liu Assignee: Gang Tim Liu Attachments: HIVE-3072.patch, HIVE-3072.patch.1, HIVE-3072.patch.2, HIVE-3072.patch.3, HIVE-3072.patch.4, HIVE-3072.patch.5, HIVE-3072.patch.6 If a hive table column has skewed keys, query performance on non-skewed key is always impacted. Hive List Bucketing feature will address it: https://cwiki.apache.org/Hive/listbucketing.html This jira issue will track DDL change for the feature. It's for both single skewed column and multiple columns. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-3072) Hive List Bucketing - DDL support
[ https://issues.apache.org/jira/browse/HIVE-3072?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gang Tim Liu updated HIVE-3072: --- Attachment: HIVE-3072.patch.6 Hive List Bucketing - DDL support - Key: HIVE-3072 URL: https://issues.apache.org/jira/browse/HIVE-3072 Project: Hive Issue Type: New Feature Components: SQL Reporter: Gang Tim Liu Assignee: Gang Tim Liu Attachments: HIVE-3072.patch, HIVE-3072.patch.1, HIVE-3072.patch.2, HIVE-3072.patch.3, HIVE-3072.patch.4, HIVE-3072.patch.5, HIVE-3072.patch.6 If a hive table column has skewed keys, query performance on non-skewed key is always impacted. Hive List Bucketing feature will address it: https://cwiki.apache.org/Hive/listbucketing.html This jira issue will track DDL change for the feature. It's for both single skewed column and multiple columns. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HIVE-3072) Hive List Bucketing - DDL support
[ https://issues.apache.org/jira/browse/HIVE-3072?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13440881#comment-13440881 ] Gang Tim Liu commented on HIVE-3072: @Carl, yes , will code after dinner. Thanks a lot Hive List Bucketing - DDL support - Key: HIVE-3072 URL: https://issues.apache.org/jira/browse/HIVE-3072 Project: Hive Issue Type: New Feature Components: SQL Reporter: Gang Tim Liu Assignee: Gang Tim Liu Attachments: HIVE-3072.patch, HIVE-3072.patch.1, HIVE-3072.patch.2, HIVE-3072.patch.3, HIVE-3072.patch.4, HIVE-3072.patch.5, HIVE-3072.patch.6 If a hive table column has skewed keys, query performance on non-skewed key is always impacted. Hive List Bucketing feature will address it: https://cwiki.apache.org/Hive/listbucketing.html This jira issue will track DDL change for the feature. It's for both single skewed column and multiple columns. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Work started] (HIVE-3072) Hive List Bucketing - DDL support
[ https://issues.apache.org/jira/browse/HIVE-3072?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Work on HIVE-3072 started by Gang Tim Liu. Hive List Bucketing - DDL support - Key: HIVE-3072 URL: https://issues.apache.org/jira/browse/HIVE-3072 Project: Hive Issue Type: New Feature Components: SQL Reporter: Gang Tim Liu Assignee: Gang Tim Liu Attachments: HIVE-3072.patch, HIVE-3072.patch.1, HIVE-3072.patch.2, HIVE-3072.patch.3 If a hive table column has skewed keys, query performance on non-skewed key is always impacted. Hive List Bucketing feature will address it: https://cwiki.apache.org/Hive/listbucketing.html This jira issue will track DDL change for the feature. It's for both single skewed column and multiple columns. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-3072) Hive List Bucketing - DDL support
[ https://issues.apache.org/jira/browse/HIVE-3072?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gang Tim Liu updated HIVE-3072: --- Attachment: HIVE-3072.patch.4 Hive List Bucketing - DDL support - Key: HIVE-3072 URL: https://issues.apache.org/jira/browse/HIVE-3072 Project: Hive Issue Type: New Feature Components: SQL Reporter: Gang Tim Liu Assignee: Gang Tim Liu Attachments: HIVE-3072.patch, HIVE-3072.patch.1, HIVE-3072.patch.2, HIVE-3072.patch.3, HIVE-3072.patch.4 If a hive table column has skewed keys, query performance on non-skewed key is always impacted. Hive List Bucketing feature will address it: https://cwiki.apache.org/Hive/listbucketing.html This jira issue will track DDL change for the feature. It's for both single skewed column and multiple columns. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-3072) Hive List Bucketing - DDL support
[ https://issues.apache.org/jira/browse/HIVE-3072?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gang Tim Liu updated HIVE-3072: --- Status: Patch Available (was: In Progress) patch is available on both jira and phabricator. thanks Hive List Bucketing - DDL support - Key: HIVE-3072 URL: https://issues.apache.org/jira/browse/HIVE-3072 Project: Hive Issue Type: New Feature Components: SQL Reporter: Gang Tim Liu Assignee: Gang Tim Liu Attachments: HIVE-3072.patch, HIVE-3072.patch.1, HIVE-3072.patch.2, HIVE-3072.patch.3, HIVE-3072.patch.4 If a hive table column has skewed keys, query performance on non-skewed key is always impacted. Hive List Bucketing feature will address it: https://cwiki.apache.org/Hive/listbucketing.html This jira issue will track DDL change for the feature. It's for both single skewed column and multiple columns. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HIVE-3394) Refactor a few classes' constructors with creational patterns
[ https://issues.apache.org/jira/browse/HIVE-3394?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13438007#comment-13438007 ] Gang Tim Liu commented on HIVE-3394: @Carl, great! will address them and have a patch ready before noon. thanks Refactor a few classes' constructors with creational patterns - Key: HIVE-3394 URL: https://issues.apache.org/jira/browse/HIVE-3394 Project: Hive Issue Type: Bug Reporter: Gang Tim Liu Assignee: Gang Tim Liu Priority: Minor Attachments: HIVE-3394.patch.1 It's good to refactor the following classes' constructors with builder/factory pattern. This should be done before HIVE-3072 so that HIVE-3072 can refresh according and extend it to skewed use case: 1. MStorageDescriptor.java 2. ColumnInfo.java 3. ParseContext.java 4. CreateTableDesc.java 5. ExprNodeColumnDesc.java -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Work started] (HIVE-3394) Refactor a few classes' constructors with creational patterns
[ https://issues.apache.org/jira/browse/HIVE-3394?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Work on HIVE-3394 started by Gang Tim Liu. Refactor a few classes' constructors with creational patterns - Key: HIVE-3394 URL: https://issues.apache.org/jira/browse/HIVE-3394 Project: Hive Issue Type: Bug Reporter: Gang Tim Liu Assignee: Gang Tim Liu Priority: Minor Attachments: HIVE-3394.patch.1 It's good to refactor the following classes' constructors with builder/factory pattern. This should be done before HIVE-3072 so that HIVE-3072 can refresh according and extend it to skewed use case: 1. MStorageDescriptor.java 2. ColumnInfo.java 3. ParseContext.java 4. CreateTableDesc.java 5. ExprNodeColumnDesc.java -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-3394) Refactor a few classes' constructors with creational patterns
[ https://issues.apache.org/jira/browse/HIVE-3394?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gang Tim Liu updated HIVE-3394: --- Status: Patch Available (was: In Progress) Patch is available in both jira and phabricator. Some explannations in phabricator. Refactor a few classes' constructors with creational patterns - Key: HIVE-3394 URL: https://issues.apache.org/jira/browse/HIVE-3394 Project: Hive Issue Type: Bug Reporter: Gang Tim Liu Assignee: Gang Tim Liu Priority: Minor Attachments: HIVE-3394.patch.1, HIVE-3394.patch.2 It's good to refactor the following classes' constructors with builder/factory pattern. This should be done before HIVE-3072 so that HIVE-3072 can refresh according and extend it to skewed use case: 1. MStorageDescriptor.java 2. ColumnInfo.java 3. ParseContext.java 4. CreateTableDesc.java 5. ExprNodeColumnDesc.java -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-3394) Refactor a few classes' constructors with creational patterns
[ https://issues.apache.org/jira/browse/HIVE-3394?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gang Tim Liu updated HIVE-3394: --- Attachment: HIVE-3394.patch.2 Refactor a few classes' constructors with creational patterns - Key: HIVE-3394 URL: https://issues.apache.org/jira/browse/HIVE-3394 Project: Hive Issue Type: Bug Reporter: Gang Tim Liu Assignee: Gang Tim Liu Priority: Minor Attachments: HIVE-3394.patch.1, HIVE-3394.patch.2 It's good to refactor the following classes' constructors with builder/factory pattern. This should be done before HIVE-3072 so that HIVE-3072 can refresh according and extend it to skewed use case: 1. MStorageDescriptor.java 2. ColumnInfo.java 3. ParseContext.java 4. CreateTableDesc.java 5. ExprNodeColumnDesc.java -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Work started] (HIVE-3072) Hive List Bucketing - DDL support
[ https://issues.apache.org/jira/browse/HIVE-3072?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Work on HIVE-3072 started by Gang Tim Liu. Hive List Bucketing - DDL support - Key: HIVE-3072 URL: https://issues.apache.org/jira/browse/HIVE-3072 Project: Hive Issue Type: New Feature Components: SQL Reporter: Gang Tim Liu Assignee: Gang Tim Liu Attachments: HIVE-3072.patch, HIVE-3072.patch.1, HIVE-3072.patch.2, HIVE-3072.patch.3 If a hive table column has skewed keys, query performance on non-skewed key is always impacted. Hive List Bucketing feature will address it: https://cwiki.apache.org/Hive/listbucketing.html This jira issue will track DDL change for the feature. It's for both single skewed column and multiple columns. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-3072) Hive List Bucketing - DDL support
[ https://issues.apache.org/jira/browse/HIVE-3072?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gang Tim Liu updated HIVE-3072: --- Attachment: HIVE-3072.patch.3 Hive List Bucketing - DDL support - Key: HIVE-3072 URL: https://issues.apache.org/jira/browse/HIVE-3072 Project: Hive Issue Type: New Feature Components: SQL Reporter: Gang Tim Liu Assignee: Gang Tim Liu Attachments: HIVE-3072.patch, HIVE-3072.patch.1, HIVE-3072.patch.2, HIVE-3072.patch.3 If a hive table column has skewed keys, query performance on non-skewed key is always impacted. Hive List Bucketing feature will address it: https://cwiki.apache.org/Hive/listbucketing.html This jira issue will track DDL change for the feature. It's for both single skewed column and multiple columns. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-3072) Hive List Bucketing - DDL support
[ https://issues.apache.org/jira/browse/HIVE-3072?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gang Tim Liu updated HIVE-3072: --- Status: Patch Available (was: In Progress) Patch is available on both jira and phabricator. Hive List Bucketing - DDL support - Key: HIVE-3072 URL: https://issues.apache.org/jira/browse/HIVE-3072 Project: Hive Issue Type: New Feature Components: SQL Reporter: Gang Tim Liu Assignee: Gang Tim Liu Attachments: HIVE-3072.patch, HIVE-3072.patch.1, HIVE-3072.patch.2, HIVE-3072.patch.3 If a hive table column has skewed keys, query performance on non-skewed key is always impacted. Hive List Bucketing feature will address it: https://cwiki.apache.org/Hive/listbucketing.html This jira issue will track DDL change for the feature. It's for both single skewed column and multiple columns. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HIVE-3072) Hive List Bucketing - DDL support
[ https://issues.apache.org/jira/browse/HIVE-3072?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13438451#comment-13438451 ] Gang Tim Liu commented on HIVE-3072: @Carl, thank you very much for quick review. will address them right now and get a patch tonight. thanks Hive List Bucketing - DDL support - Key: HIVE-3072 URL: https://issues.apache.org/jira/browse/HIVE-3072 Project: Hive Issue Type: New Feature Components: SQL Reporter: Gang Tim Liu Assignee: Gang Tim Liu Attachments: HIVE-3072.patch, HIVE-3072.patch.1, HIVE-3072.patch.2, HIVE-3072.patch.3 If a hive table column has skewed keys, query performance on non-skewed key is always impacted. Hive List Bucketing feature will address it: https://cwiki.apache.org/Hive/listbucketing.html This jira issue will track DDL change for the feature. It's for both single skewed column and multiple columns. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HIVE-3394) Refactor a few classes' constructors with creational patterns
[ https://issues.apache.org/jira/browse/HIVE-3394?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13437265#comment-13437265 ] Gang Tim Liu commented on HIVE-3394: @Carl: thank you for giving me the opportunities to work on it. Initially have natural resistance due to work plan but quickly realize it is good thing to do and will make HIVE better which is my goal. I enjoy this refactor with them in mind. have a good weekend. Refactor a few classes' constructors with creational patterns - Key: HIVE-3394 URL: https://issues.apache.org/jira/browse/HIVE-3394 Project: Hive Issue Type: Bug Reporter: Gang Tim Liu Assignee: Gang Tim Liu Priority: Minor Attachments: HIVE-3394.patch.1 It's good to refactor the following classes' constructors with builder/factory pattern. This should be done before HIVE-3072 so that HIVE-3072 can refresh according and extend it to skewed use case: 1. MStorageDescriptor.java 2. ColumnInfo.java 3. ParseContext.java 4. CreateTableDesc.java 5. ExprNodeColumnDesc.java -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-3394) Refractor a few classes' constructor with builder pattern
[ https://issues.apache.org/jira/browse/HIVE-3394?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gang Tim Liu updated HIVE-3394: --- Description: It's good to refactor the following classes' constructors with builder/factory pattern. This should be done before HIVE-3072 so that HIVE-3072 can refresh according and extend it to skewed use case: 1. MStorageDescriptor.java 2. ColumnInfo.java 3. ParseContext.java 4. CreateTableDesc.java 5. ExprNodeColumnDesc.java Environment: (was: It's good to refactor the following classes' constructors with builder/factory pattern. This should be done before HIVE-3072 so that HIVE-3072 can refresh according and extend it to skewed use case: 1. MStorageDescriptor.java 2. ColumnInfo.java 3. ParseContext.java 4. CreateTableDesc.java 5. ExprNodeColumnDesc.java) Refractor a few classes' constructor with builder pattern - Key: HIVE-3394 URL: https://issues.apache.org/jira/browse/HIVE-3394 Project: Hive Issue Type: Bug Reporter: Gang Tim Liu Assignee: Gang Tim Liu Priority: Minor It's good to refactor the following classes' constructors with builder/factory pattern. This should be done before HIVE-3072 so that HIVE-3072 can refresh according and extend it to skewed use case: 1. MStorageDescriptor.java 2. ColumnInfo.java 3. ParseContext.java 4. CreateTableDesc.java 5. ExprNodeColumnDesc.java -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Created] (HIVE-3394) Refractor a few classes' constructor with builder pattern
Gang Tim Liu created HIVE-3394: -- Summary: Refractor a few classes' constructor with builder pattern Key: HIVE-3394 URL: https://issues.apache.org/jira/browse/HIVE-3394 Project: Hive Issue Type: Bug Environment: It's good to refactor the following classes' constructors with builder/factory pattern. This should be done before HIVE-3072 so that HIVE-3072 can refresh according and extend it to skewed use case: 1. MStorageDescriptor.java 2. ColumnInfo.java 3. ParseContext.java 4. CreateTableDesc.java 5. ExprNodeColumnDesc.java Reporter: Gang Tim Liu Assignee: Gang Tim Liu Priority: Minor -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-3072) Hive List Bucketing - DDL support
[ https://issues.apache.org/jira/browse/HIVE-3072?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gang Tim Liu updated HIVE-3072: --- Attachment: HIVE-3072.patch.2 Hive List Bucketing - DDL support - Key: HIVE-3072 URL: https://issues.apache.org/jira/browse/HIVE-3072 Project: Hive Issue Type: New Feature Components: SQL Reporter: Gang Tim Liu Assignee: Gang Tim Liu Attachments: HIVE-3072.patch, HIVE-3072.patch.1, HIVE-3072.patch.2 If a hive table column has skewed keys, query performance on non-skewed key is always impacted. Hive List Bucketing feature will address it: https://cwiki.apache.org/Hive/listbucketing.html This jira issue will track DDL change for the feature. It's for both single skewed column and multiple columns. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Work started] (HIVE-3072) Hive List Bucketing - DDL support
[ https://issues.apache.org/jira/browse/HIVE-3072?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Work on HIVE-3072 started by Gang Tim Liu. Hive List Bucketing - DDL support - Key: HIVE-3072 URL: https://issues.apache.org/jira/browse/HIVE-3072 Project: Hive Issue Type: New Feature Components: SQL Reporter: Gang Tim Liu Assignee: Gang Tim Liu Attachments: HIVE-3072.patch, HIVE-3072.patch.1, HIVE-3072.patch.2 If a hive table column has skewed keys, query performance on non-skewed key is always impacted. Hive List Bucketing feature will address it: https://cwiki.apache.org/Hive/listbucketing.html This jira issue will track DDL change for the feature. It's for both single skewed column and multiple columns. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-3072) Hive List Bucketing - DDL support
[ https://issues.apache.org/jira/browse/HIVE-3072?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gang Tim Liu updated HIVE-3072: --- Status: Patch Available (was: In Progress) Patch available in phabricator and jira. Hive List Bucketing - DDL support - Key: HIVE-3072 URL: https://issues.apache.org/jira/browse/HIVE-3072 Project: Hive Issue Type: New Feature Components: SQL Reporter: Gang Tim Liu Assignee: Gang Tim Liu Attachments: HIVE-3072.patch, HIVE-3072.patch.1, HIVE-3072.patch.2 If a hive table column has skewed keys, query performance on non-skewed key is always impacted. Hive List Bucketing feature will address it: https://cwiki.apache.org/Hive/listbucketing.html This jira issue will track DDL change for the feature. It's for both single skewed column and multiple columns. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-3394) Refactor a few classes' constructors with creational patterns
[ https://issues.apache.org/jira/browse/HIVE-3394?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gang Tim Liu updated HIVE-3394: --- Summary: Refactor a few classes' constructors with creational patterns (was: Refractor a few classes' constructor with builder pattern) Refactor a few classes' constructors with creational patterns - Key: HIVE-3394 URL: https://issues.apache.org/jira/browse/HIVE-3394 Project: Hive Issue Type: Bug Reporter: Gang Tim Liu Assignee: Gang Tim Liu Priority: Minor It's good to refactor the following classes' constructors with builder/factory pattern. This should be done before HIVE-3072 so that HIVE-3072 can refresh according and extend it to skewed use case: 1. MStorageDescriptor.java 2. ColumnInfo.java 3. ParseContext.java 4. CreateTableDesc.java 5. ExprNodeColumnDesc.java -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HIVE-3394) Refactor a few classes' constructors with creational patterns
[ https://issues.apache.org/jira/browse/HIVE-3394?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13436969#comment-13436969 ] Gang Tim Liu commented on HIVE-3394: Problem === Several classes have many parameters in constructor and it doesn't scale well. Solution After reviewing design patterns (factory method/abstract factory/builder/static factory) in GOF and builder pattern in Joshua's Effective in JAVA (Item #2), feel builder pattern in effective in Java is a good fit since it is designed to solve our problem. Details on each instance 1. MStorageDescriptor.java use builder pattern 2. ColumnInfo.java # of parameter is not big yet but the number of references to the constructors is huge: 40+. risk is high to change them all. solution is use javabean pattern (setter/getter) for new parameters in hive 3072 3. ParseContext.java use builder pattern 4. CreateTableDesc.java use builder pattern 5. ExprNodeColumnDesc.java same as #2 Refactor a few classes' constructors with creational patterns - Key: HIVE-3394 URL: https://issues.apache.org/jira/browse/HIVE-3394 Project: Hive Issue Type: Bug Reporter: Gang Tim Liu Assignee: Gang Tim Liu Priority: Minor It's good to refactor the following classes' constructors with builder/factory pattern. This should be done before HIVE-3072 so that HIVE-3072 can refresh according and extend it to skewed use case: 1. MStorageDescriptor.java 2. ColumnInfo.java 3. ParseContext.java 4. CreateTableDesc.java 5. ExprNodeColumnDesc.java -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HIVE-3394) Refactor a few classes' constructors with creational patterns
[ https://issues.apache.org/jira/browse/HIVE-3394?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13436996#comment-13436996 ] Gang Tim Liu commented on HIVE-3394: second thought on MStorageDescriptor.java. It's model class for JDO and let's keep it simple. we can use setter/getter here. Refactor a few classes' constructors with creational patterns - Key: HIVE-3394 URL: https://issues.apache.org/jira/browse/HIVE-3394 Project: Hive Issue Type: Bug Reporter: Gang Tim Liu Assignee: Gang Tim Liu Priority: Minor It's good to refactor the following classes' constructors with builder/factory pattern. This should be done before HIVE-3072 so that HIVE-3072 can refresh according and extend it to skewed use case: 1. MStorageDescriptor.java 2. ColumnInfo.java 3. ParseContext.java 4. CreateTableDesc.java 5. ExprNodeColumnDesc.java -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HIVE-3394) Refactor a few classes' constructors with creational patterns
[ https://issues.apache.org/jira/browse/HIVE-3394?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13437036#comment-13437036 ] Gang Tim Liu commented on HIVE-3394: https://reviews.facebook.net/D4719 Refactor a few classes' constructors with creational patterns - Key: HIVE-3394 URL: https://issues.apache.org/jira/browse/HIVE-3394 Project: Hive Issue Type: Bug Reporter: Gang Tim Liu Assignee: Gang Tim Liu Priority: Minor Attachments: HIVE-3394.patch.1 It's good to refactor the following classes' constructors with builder/factory pattern. This should be done before HIVE-3072 so that HIVE-3072 can refresh according and extend it to skewed use case: 1. MStorageDescriptor.java 2. ColumnInfo.java 3. ParseContext.java 4. CreateTableDesc.java 5. ExprNodeColumnDesc.java -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Work started] (HIVE-3394) Refactor a few classes' constructors with creational patterns
[ https://issues.apache.org/jira/browse/HIVE-3394?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Work on HIVE-3394 started by Gang Tim Liu. Refactor a few classes' constructors with creational patterns - Key: HIVE-3394 URL: https://issues.apache.org/jira/browse/HIVE-3394 Project: Hive Issue Type: Bug Reporter: Gang Tim Liu Assignee: Gang Tim Liu Priority: Minor Attachments: HIVE-3394.patch.1 It's good to refactor the following classes' constructors with builder/factory pattern. This should be done before HIVE-3072 so that HIVE-3072 can refresh according and extend it to skewed use case: 1. MStorageDescriptor.java 2. ColumnInfo.java 3. ParseContext.java 4. CreateTableDesc.java 5. ExprNodeColumnDesc.java -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-3394) Refactor a few classes' constructors with creational patterns
[ https://issues.apache.org/jira/browse/HIVE-3394?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gang Tim Liu updated HIVE-3394: --- Attachment: HIVE-3394.patch.1 Refactor a few classes' constructors with creational patterns - Key: HIVE-3394 URL: https://issues.apache.org/jira/browse/HIVE-3394 Project: Hive Issue Type: Bug Reporter: Gang Tim Liu Assignee: Gang Tim Liu Priority: Minor Attachments: HIVE-3394.patch.1 It's good to refactor the following classes' constructors with builder/factory pattern. This should be done before HIVE-3072 so that HIVE-3072 can refresh according and extend it to skewed use case: 1. MStorageDescriptor.java 2. ColumnInfo.java 3. ParseContext.java 4. CreateTableDesc.java 5. ExprNodeColumnDesc.java -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-3394) Refactor a few classes' constructors with creational patterns
[ https://issues.apache.org/jira/browse/HIVE-3394?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gang Tim Liu updated HIVE-3394: --- Status: Patch Available (was: In Progress) Patch is ready for review in both jira and phabricator. Refactor a few classes' constructors with creational patterns - Key: HIVE-3394 URL: https://issues.apache.org/jira/browse/HIVE-3394 Project: Hive Issue Type: Bug Reporter: Gang Tim Liu Assignee: Gang Tim Liu Priority: Minor Attachments: HIVE-3394.patch.1 It's good to refactor the following classes' constructors with builder/factory pattern. This should be done before HIVE-3072 so that HIVE-3072 can refresh according and extend it to skewed use case: 1. MStorageDescriptor.java 2. ColumnInfo.java 3. ParseContext.java 4. CreateTableDesc.java 5. ExprNodeColumnDesc.java -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Created] (HIVE-3390) Hive List Bucketing - DDL support - DB upgrade script for Derby, Postgres, and Oracle
Gang Tim Liu created HIVE-3390: -- Summary: Hive List Bucketing - DDL support - DB upgrade script for Derby, Postgres, and Oracle Key: HIVE-3390 URL: https://issues.apache.org/jira/browse/HIVE-3390 Project: Hive Issue Type: New Feature Reporter: Gang Tim Liu This is a follow-up for HIVE-3072. We need upgrade scripts for Derby, Postgres, and Oracle. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HIVE-3390) Hive List Bucketing - DDL support - DB upgrade script for Derby, Postgres, and Oracle
[ https://issues.apache.org/jira/browse/HIVE-3390?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13436246#comment-13436246 ] Gang Tim Liu commented on HIVE-3390: @Carl, got you. do you have some instructions to generate upgrade script for derby, postgres and Orable? Are you using SchemaTool? If you have instructions, it will be big help. thanks Hive List Bucketing - DDL support - DB upgrade script for Derby, Postgres, and Oracle - Key: HIVE-3390 URL: https://issues.apache.org/jira/browse/HIVE-3390 Project: Hive Issue Type: New Feature Reporter: Gang Tim Liu This is a follow-up for HIVE-3072. We need upgrade scripts for Derby, Postgres, and Oracle. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HIVE-3390) Hive List Bucketing - DDL support - DB upgrade script for Derby, Postgres, and Oracle
[ https://issues.apache.org/jira/browse/HIVE-3390?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13436253#comment-13436253 ] Gang Tim Liu commented on HIVE-3390: I see. will do it last. I am making changes and target to get you a patch to review today. thanks Hive List Bucketing - DDL support - DB upgrade script for Derby, Postgres, and Oracle - Key: HIVE-3390 URL: https://issues.apache.org/jira/browse/HIVE-3390 Project: Hive Issue Type: New Feature Reporter: Gang Tim Liu This is a follow-up for HIVE-3072. We need upgrade scripts for Derby, Postgres, and Oracle. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Work started] (HIVE-3072) Hive List Bucketing - DDL support
[ https://issues.apache.org/jira/browse/HIVE-3072?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Work on HIVE-3072 started by Gang Tim Liu. Hive List Bucketing - DDL support - Key: HIVE-3072 URL: https://issues.apache.org/jira/browse/HIVE-3072 Project: Hive Issue Type: New Feature Components: SQL Reporter: Gang Tim Liu Assignee: Gang Tim Liu Attachments: HIVE-3072.patch If a hive table column has skewed keys, query performance on non-skewed key is always impacted. Hive List Bucketing feature will address it: https://cwiki.apache.org/Hive/listbucketing.html This jira issue will track DDL change for the feature. It's for both single skewed column and multiple columns. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-3072) Hive List Bucketing - DDL support
[ https://issues.apache.org/jira/browse/HIVE-3072?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gang Tim Liu updated HIVE-3072: --- Attachment: HIVE-3072.patch.1 Hive List Bucketing - DDL support - Key: HIVE-3072 URL: https://issues.apache.org/jira/browse/HIVE-3072 Project: Hive Issue Type: New Feature Components: SQL Reporter: Gang Tim Liu Assignee: Gang Tim Liu Attachments: HIVE-3072.patch, HIVE-3072.patch.1 If a hive table column has skewed keys, query performance on non-skewed key is always impacted. Hive List Bucketing feature will address it: https://cwiki.apache.org/Hive/listbucketing.html This jira issue will track DDL change for the feature. It's for both single skewed column and multiple columns. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Work started] (HIVE-3072) Hive List Bucketing - DDL support
[ https://issues.apache.org/jira/browse/HIVE-3072?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Work on HIVE-3072 started by Gang Tim Liu. Hive List Bucketing - DDL support - Key: HIVE-3072 URL: https://issues.apache.org/jira/browse/HIVE-3072 Project: Hive Issue Type: New Feature Components: SQL Reporter: Gang Tim Liu Assignee: Gang Tim Liu If a hive table column has skewed keys, query performance on non-skewed key is always impacted. Hive List Bucketing feature will address it: https://cwiki.apache.org/Hive/listbucketing.html This jira issue will track DDL change for the feature. It's for both single skewed column and multiple columns. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-3072) Hive List Bucketing - DDL support
[ https://issues.apache.org/jira/browse/HIVE-3072?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gang Tim Liu updated HIVE-3072: --- Attachment: HIVE-3072.patch Hive List Bucketing - DDL support - Key: HIVE-3072 URL: https://issues.apache.org/jira/browse/HIVE-3072 Project: Hive Issue Type: New Feature Components: SQL Reporter: Gang Tim Liu Assignee: Gang Tim Liu Attachments: HIVE-3072.patch If a hive table column has skewed keys, query performance on non-skewed key is always impacted. Hive List Bucketing feature will address it: https://cwiki.apache.org/Hive/listbucketing.html This jira issue will track DDL change for the feature. It's for both single skewed column and multiple columns. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-3072) Hive List Bucketing - DDL support
[ https://issues.apache.org/jira/browse/HIVE-3072?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gang Tim Liu updated HIVE-3072: --- Status: Patch Available (was: In Progress) Patch is available for review. Hive List Bucketing - DDL support - Key: HIVE-3072 URL: https://issues.apache.org/jira/browse/HIVE-3072 Project: Hive Issue Type: New Feature Components: SQL Reporter: Gang Tim Liu Assignee: Gang Tim Liu Attachments: HIVE-3072.patch If a hive table column has skewed keys, query performance on non-skewed key is always impacted. Hive List Bucketing feature will address it: https://cwiki.apache.org/Hive/listbucketing.html This jira issue will track DDL change for the feature. It's for both single skewed column and multiple columns. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-3372) Local mode errors point to the wrong file
[ https://issues.apache.org/jira/browse/HIVE-3372?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gang Tim Liu updated HIVE-3372: --- Priority: Trivial (was: Major) Issue Type: Task (was: Bug) Atlassian JIRA doesn't create the jira issue correctly: 1. no action to submit patch 2. no action to resolve issue etc so far, no way to clean them up. temporarily move to task. Local mode errors point to the wrong file - Key: HIVE-3372 URL: https://issues.apache.org/jira/browse/HIVE-3372 Project: Hive Issue Type: Task Components: CLI Reporter: Gang Tim Liu Assignee: Gang Tim Liu Priority: Trivial This is a classic example of a query that fails due to a bad udf in local mode: Ended Job = job_local_0001 with errors Error during job, obtaining debugging information... Execution failed with exit status: 2 Obtaining error information Task failed! Task ID: Stage-1 Logs: /tmp/gang/hive.log FAILED: Execution Error, return code 2 from org.apache.hadoop.hive.ql.exec.MapRedTask But actually local mr job in /tmp/gang/gang_20120808181010_b716b5fb-5b8f-49d2-94e9-95bb8f465342.log -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-3371) Local mode errors point to the wrong file
[ https://issues.apache.org/jira/browse/HIVE-3371?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gang Tim Liu updated HIVE-3371: --- Priority: Trivial (was: Minor) Issue Type: Task (was: Bug) Atlassian JIRA doesn't create the jira issue correctly: 1. no action to submit patch 2. no action to resolve issue etc so far, no way to clean them up. temporarily move to task. Local mode errors point to the wrong file - Key: HIVE-3371 URL: https://issues.apache.org/jira/browse/HIVE-3371 Project: Hive Issue Type: Task Components: CLI Reporter: Gang Tim Liu Assignee: Gang Tim Liu Priority: Trivial Attachments: hive-3371 Time Spent: 48h Remaining Estimate: 0h This is a classic example of a query that fails due to a bad udf in local mode: Ended Job = job_local_0001 with errors Error during job, obtaining debugging information... Execution failed with exit status: 2 Obtaining error information Task failed! Task ID: Stage-1 Logs: /tmp/gang/hive.log FAILED: Execution Error, return code 2 from org.apache.hadoop.hive.ql.exec.MapRedTask But actually local mr job in /tmp/gang/gang_20120808181010_b716b5fb-5b8f-49d2-94e9-95bb8f465342.log -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-3347) Local mode errors point to the wrong file
[ https://issues.apache.org/jira/browse/HIVE-3347?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gang Tim Liu updated HIVE-3347: --- Priority: Trivial (was: Major) Issue Type: Task (was: Bug) Atlassian JIRA doesn't create the jira issue correctly: 1. no action to submit patch 2. no action to resolve issue etc so far, no way to clean them up. temporarily move to task. Local mode errors point to the wrong file - Key: HIVE-3347 URL: https://issues.apache.org/jira/browse/HIVE-3347 Project: Hive Issue Type: Task Reporter: Gang Tim Liu Assignee: Gang Tim Liu Priority: Trivial This is a classic example of a query that fails due to a bad udf in local mode: Ended Job = job_local_0001 with errors Error during job, obtaining debugging information... Execution failed with exit status: 2 Obtaining error information Task failed! Task ID: Stage-1 Logs: /tmp/gang/hive.log FAILED: Execution Error, return code 2 from org.apache.hadoop.hive.ql.exec.MapRedTask But actually local mr job in /tmp/gang/gang_20120808181010_b716b5fb-5b8f-49d2-94e9-95bb8f465342.log -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-3346) Local mode errors point to the wrong file
[ https://issues.apache.org/jira/browse/HIVE-3346?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gang Tim Liu updated HIVE-3346: --- Priority: Trivial (was: Major) Issue Type: Task (was: Bug) Atlassian JIRA doesn't create the jira issue correctly: 1. no action to submit patch 2. no action to resolve issue etc so far, no way to clean them up. temporarily move to task. Local mode errors point to the wrong file - Key: HIVE-3346 URL: https://issues.apache.org/jira/browse/HIVE-3346 Project: Hive Issue Type: Task Reporter: Gang Tim Liu Assignee: Gang Tim Liu Priority: Trivial This is a classic example of a query that fails due to a bad udf in local mode: Ended Job = job_local_0001 with errors Error during job, obtaining debugging information... Execution failed with exit status: 2 Obtaining error information Task failed! Task ID: Stage-1 Logs: /tmp/gang/hive.log FAILED: Execution Error, return code 2 from org.apache.hadoop.hive.ql.exec.MapRedTask But actually local mr job in /tmp/gang/gang_20120808181010_b716b5fb-5b8f-49d2-94e9-95bb8f465342.log -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-3370) Local mode errors point to the wrong file
[ https://issues.apache.org/jira/browse/HIVE-3370?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gang Tim Liu updated HIVE-3370: --- Priority: Trivial (was: Major) Issue Type: Task (was: Bug) Atlassian JIRA doesn't create the jira issue correctly: 1. no action to submit patch 2. no action to resolve issue etc so far, no way to clean them up. temporarily move to task. Local mode errors point to the wrong file - Key: HIVE-3370 URL: https://issues.apache.org/jira/browse/HIVE-3370 Project: Hive Issue Type: Task Reporter: Gang Tim Liu Assignee: Gang Tim Liu Priority: Trivial This is a classic example of a query that fails due to a bad udf in local mode: Ended Job = job_local_0001 with errors Error during job, obtaining debugging information... Execution failed with exit status: 2 Obtaining error information Task failed! Task ID: Stage-1 Logs: /tmp/gang/hive.log FAILED: Execution Error, return code 2 from org.apache.hadoop.hive.ql.exec.MapRedTask But actually local mr job in /tmp/gang/gang_20120808181010_b716b5fb-5b8f-49d2-94e9-95bb8f465342.log -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Created] (HIVE-3376) Local mode errors point to the wrong file
Gang Tim Liu created HIVE-3376: -- Summary: Local mode errors point to the wrong file Key: HIVE-3376 URL: https://issues.apache.org/jira/browse/HIVE-3376 Project: Hive Issue Type: Bug Reporter: Gang Tim Liu This is a classic example of a query that fails due to a bad udf in local mode: Ended Job = job_local_0001 with errors Error during job, obtaining debugging information... Execution failed with exit status: 2 Obtaining error information Task failed! Task ID: Stage-1 Logs: /tmp/gang/hive.log FAILED: Execution Error, return code 2 from org.apache.hadoop.hive.ql.exec.MapRedTask But actually local mr job in /tmp/gang/gang_20120808181010_b716b5fb-5b8f-49d2-94e9-95bb8f465342.log -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Assigned] (HIVE-3376) Local mode errors point to the wrong file
[ https://issues.apache.org/jira/browse/HIVE-3376?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gang Tim Liu reassigned HIVE-3376: -- Assignee: Gang Tim Liu Local mode errors point to the wrong file - Key: HIVE-3376 URL: https://issues.apache.org/jira/browse/HIVE-3376 Project: Hive Issue Type: Bug Reporter: Gang Tim Liu Assignee: Gang Tim Liu Attachments: hive-3371 This is a classic example of a query that fails due to a bad udf in local mode: Ended Job = job_local_0001 with errors Error during job, obtaining debugging information... Execution failed with exit status: 2 Obtaining error information Task failed! Task ID: Stage-1 Logs: /tmp/gang/hive.log FAILED: Execution Error, return code 2 from org.apache.hadoop.hive.ql.exec.MapRedTask But actually local mr job in /tmp/gang/gang_20120808181010_b716b5fb-5b8f-49d2-94e9-95bb8f465342.log -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Work started] (HIVE-3376) Local mode errors point to the wrong file
[ https://issues.apache.org/jira/browse/HIVE-3376?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Work on HIVE-3376 started by Gang Tim Liu. Local mode errors point to the wrong file - Key: HIVE-3376 URL: https://issues.apache.org/jira/browse/HIVE-3376 Project: Hive Issue Type: Bug Reporter: Gang Tim Liu Assignee: Gang Tim Liu Attachments: hive-3371 This is a classic example of a query that fails due to a bad udf in local mode: Ended Job = job_local_0001 with errors Error during job, obtaining debugging information... Execution failed with exit status: 2 Obtaining error information Task failed! Task ID: Stage-1 Logs: /tmp/gang/hive.log FAILED: Execution Error, return code 2 from org.apache.hadoop.hive.ql.exec.MapRedTask But actually local mr job in /tmp/gang/gang_20120808181010_b716b5fb-5b8f-49d2-94e9-95bb8f465342.log -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HIVE-3376) Local mode errors point to the wrong file
[ https://issues.apache.org/jira/browse/HIVE-3376?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13433316#comment-13433316 ] Gang Tim Liu commented on HIVE-3376: https://reviews.facebook.net/D4575 Local mode errors point to the wrong file - Key: HIVE-3376 URL: https://issues.apache.org/jira/browse/HIVE-3376 Project: Hive Issue Type: Bug Reporter: Gang Tim Liu Assignee: Gang Tim Liu Attachments: hive-3371 This is a classic example of a query that fails due to a bad udf in local mode: Ended Job = job_local_0001 with errors Error during job, obtaining debugging information... Execution failed with exit status: 2 Obtaining error information Task failed! Task ID: Stage-1 Logs: /tmp/gang/hive.log FAILED: Execution Error, return code 2 from org.apache.hadoop.hive.ql.exec.MapRedTask But actually local mr job in /tmp/gang/gang_20120808181010_b716b5fb-5b8f-49d2-94e9-95bb8f465342.log -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-3376) Local mode errors point to the wrong file
[ https://issues.apache.org/jira/browse/HIVE-3376?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gang Tim Liu updated HIVE-3376: --- Attachment: hive-3371 Local mode errors point to the wrong file - Key: HIVE-3376 URL: https://issues.apache.org/jira/browse/HIVE-3376 Project: Hive Issue Type: Bug Reporter: Gang Tim Liu Assignee: Gang Tim Liu Attachments: hive-3371 This is a classic example of a query that fails due to a bad udf in local mode: Ended Job = job_local_0001 with errors Error during job, obtaining debugging information... Execution failed with exit status: 2 Obtaining error information Task failed! Task ID: Stage-1 Logs: /tmp/gang/hive.log FAILED: Execution Error, return code 2 from org.apache.hadoop.hive.ql.exec.MapRedTask But actually local mr job in /tmp/gang/gang_20120808181010_b716b5fb-5b8f-49d2-94e9-95bb8f465342.log -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-3376) Local mode errors point to the wrong file
[ https://issues.apache.org/jira/browse/HIVE-3376?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gang Tim Liu updated HIVE-3376: --- Status: Patch Available (was: In Progress) Local mode errors point to the wrong file - Key: HIVE-3376 URL: https://issues.apache.org/jira/browse/HIVE-3376 Project: Hive Issue Type: Bug Reporter: Gang Tim Liu Assignee: Gang Tim Liu Attachments: hive-3371 This is a classic example of a query that fails due to a bad udf in local mode: Ended Job = job_local_0001 with errors Error during job, obtaining debugging information... Execution failed with exit status: 2 Obtaining error information Task failed! Task ID: Stage-1 Logs: /tmp/gang/hive.log FAILED: Execution Error, return code 2 from org.apache.hadoop.hive.ql.exec.MapRedTask But actually local mr job in /tmp/gang/gang_20120808181010_b716b5fb-5b8f-49d2-94e9-95bb8f465342.log -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HIVE-3376) Local mode errors point to the wrong file
[ https://issues.apache.org/jira/browse/HIVE-3376?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13433325#comment-13433325 ] Gang Tim Liu commented on HIVE-3376: While executing in local mode, the process to collect failed job information is a different process from running local MR job. Each process reads a different log4j property file. One points to hive.log and another is MR job log. No communication on 2 processes yet on job log. There are some ways like external files etc. Consider the bug impact itself (log file location), easy way is to pass MR job stream and pick up log file location and output in CLI stream. Local mode errors point to the wrong file - Key: HIVE-3376 URL: https://issues.apache.org/jira/browse/HIVE-3376 Project: Hive Issue Type: Bug Reporter: Gang Tim Liu Assignee: Gang Tim Liu Attachments: hive-3371 This is a classic example of a query that fails due to a bad udf in local mode: Ended Job = job_local_0001 with errors Error during job, obtaining debugging information... Execution failed with exit status: 2 Obtaining error information Task failed! Task ID: Stage-1 Logs: /tmp/gang/hive.log FAILED: Execution Error, return code 2 from org.apache.hadoop.hive.ql.exec.MapRedTask But actually local mr job in /tmp/gang/gang_20120808181010_b716b5fb-5b8f-49d2-94e9-95bb8f465342.log -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HIVE-3072) Hive List Bucketing - DDL support
[ https://issues.apache.org/jira/browse/HIVE-3072?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13433326#comment-13433326 ] Gang Tim Liu commented on HIVE-3072: https://reviews.facebook.net/D4599 Hive List Bucketing - DDL support - Key: HIVE-3072 URL: https://issues.apache.org/jira/browse/HIVE-3072 Project: Hive Issue Type: New Feature Components: SQL Reporter: Gang Tim Liu Assignee: Gang Tim Liu If a hive table column has skewed keys, query performance on non-skewed key is always impacted. Hive List Bucketing feature will address it: https://cwiki.apache.org/Hive/listbucketing.html This jira issue will track DDL change for the feature. It's for both single skewed column and multiple columns. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HIVE-3072) Hive List Bucketing - DDL support
[ https://issues.apache.org/jira/browse/HIVE-3072?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13433328#comment-13433328 ] Gang Tim Liu commented on HIVE-3072: There are a few reasons to release a DDL as a patch: 1. the content is getting bigger. might be better to get it review so that we can capture issues earlier. 2. the skewed grammar will not only benefit list bucketing feature but also other feature like skewed join. releasing grammar patch will unblock others' development. Hive List Bucketing - DDL support - Key: HIVE-3072 URL: https://issues.apache.org/jira/browse/HIVE-3072 Project: Hive Issue Type: New Feature Components: SQL Reporter: Gang Tim Liu Assignee: Gang Tim Liu If a hive table column has skewed keys, query performance on non-skewed key is always impacted. Hive List Bucketing feature will address it: https://cwiki.apache.org/Hive/listbucketing.html This jira issue will track DDL change for the feature. It's for both single skewed column and multiple columns. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HIVE-3343) Hive: Query misaligned result for Group by followed by Join with filter and skip a group-by result
[ https://issues.apache.org/jira/browse/HIVE-3343?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13433892#comment-13433892 ] Gang Tim Liu commented on HIVE-3343: thanks Namit Hive: Query misaligned result for Group by followed by Join with filter and skip a group-by result -- Key: HIVE-3343 URL: https://issues.apache.org/jira/browse/HIVE-3343 Project: Hive Issue Type: Bug Components: Query Processor Reporter: Gang Tim Liu Assignee: Gang Tim Liu Attachments: hive-3343.patch This simple Hive query would generate wrong result: select a.key, b.k2, b.k3 from src a join ( select key, min(key) as k, min(key)+1 as k1, min(key)+2 as k2, min(key)+3 as k3 from src group by key ) b on a.key=b.key and b.k1 5; 0 3.0 1.0 0 3.0 1.0 0 3.0 1.0 2 5.0 3.0 The right result is: 0 2.0 3.0 0 2.0 3.0 0 2.0 3.0 2 4.0 5.0 -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HIVE-3072) Hive List Bucketing - DDL support
[ https://issues.apache.org/jira/browse/HIVE-3072?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13433904#comment-13433904 ] Gang Tim Liu commented on HIVE-3072: yes, I am. Hive List Bucketing - DDL support - Key: HIVE-3072 URL: https://issues.apache.org/jira/browse/HIVE-3072 Project: Hive Issue Type: New Feature Components: SQL Reporter: Gang Tim Liu Assignee: Gang Tim Liu If a hive table column has skewed keys, query performance on non-skewed key is always impacted. Hive List Bucketing feature will address it: https://cwiki.apache.org/Hive/listbucketing.html This jira issue will track DDL change for the feature. It's for both single skewed column and multiple columns. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HIVE-3371) Local mode errors point to the wrong file
[ https://issues.apache.org/jira/browse/HIVE-3371?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13432283#comment-13432283 ] Gang Tim Liu commented on HIVE-3371: https://reviews.facebook.net/D4575 Local mode errors point to the wrong file - Key: HIVE-3371 URL: https://issues.apache.org/jira/browse/HIVE-3371 Project: Hive Issue Type: Bug Components: CLI Reporter: Gang Tim Liu Assignee: Gang Tim Liu Priority: Minor This is a classic example of a query that fails due to a bad udf in local mode: Ended Job = job_local_0001 with errors Error during job, obtaining debugging information... Execution failed with exit status: 2 Obtaining error information Task failed! Task ID: Stage-1 Logs: /tmp/gang/hive.log FAILED: Execution Error, return code 2 from org.apache.hadoop.hive.ql.exec.MapRedTask But actually local mr job in /tmp/gang/gang_20120808181010_b716b5fb-5b8f-49d2-94e9-95bb8f465342.log -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-3371) Local mode errors point to the wrong file
[ https://issues.apache.org/jira/browse/HIVE-3371?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gang Tim Liu updated HIVE-3371: --- Attachment: hive-3371 Local mode errors point to the wrong file - Key: HIVE-3371 URL: https://issues.apache.org/jira/browse/HIVE-3371 Project: Hive Issue Type: Bug Components: CLI Reporter: Gang Tim Liu Assignee: Gang Tim Liu Priority: Minor Attachments: hive-3371 This is a classic example of a query that fails due to a bad udf in local mode: Ended Job = job_local_0001 with errors Error during job, obtaining debugging information... Execution failed with exit status: 2 Obtaining error information Task failed! Task ID: Stage-1 Logs: /tmp/gang/hive.log FAILED: Execution Error, return code 2 from org.apache.hadoop.hive.ql.exec.MapRedTask But actually local mr job in /tmp/gang/gang_20120808181010_b716b5fb-5b8f-49d2-94e9-95bb8f465342.log -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Work logged] (HIVE-3371) Local mode errors point to the wrong file
[ https://issues.apache.org/jira/browse/HIVE-3371?page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel\#worklog-{worklog.getId()} ] Gang Tim Liu logged work on HIVE-3371: -- Author: Gang Tim Liu Created on: 09/Aug/12 23:08 Start Date: 09/Aug/12 23:08 Worklog Time Spent: 48h Issue Time Tracking --- Worklog Id: (was: 13834) Time Spent: 48h Remaining Estimate: 0h Local mode errors point to the wrong file - Key: HIVE-3371 URL: https://issues.apache.org/jira/browse/HIVE-3371 Project: Hive Issue Type: Bug Components: CLI Reporter: Gang Tim Liu Assignee: Gang Tim Liu Priority: Minor Attachments: hive-3371 Time Spent: 48h Remaining Estimate: 0h This is a classic example of a query that fails due to a bad udf in local mode: Ended Job = job_local_0001 with errors Error during job, obtaining debugging information... Execution failed with exit status: 2 Obtaining error information Task failed! Task ID: Stage-1 Logs: /tmp/gang/hive.log FAILED: Execution Error, return code 2 from org.apache.hadoop.hive.ql.exec.MapRedTask But actually local mr job in /tmp/gang/gang_20120808181010_b716b5fb-5b8f-49d2-94e9-95bb8f465342.log -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
Atlassian JIRA Issue.
Dear all, Have you seen problem while creating jira issue today? I tried to create jira issue and it showed exception in window but actually issue is created. The created issues are abnormal: it missed actions like submit patch. It also doesn't allow me to close/resolve them. So, you will see a few of them for the same issue: HIVE-3372 https://issues.apache.org/jira/browse/HIVE-3372 HIVE-3371 https://issues.apache.org/jira/browse/HIVE-3371 HIVE-3370 https://issues.apache.org/jira/browse/HIVE-3370 HIVE-3369 https://issues.apache.org/jira/browse/HIVE-3369 Thanks Tim On 8/9/12 4:09 PM, Gang Tim Liu (JIRA) j...@apache.org wrote: [ https://issues.apache.org/jira/browse/HIVE-3371?page=com.atlassian.jira.pl ugin.system.issuetabpanels:worklog-tabpanel\#worklog-{worklog.getId()} ] Gang Tim Liu logged work on HIVE-3371: -- Author: Gang Tim Liu Created on: 09/Aug/12 23:08 Start Date: 09/Aug/12 23:08 Worklog Time Spent: 48h Issue Time Tracking --- Worklog Id: (was: 13834) Time Spent: 48h Remaining Estimate: 0h Local mode errors point to the wrong file - Key: HIVE-3371 URL: https://issues.apache.org/jira/browse/HIVE-3371 Project: Hive Issue Type: Bug Components: CLI Reporter: Gang Tim Liu Assignee: Gang Tim Liu Priority: Minor Attachments: hive-3371 Time Spent: 48h Remaining Estimate: 0h This is a classic example of a query that fails due to a bad udf in local mode: Ended Job = job_local_0001 with errors Error during job, obtaining debugging information... Execution failed with exit status: 2 Obtaining error information Task failed! Task ID: Stage-1 Logs: /tmp/gang/hive.log FAILED: Execution Error, return code 2 from org.apache.hadoop.hive.ql.exec.MapRedTask But actually local mr job in /tmp/gang/gang_20120808181010_b716b5fb-5b8f-49d2-94e9-95bb8f465342.log -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-3343) Hive: Query misaligned result for Group by followed by Join with filter and skip a group-by result
[ https://issues.apache.org/jira/browse/HIVE-3343?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gang Tim Liu updated HIVE-3343: --- Attachment: hive-3343.patch Hive: Query misaligned result for Group by followed by Join with filter and skip a group-by result -- Key: HIVE-3343 URL: https://issues.apache.org/jira/browse/HIVE-3343 Project: Hive Issue Type: Bug Components: Query Processor Reporter: Gang Tim Liu Assignee: Gang Tim Liu Attachments: hive-3343.patch This simple Hive query would generate wrong result: select a.key, b.k2, b.k3 from src a join ( select key, min(key) as k, min(key)+1 as k1, min(key)+2 as k2, min(key)+3 as k3 from src group by key ) b on a.key=b.key and b.k1 5; 0 3.0 1.0 0 3.0 1.0 0 3.0 1.0 2 5.0 3.0 The right result is: 0 2.0 3.0 0 2.0 3.0 0 2.0 3.0 2 4.0 5.0 -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-3343) Hive: Query misaligned result for Group by followed by Join with filter and skip a group-by result
[ https://issues.apache.org/jira/browse/HIVE-3343?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gang Tim Liu updated HIVE-3343: --- Status: Patch Available (was: Open) patch is submitted. Hive: Query misaligned result for Group by followed by Join with filter and skip a group-by result -- Key: HIVE-3343 URL: https://issues.apache.org/jira/browse/HIVE-3343 Project: Hive Issue Type: Bug Components: Query Processor Reporter: Gang Tim Liu Assignee: Gang Tim Liu Attachments: hive-3343.patch This simple Hive query would generate wrong result: select a.key, b.k2, b.k3 from src a join ( select key, min(key) as k, min(key)+1 as k1, min(key)+2 as k2, min(key)+3 as k3 from src group by key ) b on a.key=b.key and b.k1 5; 0 3.0 1.0 0 3.0 1.0 0 3.0 1.0 2 5.0 3.0 The right result is: 0 2.0 3.0 0 2.0 3.0 0 2.0 3.0 2 4.0 5.0 -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Created] (HIVE-3342) Hive: Query misaligned result for Group by followed by Join with filter and skip a group-by result
Gang Tim Liu created HIVE-3342: -- Summary: Hive: Query misaligned result for Group by followed by Join with filter and skip a group-by result Key: HIVE-3342 URL: https://issues.apache.org/jira/browse/HIVE-3342 Project: Hive Issue Type: Bug Components: Query Processor Reporter: Gang Tim Liu This simple Hive query would generate wrong result: select a.key, b.k2, b.k3 from src a join ( select key, min(key) as k, min(key)+1 as k1, min(key)+2 as k2, min(key)+3 as k3 from src group by key ) b on a.key=b.key and b.k1 5; 0 3.0 1.0 0 3.0 1.0 0 3.0 1.0 2 5.0 3.0 The right result is: 0 2.0 3.0 0 2.0 3.0 0 2.0 3.0 2 4.0 5.0 -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Created] (HIVE-3343) Hive: Query misaligned result for Group by followed by Join with filter and skip a group-by result
Gang Tim Liu created HIVE-3343: -- Summary: Hive: Query misaligned result for Group by followed by Join with filter and skip a group-by result Key: HIVE-3343 URL: https://issues.apache.org/jira/browse/HIVE-3343 Project: Hive Issue Type: Bug Components: Query Processor Reporter: Gang Tim Liu Assignee: Gang Tim Liu This simple Hive query would generate wrong result: select a.key, b.k2, b.k3 from src a join ( select key, min(key) as k, min(key)+1 as k1, min(key)+2 as k2, min(key)+3 as k3 from src group by key ) b on a.key=b.key and b.k1 5; 0 3.0 1.0 0 3.0 1.0 0 3.0 1.0 2 5.0 3.0 The right result is: 0 2.0 3.0 0 2.0 3.0 0 2.0 3.0 2 4.0 5.0 -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HIVE-3343) Hive: Query misaligned result for Group by followed by Join with filter and skip a group-by result
[ https://issues.apache.org/jira/browse/HIVE-3343?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13430757#comment-13430757 ] Gang Tim Liu commented on HIVE-3343: If we remove b.k1 5, it will generate right result. explain extended shows Select Operator has different order of columns (_col0, _col3, _col4, _col2) from filter operator's (_col0,_col2,_col3,_col4). Here is snapshot: Reduce Operator Tree: Group By Operator ... outputColumnNames: _col0, _col1 Select Operator ... outputColumnNames: _col0, _col3, _col4, _col2 Filter Operator ... predicate: expr: (_col2 5.0) type: boolean File Output Operator ... table: ... properties: columns _col0,_col2,_col3,_col4 columns.types string,double,double,double escape.delim \ ... Hive: Query misaligned result for Group by followed by Join with filter and skip a group-by result -- Key: HIVE-3343 URL: https://issues.apache.org/jira/browse/HIVE-3343 Project: Hive Issue Type: Bug Components: Query Processor Reporter: Gang Tim Liu Assignee: Gang Tim Liu This simple Hive query would generate wrong result: select a.key, b.k2, b.k3 from src a join ( select key, min(key) as k, min(key)+1 as k1, min(key)+2 as k2, min(key)+3 as k3 from src group by key ) b on a.key=b.key and b.k1 5; 0 3.0 1.0 0 3.0 1.0 0 3.0 1.0 2 5.0 3.0 The right result is: 0 2.0 3.0 0 2.0 3.0 0 2.0 3.0 2 4.0 5.0 -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HIVE-3343) Hive: Query misaligned result for Group by followed by Join with filter and skip a group-by result
[ https://issues.apache.org/jira/browse/HIVE-3343?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13430761#comment-13430761 ] Gang Tim Liu commented on HIVE-3343: root-cause is ColumnPrunerFilterProc doesn't perserve order of prunned column list (filterOpPrunedColLists). It causes outputColumnNames: _col0, _col3, _col4, _col2 It should be outputColumnNames: _col0, _col2, _col3, _col4 Hive: Query misaligned result for Group by followed by Join with filter and skip a group-by result -- Key: HIVE-3343 URL: https://issues.apache.org/jira/browse/HIVE-3343 Project: Hive Issue Type: Bug Components: Query Processor Reporter: Gang Tim Liu Assignee: Gang Tim Liu This simple Hive query would generate wrong result: select a.key, b.k2, b.k3 from src a join ( select key, min(key) as k, min(key)+1 as k1, min(key)+2 as k2, min(key)+3 as k3 from src group by key ) b on a.key=b.key and b.k1 5; 0 3.0 1.0 0 3.0 1.0 0 3.0 1.0 2 5.0 3.0 The right result is: 0 2.0 3.0 0 2.0 3.0 0 2.0 3.0 2 4.0 5.0 -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Assigned] (HIVE-3026) List Bucketing in Hive
[ https://issues.apache.org/jira/browse/HIVE-3026?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gang Tim Liu reassigned HIVE-3026: -- Assignee: Gang Tim Liu List Bucketing in Hive -- Key: HIVE-3026 URL: https://issues.apache.org/jira/browse/HIVE-3026 Project: Hive Issue Type: New Feature Reporter: Namit Jain Assignee: Gang Tim Liu Details are at: https://cwiki.apache.org/confluence/display/Hive/ListBucketing Please comment -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HIVE-3026) List Bucketing in Hive
[ https://issues.apache.org/jira/browse/HIVE-3026?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13424988#comment-13424988 ] Gang Tim Liu commented on HIVE-3026: I am working on it and will release the patch soon. List Bucketing in Hive -- Key: HIVE-3026 URL: https://issues.apache.org/jira/browse/HIVE-3026 Project: Hive Issue Type: New Feature Reporter: Namit Jain Assignee: Gang Tim Liu Details are at: https://cwiki.apache.org/confluence/display/Hive/ListBucketing Please comment -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
Re: Lisk Bucketing DDL Patch
Yes, Namit has a great summary. thanks On 7/27/12 9:09 PM, Namit Jain nj...@fb.com wrote: Yes, that patch will become quiet big to be done a single shot. Moreover, the skew information can be used by a variety of use-cases. 1. List Bucketing 2. Skew Joins: https://cwiki.apache.org/Hive/skewed-join-optimization.html 3. Another variant of skew joins: https://issues.apache.org/jira/browse/HIVE-3286 So, the skew may not be limited to be used for list bucketing only. So, it might be simpler to split into DDL and DML support. DDL will be common to all the use-cases who want to use/store skew information. Each use-case can implement the DML/Query separately. Thanks, -namit On 7/28/12 7:07 AM, Carl Steinbach c...@cloudera.com wrote: Since we are close to release the first patch DDL. In a comment on the design doc you said that the first phase would involve implementing this feature for a single-column end-to-end (DML+DDL). Has that plan changed? Thanks. Carl On Wed, Jul 25, 2012 at 12:31 AM, Gang Tim Liu g...@fb.com wrote: Dear all hive developers, Please review the documentation: https://cwiki.apache.org/confluence/display/Hive/ListBucketing Since we are close to release the first patch DDL. We will continue to update the wiki about new information and in the meanwhile want to collect your feedback. Thanks Tim
Lisk Bucketing DDL Patch
Dear all hive developers, Please review the documentation: https://cwiki.apache.org/confluence/display/Hive/ListBucketing Since we are close to release the first patch DDL. We will continue to update the wiki about new information and in the meanwhile want to collect your feedback. Thanks Tim
[jira] [Commented] (HIVE-3251) Hive doesn't remove scrach directories while killing running MR job
[ https://issues.apache.org/jira/browse/HIVE-3251?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13418515#comment-13418515 ] Gang Tim Liu commented on HIVE-3251: Namit, thank you very much for review and commit. Hive doesn't remove scrach directories while killing running MR job --- Key: HIVE-3251 URL: https://issues.apache.org/jira/browse/HIVE-3251 Project: Hive Issue Type: Bug Components: Server Infrastructure Reporter: Gang Tim Liu Assignee: Gang Tim Liu Attachments: HIVE-3151.patch While killing running MR job, hive doesn't clean up scratch directory (mapred.cache.files). So that, afterwards, scratch directory is left there in hdfs. HDFS name node doesn't know it and try to do lease recovery. while such instances happen more, it will eventually crash namenode. The fix is to leverage hdfs clean up functionality. While creating scratch dirs, hive registers it to hdfs cleanup hook. While killing happens, hdfs will clean them up. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Created] (HIVE-3251) Hive doesn't remove scrach directories while killing running MR job
Gang Tim Liu created HIVE-3251: -- Summary: Hive doesn't remove scrach directories while killing running MR job Key: HIVE-3251 URL: https://issues.apache.org/jira/browse/HIVE-3251 Project: Hive Issue Type: Bug Components: Server Infrastructure Reporter: Gang Tim Liu Assignee: Gang Tim Liu While killing running MR job, hive doesn't clean up scratch directory (mapred.cache.files). So that, afterwards, scratch directory is left there in hdfs. HDFS name node doesn't know it and try to do lease recovery. while such instances happen more, it will eventually crash namenode. The fix is to leverage hdfs clean up functionality. While creating scratch dirs, hive registers it to hdfs cleanup hook. While killing happens, hdfs will clean them up. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Work started] (HIVE-3251) Hive doesn't remove scrach directories while killing running MR job
[ https://issues.apache.org/jira/browse/HIVE-3251?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Work on HIVE-3251 started by Gang Tim Liu. Hive doesn't remove scrach directories while killing running MR job --- Key: HIVE-3251 URL: https://issues.apache.org/jira/browse/HIVE-3251 Project: Hive Issue Type: Bug Components: Server Infrastructure Reporter: Gang Tim Liu Assignee: Gang Tim Liu While killing running MR job, hive doesn't clean up scratch directory (mapred.cache.files). So that, afterwards, scratch directory is left there in hdfs. HDFS name node doesn't know it and try to do lease recovery. while such instances happen more, it will eventually crash namenode. The fix is to leverage hdfs clean up functionality. While creating scratch dirs, hive registers it to hdfs cleanup hook. While killing happens, hdfs will clean them up. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HIVE-3251) Hive doesn't remove scrach directories while killing running MR job
[ https://issues.apache.org/jira/browse/HIVE-3251?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13412280#comment-13412280 ] Gang Tim Liu commented on HIVE-3251: diff is ready to review https://reviews.facebook.net/D4089 Hive doesn't remove scrach directories while killing running MR job --- Key: HIVE-3251 URL: https://issues.apache.org/jira/browse/HIVE-3251 Project: Hive Issue Type: Bug Components: Server Infrastructure Reporter: Gang Tim Liu Assignee: Gang Tim Liu While killing running MR job, hive doesn't clean up scratch directory (mapred.cache.files). So that, afterwards, scratch directory is left there in hdfs. HDFS name node doesn't know it and try to do lease recovery. while such instances happen more, it will eventually crash namenode. The fix is to leverage hdfs clean up functionality. While creating scratch dirs, hive registers it to hdfs cleanup hook. While killing happens, hdfs will clean them up. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-3251) Hive doesn't remove scrach directories while killing running MR job
[ https://issues.apache.org/jira/browse/HIVE-3251?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gang Tim Liu updated HIVE-3251: --- Status: Patch Available (was: In Progress) https://reviews.facebook.net/D4089 Hive doesn't remove scrach directories while killing running MR job --- Key: HIVE-3251 URL: https://issues.apache.org/jira/browse/HIVE-3251 Project: Hive Issue Type: Bug Components: Server Infrastructure Reporter: Gang Tim Liu Assignee: Gang Tim Liu While killing running MR job, hive doesn't clean up scratch directory (mapred.cache.files). So that, afterwards, scratch directory is left there in hdfs. HDFS name node doesn't know it and try to do lease recovery. while such instances happen more, it will eventually crash namenode. The fix is to leverage hdfs clean up functionality. While creating scratch dirs, hive registers it to hdfs cleanup hook. While killing happens, hdfs will clean them up. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-3251) Hive doesn't remove scrach directories while killing running MR job
[ https://issues.apache.org/jira/browse/HIVE-3251?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gang Tim Liu updated HIVE-3251: --- Attachment: HIVE-3151.patch Hive doesn't remove scrach directories while killing running MR job --- Key: HIVE-3251 URL: https://issues.apache.org/jira/browse/HIVE-3251 Project: Hive Issue Type: Bug Components: Server Infrastructure Reporter: Gang Tim Liu Assignee: Gang Tim Liu Attachments: HIVE-3151.patch While killing running MR job, hive doesn't clean up scratch directory (mapred.cache.files). So that, afterwards, scratch directory is left there in hdfs. HDFS name node doesn't know it and try to do lease recovery. while such instances happen more, it will eventually crash namenode. The fix is to leverage hdfs clean up functionality. While creating scratch dirs, hive registers it to hdfs cleanup hook. While killing happens, hdfs will clean them up. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HIVE-3165) Hive thrift code doesnt generate quality hashCode()
[ https://issues.apache.org/jira/browse/HIVE-3165?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13400820#comment-13400820 ] Gang Tim Liu commented on HIVE-3165: @Carl, good Monday. How tests go? what I can do for it. thanks a lot. Tim Hive thrift code doesnt generate quality hashCode() --- Key: HIVE-3165 URL: https://issues.apache.org/jira/browse/HIVE-3165 Project: Hive Issue Type: Bug Components: Thrift API Environment: Hive thrift code doesnt generate quality hashCode(). Right now, it all has @Override public int hashCode() { return 0; } It will cause all entries go to the same bucket if it's used as hash key. Reporter: Gang Tim Liu Assignee: Gang Tim Liu Priority: Minor Attachments: Hive-3165.patch, Hive-3165.patch-2 -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HIVE-3165) Hive thrift code doesnt generate quality hashCode()
[ https://issues.apache.org/jira/browse/HIVE-3165?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13400844#comment-13400844 ] Gang Tim Liu commented on HIVE-3165: @Carl, thank you:) Hive thrift code doesnt generate quality hashCode() --- Key: HIVE-3165 URL: https://issues.apache.org/jira/browse/HIVE-3165 Project: Hive Issue Type: Bug Components: Thrift API Environment: Hive thrift code doesnt generate quality hashCode(). Right now, it all has @Override public int hashCode() { return 0; } It will cause all entries go to the same bucket if it's used as hash key. Reporter: Gang Tim Liu Assignee: Gang Tim Liu Priority: Minor Fix For: 0.10.0 Attachments: Hive-3165.patch, Hive-3165.patch-2 -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Work started] (HIVE-3165) Hive thrift code doesnt generate quality hashCode()
[ https://issues.apache.org/jira/browse/HIVE-3165?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Work on HIVE-3165 started by Gang Tim Liu. Hive thrift code doesnt generate quality hashCode() --- Key: HIVE-3165 URL: https://issues.apache.org/jira/browse/HIVE-3165 Project: Hive Issue Type: Bug Components: Thrift API Environment: Hive thrift code doesnt generate quality hashCode(). Right now, it all has @Override public int hashCode() { return 0; } It will cause all entries go to the same bucket if it's used as hash key. Reporter: Gang Tim Liu Assignee: Gang Tim Liu Priority: Minor -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-3165) Hive thrift code doesnt generate quality hashCode()
[ https://issues.apache.org/jira/browse/HIVE-3165?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gang Tim Liu updated HIVE-3165: --- Release Note: Patch has been submitted https://reviews.facebook.net/D3759 Status: Patch Available (was: In Progress) Hive thrift code doesnt generate quality hashCode() --- Key: HIVE-3165 URL: https://issues.apache.org/jira/browse/HIVE-3165 Project: Hive Issue Type: Bug Components: Thrift API Environment: Hive thrift code doesnt generate quality hashCode(). Right now, it all has @Override public int hashCode() { return 0; } It will cause all entries go to the same bucket if it's used as hash key. Reporter: Gang Tim Liu Assignee: Gang Tim Liu Priority: Minor -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HIVE-3165) Hive thrift code doesnt generate quality hashCode()
[ https://issues.apache.org/jira/browse/HIVE-3165?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13398233#comment-13398233 ] Gang Tim Liu commented on HIVE-3165: patch is available for review https://reviews.facebook.net/D3759 Hive thrift code doesnt generate quality hashCode() --- Key: HIVE-3165 URL: https://issues.apache.org/jira/browse/HIVE-3165 Project: Hive Issue Type: Bug Components: Thrift API Environment: Hive thrift code doesnt generate quality hashCode(). Right now, it all has @Override public int hashCode() { return 0; } It will cause all entries go to the same bucket if it's used as hash key. Reporter: Gang Tim Liu Assignee: Gang Tim Liu Priority: Minor -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-3165) Hive thrift code doesnt generate quality hashCode()
[ https://issues.apache.org/jira/browse/HIVE-3165?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gang Tim Liu updated HIVE-3165: --- Attachment: Hive-3165.patch Hive thrift code doesnt generate quality hashCode() --- Key: HIVE-3165 URL: https://issues.apache.org/jira/browse/HIVE-3165 Project: Hive Issue Type: Bug Components: Thrift API Environment: Hive thrift code doesnt generate quality hashCode(). Right now, it all has @Override public int hashCode() { return 0; } It will cause all entries go to the same bucket if it's used as hash key. Reporter: Gang Tim Liu Assignee: Gang Tim Liu Priority: Minor Attachments: Hive-3165.patch -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HIVE-3165) Hive thrift code doesnt generate quality hashCode()
[ https://issues.apache.org/jira/browse/HIVE-3165?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13398238#comment-13398238 ] Gang Tim Liu commented on HIVE-3165: Patch is attached to here also. Hive thrift code doesnt generate quality hashCode() --- Key: HIVE-3165 URL: https://issues.apache.org/jira/browse/HIVE-3165 Project: Hive Issue Type: Bug Components: Thrift API Environment: Hive thrift code doesnt generate quality hashCode(). Right now, it all has @Override public int hashCode() { return 0; } It will cause all entries go to the same bucket if it's used as hash key. Reporter: Gang Tim Liu Assignee: Gang Tim Liu Priority: Minor Attachments: Hive-3165.patch -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HIVE-3165) Hive thrift code doesnt generate quality hashCode()
[ https://issues.apache.org/jira/browse/HIVE-3165?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13398254#comment-13398254 ] Gang Tim Liu commented on HIVE-3165: @Carl, let me check. thanks a lot Hive thrift code doesnt generate quality hashCode() --- Key: HIVE-3165 URL: https://issues.apache.org/jira/browse/HIVE-3165 Project: Hive Issue Type: Bug Components: Thrift API Environment: Hive thrift code doesnt generate quality hashCode(). Right now, it all has @Override public int hashCode() { return 0; } It will cause all entries go to the same bucket if it's used as hash key. Reporter: Gang Tim Liu Assignee: Gang Tim Liu Priority: Minor Attachments: Hive-3165.patch -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HIVE-3165) Hive thrift code doesnt generate quality hashCode()
[ https://issues.apache.org/jira/browse/HIVE-3165?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13398579#comment-13398579 ] Gang Tim Liu commented on HIVE-3165: Test passed. another batch is submitted to https://reviews.facebook.net/D3759. thanks Hive thrift code doesnt generate quality hashCode() --- Key: HIVE-3165 URL: https://issues.apache.org/jira/browse/HIVE-3165 Project: Hive Issue Type: Bug Components: Thrift API Environment: Hive thrift code doesnt generate quality hashCode(). Right now, it all has @Override public int hashCode() { return 0; } It will cause all entries go to the same bucket if it's used as hash key. Reporter: Gang Tim Liu Assignee: Gang Tim Liu Priority: Minor Attachments: Hive-3165.patch -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-3165) Hive thrift code doesnt generate quality hashCode()
[ https://issues.apache.org/jira/browse/HIVE-3165?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gang Tim Liu updated HIVE-3165: --- Attachment: Hive-3165.patch-2 Hive thrift code doesnt generate quality hashCode() --- Key: HIVE-3165 URL: https://issues.apache.org/jira/browse/HIVE-3165 Project: Hive Issue Type: Bug Components: Thrift API Environment: Hive thrift code doesnt generate quality hashCode(). Right now, it all has @Override public int hashCode() { return 0; } It will cause all entries go to the same bucket if it's used as hash key. Reporter: Gang Tim Liu Assignee: Gang Tim Liu Priority: Minor Attachments: Hive-3165.patch, Hive-3165.patch-2 -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-3165) Hive thrift code doesnt generate quality hashCode()
[ https://issues.apache.org/jira/browse/HIVE-3165?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gang Tim Liu updated HIVE-3165: --- Status: Patch Available (was: Open) A new patch is submitted. Hive thrift code doesnt generate quality hashCode() --- Key: HIVE-3165 URL: https://issues.apache.org/jira/browse/HIVE-3165 Project: Hive Issue Type: Bug Components: Thrift API Environment: Hive thrift code doesnt generate quality hashCode(). Right now, it all has @Override public int hashCode() { return 0; } It will cause all entries go to the same bucket if it's used as hash key. Reporter: Gang Tim Liu Assignee: Gang Tim Liu Priority: Minor Attachments: Hive-3165.patch, Hive-3165.patch-2 -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HIVE-3165) Hive thrift code doesnt generate quality hashCode()
[ https://issues.apache.org/jira/browse/HIVE-3165?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13398959#comment-13398959 ] Gang Tim Liu commented on HIVE-3165: @Carl, thanks @Namit, thanks Hive thrift code doesnt generate quality hashCode() --- Key: HIVE-3165 URL: https://issues.apache.org/jira/browse/HIVE-3165 Project: Hive Issue Type: Bug Components: Thrift API Environment: Hive thrift code doesnt generate quality hashCode(). Right now, it all has @Override public int hashCode() { return 0; } It will cause all entries go to the same bucket if it's used as hash key. Reporter: Gang Tim Liu Assignee: Gang Tim Liu Priority: Minor Attachments: Hive-3165.patch, Hive-3165.patch-2 -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Created] (HIVE-3165) Hive thrift code doesnt generate quality hashCode()
Gang Tim Liu created HIVE-3165: -- Summary: Hive thrift code doesnt generate quality hashCode() Key: HIVE-3165 URL: https://issues.apache.org/jira/browse/HIVE-3165 Project: Hive Issue Type: Bug Components: Thrift API Environment: Hive thrift code doesnt generate quality hashCode(). Right now, it all has @Override public int hashCode() { return 0; } It will cause all entries go to the same bucket if it's used as hash key. Reporter: Gang Tim Liu Assignee: Gang Tim Liu Priority: Minor -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-3072) Hive List Bucketing - DDL support
[ https://issues.apache.org/jira/browse/HIVE-3072?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gang Tim Liu updated HIVE-3072: --- Summary: Hive List Bucketing - DDL support (was: Hive List Bucketing - DDL support (single column)) Hive List Bucketing - DDL support - Key: HIVE-3072 URL: https://issues.apache.org/jira/browse/HIVE-3072 Project: Hive Issue Type: New Feature Components: SQL Reporter: Gang Tim Liu Assignee: Gang Tim Liu If a hive table column has skewed keys, query performance on non-skewed key is always impacted. Hive List Bucketing feature will address it: https://cwiki.apache.org/Hive/listbucketing.html This jira issue will track DDL change for the feature. It's for single skewed column. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HIVE-3072) Hive List Bucketing - DDL support
[ https://issues.apache.org/jira/browse/HIVE-3072?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gang Tim Liu updated HIVE-3072: --- Description: If a hive table column has skewed keys, query performance on non-skewed key is always impacted. Hive List Bucketing feature will address it: https://cwiki.apache.org/Hive/listbucketing.html This jira issue will track DDL change for the feature. It's for both single skewed column and multiple columns. was: If a hive table column has skewed keys, query performance on non-skewed key is always impacted. Hive List Bucketing feature will address it: https://cwiki.apache.org/Hive/listbucketing.html This jira issue will track DDL change for the feature. It's for single skewed column. Hive List Bucketing - DDL support - Key: HIVE-3072 URL: https://issues.apache.org/jira/browse/HIVE-3072 Project: Hive Issue Type: New Feature Components: SQL Reporter: Gang Tim Liu Assignee: Gang Tim Liu If a hive table column has skewed keys, query performance on non-skewed key is always impacted. Hive List Bucketing feature will address it: https://cwiki.apache.org/Hive/listbucketing.html This jira issue will track DDL change for the feature. It's for both single skewed column and multiple columns. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HIVE-3072) Hive List Bucketing - DDL support
[ https://issues.apache.org/jira/browse/HIVE-3072?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13292978#comment-13292978 ] Gang Tim Liu commented on HIVE-3072: making progress on DML. The following syntax started to work: create table T (c1 string, c2 string) list bucketed by (c1) with skew ('x1'); create table T (c1 string, c2 string, c3 string) list bucketed by (c1, c2) with skew (('x1', 'x2'), ('y1', 'y2')); Hive List Bucketing - DDL support - Key: HIVE-3072 URL: https://issues.apache.org/jira/browse/HIVE-3072 Project: Hive Issue Type: New Feature Components: SQL Reporter: Gang Tim Liu Assignee: Gang Tim Liu If a hive table column has skewed keys, query performance on non-skewed key is always impacted. Hive List Bucketing feature will address it: https://cwiki.apache.org/Hive/listbucketing.html This jira issue will track DDL change for the feature. It's for both single skewed column and multiple columns. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira