from:"wangmeng $JIRA$"

[jira] [Commented] (HIVE-11880) filter bug of UNION ALL when hive.ppd.remove.duplicatefilters=true and filter condition is type incompatible column

2016-05-02 Thread WangMeng (JIRA)


[ 
https://issues.apache.org/jira/browse/HIVE-11880?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15267887#comment-15267887
 ] 

WangMeng commented on HIVE-11880:
-

Thanks [~aihuaxu].

> filter bug  of UNION ALL when hive.ppd.remove.duplicatefilters=true and 
> filter condition is type incompatible column 
> -
>
> Key: HIVE-11880
> URL: https://issues.apache.org/jira/browse/HIVE-11880
> Project: Hive
>  Issue Type: Bug
>  Components: Logical Optimizer
>Affects Versions: 1.2.1
>Reporter: WangMeng
>Assignee: WangMeng
> Attachments: HIVE-11880.01.patch, HIVE-11880.02.patch, 
> HIVE-11880.03.patch, HIVE-11880.04.patch
>
>
>For UNION ALL , when an union operator is constant column (such as '0L', 
> BIGINT Type)  and its corresponding column has incompatible type (such as INT 
> type). 
>   Query with filter condition on type incompatible column on this UNION ALL  
> will cause IndexOutOfBoundsException.
>  Such as TPC-H table "orders",in  the following query:
>  Type of 'orders'.'o_custkey' is INT normally,  while  the type of 
> corresponding constant column  "0" is BIGINT( `0L AS `o_custkey` ). 
>  This following query (with filter "type incompatible column 'o_custkey' ")  
> will fail  with  java.lang.IndexOutOfBoundsException : 
> {code}
> set hive.cbo.enable=false;
> set hive.ppd.remove.duplicatefilters=true;
> CREATE TABLE `orders`(
>   `o_orderkey` int, 
>   `o_custkey` int, 
>   `o_orderstatus` string, 
>   `o_totalprice` double, 
>   `o_orderdate` string, 
>   `o_orderpriority` string, 
>   `o_clerk` string, 
>   `o_shippriority` int, 
>   `o_comment` string);
> SELECT o_orderkey
> FROM   (
>   SELECT `o_orderkey` ,
>  `o_custkey`
>   FROM   `orders`
>   UNION ALL
>   SELECT `o_orderkey`,
>  0L  AS `o_custkey`
>   FROM   `orders`) `oo`
> WHERE  o_custkey<10;
> {code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HIVE-12231) StorageBasedAuthorization requires write permission of default Warehouse when create external database

2015-12-01 Thread WangMeng (JIRA)


[ 
https://issues.apache.org/jira/browse/HIVE-12231?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15033397#comment-15033397
 ] 

WangMeng commented on HIVE-12231:
-

[~thejas] Many thanks for your detailed explaination.

> StorageBasedAuthorization requires write permission of default Warehouse when 
> create external database
> --
>
> Key: HIVE-12231
> URL: https://issues.apache.org/jira/browse/HIVE-12231
> Project: Hive
>  Issue Type: Bug
>Affects Versions: 1.2.1
>Reporter: WangMeng
>Assignee: WangMeng
> Attachments: HIVE-12231.01.patch
>
>
> Please look at the stacktrace, when enabled StorageBasedAuthorization, I set 
> external location to create database. However, it will also check write 
> permission of default warehouse "/user/hive/warehouse"  :
> > CREATE DATABASE test LOCATION  '/tmp/wangmeng/test'  ;
> Error: Error while compiling statement: FAILED: HiveException 
> java.security.AccessControlException: Permission denied: user=wangmeng, 
> access=WRITE, inode="/user/hive/warehouse":hive:hive:drwxr-x--t
>   at 
> org.apache.hadoop.hdfs.server.namenode.DefaultAuthorizationProvider.checkFsPermission(DefaultAuthorizationProvider.java:255)
>   at 
> org.apache.hadoop.hdfs.server.namenode.DefaultAuthorizationProvider.check(DefaultAuthorizationProvider.java:236)
>   at 
> org.apache.hadoop.hdfs.server.namenode.DefaultAuthorizationProvider.checkPermission(DefaultAuthorizationProvider.java:151)



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Assigned] (HIVE-12231) StorageBasedAuthorization requires write permission of default Warehouse PATH when execute "CREATE DATABASE $Name LOCATION '$ExternalPath' "

2015-10-22 Thread WangMeng (JIRA)


 [ 
https://issues.apache.org/jira/browse/HIVE-12231?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

WangMeng reassigned HIVE-12231:
---

Assignee: WangMeng

> StorageBasedAuthorization requires write permission of default Warehouse PATH 
> when execute "CREATE DATABASE $Name LOCATION '$ExternalPath' "
> 
>
> Key: HIVE-12231
> URL: https://issues.apache.org/jira/browse/HIVE-12231
> Project: Hive
>  Issue Type: Bug
>Affects Versions: 1.2.1
>Reporter: WangMeng
>Assignee: WangMeng
>
> Please look at the stacktrace, when enabled StorageBasedAuthorization ,  I 
> set external Location of creating database, it will also check write 
> permission of default Warehouse "/user/hive/warehouse" :
> > create  database test  location '/tmp/wangmeng/test'  ;
> Error: Error while compiling statement: FAILED: HiveException 
> java.security.AccessControlException: Permission denied: user=wangmeng, 
> access=WRITE, inode="/user/hive/warehouse":hive:hive:drwxr-x--t
>   at 
> org.apache.hadoop.hdfs.server.namenode.DefaultAuthorizationProvider.checkFsPermission(DefaultAuthorizationProvider.java:255)
>   at 
> org.apache.hadoop.hdfs.server.namenode.DefaultAuthorizationProvider.check(DefaultAuthorizationProvider.java:236)
>   at 
> org.apache.hadoop.hdfs.server.namenode.DefaultAuthorizationProvider.checkPermission(DefaultAuthorizationProvider.java:151)



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (HIVE-12231) StorageBasedAuthorization requires write permission of default Warehouse when execute "CREATE DATABASE $Name LOCATION '$ExternalPath' "

2015-10-22 Thread WangMeng (JIRA)


 [ 
https://issues.apache.org/jira/browse/HIVE-12231?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

WangMeng updated HIVE-12231:

Summary: StorageBasedAuthorization requires write permission of default 
Warehouse when execute "CREATE DATABASE $Name LOCATION '$ExternalPath' "  (was: 
StorageBasedAuthorization requires write permission of default Warehouse PATH 
when execute "CREATE DATABASE $Name LOCATION '$ExternalPath' ")

> StorageBasedAuthorization requires write permission of default Warehouse when 
> execute "CREATE DATABASE $Name LOCATION '$ExternalPath' "
> ---
>
> Key: HIVE-12231
> URL: https://issues.apache.org/jira/browse/HIVE-12231
> Project: Hive
>  Issue Type: Bug
>Affects Versions: 1.2.1
>Reporter: WangMeng
>Assignee: WangMeng
> Attachments: HIVE-12231.01.patch
>
>
> Please look at the stacktrace, when enabled StorageBasedAuthorization ,  I 
> set external Location of creating database, it will also check write 
> permission of default Warehouse "/user/hive/warehouse" :
> > create  database test  location '/tmp/wangmeng/test'  ;
> Error: Error while compiling statement: FAILED: HiveException 
> java.security.AccessControlException: Permission denied: user=wangmeng, 
> access=WRITE, inode="/user/hive/warehouse":hive:hive:drwxr-x--t
>   at 
> org.apache.hadoop.hdfs.server.namenode.DefaultAuthorizationProvider.checkFsPermission(DefaultAuthorizationProvider.java:255)
>   at 
> org.apache.hadoop.hdfs.server.namenode.DefaultAuthorizationProvider.check(DefaultAuthorizationProvider.java:236)
>   at 
> org.apache.hadoop.hdfs.server.namenode.DefaultAuthorizationProvider.checkPermission(DefaultAuthorizationProvider.java:151)



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (HIVE-12231) StorageBasedAuthorization requires write permission of default Warehouse when execute "CREATE DATABASE $Name LOCATION '$ExternalPath' "

2015-10-22 Thread WangMeng (JIRA)


 [ 
https://issues.apache.org/jira/browse/HIVE-12231?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

WangMeng updated HIVE-12231:

Description: 
Please look at the stacktrace, when enabled StorageBasedAuthorization, I set 
external location to create database. However, it will also check write 
permission of default warehouse "/user/hive/warehouse"  :
> CREATE DATABASE test LOCATION  '/tmp/wangmeng/test'  ;
Error: Error while compiling statement: FAILED: HiveException 
java.security.AccessControlException: Permission denied: user=wangmeng, 
access=WRITE, inode="/user/hive/warehouse":hive:hive:drwxr-x--t
at 
org.apache.hadoop.hdfs.server.namenode.DefaultAuthorizationProvider.checkFsPermission(DefaultAuthorizationProvider.java:255)
at 
org.apache.hadoop.hdfs.server.namenode.DefaultAuthorizationProvider.check(DefaultAuthorizationProvider.java:236)
at 
org.apache.hadoop.hdfs.server.namenode.DefaultAuthorizationProvider.checkPermission(DefaultAuthorizationProvider.java:151)

  was:
Please look at the stacktrace, when enabled StorageBasedAuthorization ,  I set 
external Location of creating database, it will also check write permission of 
default Warehouse "/user/hive/warehouse" :
> create  database test  location '/tmp/wangmeng/test'  ;
Error: Error while compiling statement: FAILED: HiveException 
java.security.AccessControlException: Permission denied: user=wangmeng, 
access=WRITE, inode="/user/hive/warehouse":hive:hive:drwxr-x--t
at 
org.apache.hadoop.hdfs.server.namenode.DefaultAuthorizationProvider.checkFsPermission(DefaultAuthorizationProvider.java:255)
at 
org.apache.hadoop.hdfs.server.namenode.DefaultAuthorizationProvider.check(DefaultAuthorizationProvider.java:236)
at 
org.apache.hadoop.hdfs.server.namenode.DefaultAuthorizationProvider.checkPermission(DefaultAuthorizationProvider.java:151)


> StorageBasedAuthorization requires write permission of default Warehouse when 
> execute "CREATE DATABASE $Name LOCATION '$ExternalPath' "
> ---
>
> Key: HIVE-12231
> URL: https://issues.apache.org/jira/browse/HIVE-12231
> Project: Hive
>  Issue Type: Bug
>Affects Versions: 1.2.1
>Reporter: WangMeng
>Assignee: WangMeng
> Attachments: HIVE-12231.01.patch
>
>
> Please look at the stacktrace, when enabled StorageBasedAuthorization, I set 
> external location to create database. However, it will also check write 
> permission of default warehouse "/user/hive/warehouse"  :
> > CREATE DATABASE test LOCATION  '/tmp/wangmeng/test'  ;
> Error: Error while compiling statement: FAILED: HiveException 
> java.security.AccessControlException: Permission denied: user=wangmeng, 
> access=WRITE, inode="/user/hive/warehouse":hive:hive:drwxr-x--t
>   at 
> org.apache.hadoop.hdfs.server.namenode.DefaultAuthorizationProvider.checkFsPermission(DefaultAuthorizationProvider.java:255)
>   at 
> org.apache.hadoop.hdfs.server.namenode.DefaultAuthorizationProvider.check(DefaultAuthorizationProvider.java:236)
>   at 
> org.apache.hadoop.hdfs.server.namenode.DefaultAuthorizationProvider.checkPermission(DefaultAuthorizationProvider.java:151)



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (HIVE-12231) StorageBasedAuthorization requires write permission of default Warehouse PATH when execute "CREATE DATABASE $Name LOCATION '$ExternalPath' "

2015-10-22 Thread WangMeng (JIRA)


 [ 
https://issues.apache.org/jira/browse/HIVE-12231?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

WangMeng updated HIVE-12231:

Attachment: HIVE-12231.01.patch

> StorageBasedAuthorization requires write permission of default Warehouse PATH 
> when execute "CREATE DATABASE $Name LOCATION '$ExternalPath' "
> 
>
> Key: HIVE-12231
> URL: https://issues.apache.org/jira/browse/HIVE-12231
> Project: Hive
>  Issue Type: Bug
>Affects Versions: 1.2.1
>Reporter: WangMeng
>Assignee: WangMeng
> Attachments: HIVE-12231.01.patch
>
>
> Please look at the stacktrace, when enabled StorageBasedAuthorization ,  I 
> set external Location of creating database, it will also check write 
> permission of default Warehouse "/user/hive/warehouse" :
> > create  database test  location '/tmp/wangmeng/test'  ;
> Error: Error while compiling statement: FAILED: HiveException 
> java.security.AccessControlException: Permission denied: user=wangmeng, 
> access=WRITE, inode="/user/hive/warehouse":hive:hive:drwxr-x--t
>   at 
> org.apache.hadoop.hdfs.server.namenode.DefaultAuthorizationProvider.checkFsPermission(DefaultAuthorizationProvider.java:255)
>   at 
> org.apache.hadoop.hdfs.server.namenode.DefaultAuthorizationProvider.check(DefaultAuthorizationProvider.java:236)
>   at 
> org.apache.hadoop.hdfs.server.namenode.DefaultAuthorizationProvider.checkPermission(DefaultAuthorizationProvider.java:151)



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (HIVE-12231) StorageBasedAuthorization requires write permission of default Warehouse when create external database

2015-10-22 Thread WangMeng (JIRA)


 [ 
https://issues.apache.org/jira/browse/HIVE-12231?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

WangMeng updated HIVE-12231:

Summary: StorageBasedAuthorization requires write permission of default 
Warehouse when create external database  (was: StorageBasedAuthorization 
requires write permission of default Warehouse when execute "CREATE DATABASE 
$Name LOCATION '$ExternalPath' ")

> StorageBasedAuthorization requires write permission of default Warehouse when 
> create external database
> --
>
> Key: HIVE-12231
> URL: https://issues.apache.org/jira/browse/HIVE-12231
> Project: Hive
>  Issue Type: Bug
>Affects Versions: 1.2.1
>Reporter: WangMeng
>Assignee: WangMeng
> Attachments: HIVE-12231.01.patch
>
>
> Please look at the stacktrace, when enabled StorageBasedAuthorization, I set 
> external location to create database. However, it will also check write 
> permission of default warehouse "/user/hive/warehouse"  :
> > CREATE DATABASE test LOCATION  '/tmp/wangmeng/test'  ;
> Error: Error while compiling statement: FAILED: HiveException 
> java.security.AccessControlException: Permission denied: user=wangmeng, 
> access=WRITE, inode="/user/hive/warehouse":hive:hive:drwxr-x--t
>   at 
> org.apache.hadoop.hdfs.server.namenode.DefaultAuthorizationProvider.checkFsPermission(DefaultAuthorizationProvider.java:255)
>   at 
> org.apache.hadoop.hdfs.server.namenode.DefaultAuthorizationProvider.check(DefaultAuthorizationProvider.java:236)
>   at 
> org.apache.hadoop.hdfs.server.namenode.DefaultAuthorizationProvider.checkPermission(DefaultAuthorizationProvider.java:151)



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HIVE-12231) StorageBasedAuthorization requires write permission of default Warehouse when create external database

2015-10-22 Thread WangMeng (JIRA)


[ 
https://issues.apache.org/jira/browse/HIVE-12231?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14970294#comment-14970294
 ] 

WangMeng commented on HIVE-12231:
-

[~thejas] Thanks for your detailed clarification.
However, if user has write permission on the external path which is used in 
creating external database, this check mechanism maybe a little inappropriate. 
User should own the right to create database in path which user owns write 
permission such as his own dir. How about checking the destination path of 
external database directly? In production environment, creating external 
database in permission dir is relative common.
Thanks.

> StorageBasedAuthorization requires write permission of default Warehouse when 
> create external database
> --
>
> Key: HIVE-12231
> URL: https://issues.apache.org/jira/browse/HIVE-12231
> Project: Hive
>  Issue Type: Bug
>Affects Versions: 1.2.1
>Reporter: WangMeng
>Assignee: WangMeng
> Attachments: HIVE-12231.01.patch
>
>
> Please look at the stacktrace, when enabled StorageBasedAuthorization, I set 
> external location to create database. However, it will also check write 
> permission of default warehouse "/user/hive/warehouse"  :
> > CREATE DATABASE test LOCATION  '/tmp/wangmeng/test'  ;
> Error: Error while compiling statement: FAILED: HiveException 
> java.security.AccessControlException: Permission denied: user=wangmeng, 
> access=WRITE, inode="/user/hive/warehouse":hive:hive:drwxr-x--t
>   at 
> org.apache.hadoop.hdfs.server.namenode.DefaultAuthorizationProvider.checkFsPermission(DefaultAuthorizationProvider.java:255)
>   at 
> org.apache.hadoop.hdfs.server.namenode.DefaultAuthorizationProvider.check(DefaultAuthorizationProvider.java:236)
>   at 
> org.apache.hadoop.hdfs.server.namenode.DefaultAuthorizationProvider.checkPermission(DefaultAuthorizationProvider.java:151)



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (HIVE-12232) Create external table failed when enabled StorageBasedAuthorization

2015-10-22 Thread WangMeng (JIRA)


 [ 
https://issues.apache.org/jira/browse/HIVE-12232?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

WangMeng updated HIVE-12232:

Attachment: HIVE-12232.01.patch

> Create external table failed when enabled StorageBasedAuthorization
> ---
>
> Key: HIVE-12232
> URL: https://issues.apache.org/jira/browse/HIVE-12232
> Project: Hive
>  Issue Type: Bug
>Affects Versions: 1.2.1
>Reporter: WangMeng
>Assignee: WangMeng
> Attachments: HIVE-12232.01.patch
>
>
> Please look at the stacktrace, when enabled StorageBasedAuthorization, 
> creating external table will failed with write permission about the default 
> warehouse path "/user/hive/warehouse": 
> > CREATE EXTERNAL TABLE test(id int) LOCATION '/tmp/wangmeng/test'  ;
> Error: Error while compiling statement: FAILED: HiveException 
> java.security.AccessControlException: Permission denied: user=wangmeng, 
> access=WRITE, inode="/user/hive/warehouse":hive:hive:drwxr-x--t.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HIVE-12231) StorageBasedAuthorization requires write permission of default Warehouse when execute "CREATE DATABASE $Name LOCATION '$ExternalPath' "

2015-10-22 Thread WangMeng (JIRA)


[ 
https://issues.apache.org/jira/browse/HIVE-12231?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14968670#comment-14968670
 ] 

WangMeng commented on HIVE-12231:
-

[~xuefuz] [~thejas] :
Please take a look about this small bug. Thanks.

> StorageBasedAuthorization requires write permission of default Warehouse when 
> execute "CREATE DATABASE $Name LOCATION '$ExternalPath' "
> ---
>
> Key: HIVE-12231
> URL: https://issues.apache.org/jira/browse/HIVE-12231
> Project: Hive
>  Issue Type: Bug
>Affects Versions: 1.2.1
>Reporter: WangMeng
>Assignee: WangMeng
> Attachments: HIVE-12231.01.patch
>
>
> Please look at the stacktrace, when enabled StorageBasedAuthorization, I set 
> external location to create database. However, it will also check write 
> permission of default warehouse "/user/hive/warehouse"  :
> > CREATE DATABASE test LOCATION  '/tmp/wangmeng/test'  ;
> Error: Error while compiling statement: FAILED: HiveException 
> java.security.AccessControlException: Permission denied: user=wangmeng, 
> access=WRITE, inode="/user/hive/warehouse":hive:hive:drwxr-x--t
>   at 
> org.apache.hadoop.hdfs.server.namenode.DefaultAuthorizationProvider.checkFsPermission(DefaultAuthorizationProvider.java:255)
>   at 
> org.apache.hadoop.hdfs.server.namenode.DefaultAuthorizationProvider.check(DefaultAuthorizationProvider.java:236)
>   at 
> org.apache.hadoop.hdfs.server.namenode.DefaultAuthorizationProvider.checkPermission(DefaultAuthorizationProvider.java:151)



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HIVE-11880) filter bug of UNION ALL when hive.ppd.remove.duplicatefilters=true and filter condition is type incompatible column

2015-10-07 Thread WangMeng (JIRA)


[ 
https://issues.apache.org/jira/browse/HIVE-11880?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14948023#comment-14948023
 ] 

WangMeng commented on HIVE-11880:
-

[~jpullokkaran]   
 I have  added  Review Board in Issue Links.
 The  execution engine is MR( I  don't use TEZ) .You can use 
TPC-H(http://www.tpc.org/tpch/)  to  reproduce this Jira according to the 
descriptition above. Thanks.
 Different from  HIVE-11919 , only when  occurs "union type mismatch" and  one 
of the type mismatch column  is constant  and this type mismatch column is 
filter column, then  UNION ALL will  throws HIVE-11880．

> filter bug  of UNION ALL when hive.ppd.remove.duplicatefilters=true and 
> filter condition is type incompatible column 
> -
>
> Key: HIVE-11880
> URL: https://issues.apache.org/jira/browse/HIVE-11880
> Project: Hive
>  Issue Type: Bug
>  Components: Logical Optimizer
>Affects Versions: 1.2.1
>Reporter: WangMeng
>Assignee: WangMeng
> Attachments: HIVE-11880.01.patch, HIVE-11880.02.patch, 
> HIVE-11880.03.patch, HIVE-11880.04.patch
>
>
>For UNION ALL , when an union operator is constant column (such as '0L', 
> BIGINT Type)  and its corresponding column has incompatible type (such as INT 
> type). 
>   Query with filter condition on type incompatible column on this UNION ALL  
> will cause IndexOutOfBoundsException.
>  Such as TPC-H table "orders",in  the following query:
>  Type of 'orders'.'o_custkey' is INT normally,  while  the type of 
> corresponding constant column  "0" is BIGINT( `0L AS `o_custkey` ). 
>  This query (with filter "type incompatible column 'o_custkey' ")  will fail  
> with  java.lang.IndexOutOfBoundsException : 
> {code}
> SELECT Count(1)
> FROM   (
>   SELECT `o_orderkey` ,
>  `o_custkey`
>   FROM   `orders`
>   UNION ALL
>   SELECT `o_orderkey`,
>  0L  AS `o_custkey`
>   FROM   `orders`) `oo`
> WHERE  o_custkey<10 limit 4 ;
> {code}
> When 
> {code}
> set hive.ppd.remove.duplicatefilters=true
> {code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HIVE-11880) filter bug of UNION ALL when hive.ppd.remove.duplicatefilters=true and filter condition is type incompatible column

2015-09-29 Thread WangMeng (JIRA)


[ 
https://issues.apache.org/jira/browse/HIVE-11880?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14935024#comment-14935024
 ] 

WangMeng commented on HIVE-11880:
-

[~ashutoshc]   [~jpullokkaran]  I  have published this patch on Review Board: 
https://reviews.apache.org/r/38805/
Please help  review it . Thanks. 

> filter bug  of UNION ALL when hive.ppd.remove.duplicatefilters=true and 
> filter condition is type incompatible column 
> -
>
> Key: HIVE-11880
> URL: https://issues.apache.org/jira/browse/HIVE-11880
> Project: Hive
>  Issue Type: Bug
>  Components: Logical Optimizer
>Affects Versions: 1.2.1
>Reporter: WangMeng
>Assignee: WangMeng
> Attachments: HIVE-11880.01.patch, HIVE-11880.02.patch, 
> HIVE-11880.03.patch, HIVE-11880.04.patch
>
>
>For UNION ALL , when an union operator is constant column (such as '0L', 
> BIGINT Type)  and its corresponding column has incompatible type (such as INT 
> type). 
>   Query with filter condition on type incompatible column on this UNION ALL  
> will cause IndexOutOfBoundsException.
>  Such as TPC-H table "orders",in  the following query:
>  Type of 'orders'.'o_custkey' is INT normally,  while  the type of 
> corresponding constant column  "0" is BIGINT( `0L AS `o_custkey` ). 
>  This query (with filter "type incompatible column 'o_custkey' ")  will fail  
> with  java.lang.IndexOutOfBoundsException : 
> {code}
> SELECT Count(1)
> FROM   (
>   SELECT `o_orderkey` ,
>  `o_custkey`
>   FROM   `orders`
>   UNION ALL
>   SELECT `o_orderkey`,
>  0L  AS `o_custkey`
>   FROM   `orders`) `oo`
> WHERE  o_custkey<10 limit 4 ;
> {code}
> When 
> {code}
> set hive.ppd.remove.duplicatefilters=true
> {code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (HIVE-11880) filter bug of UNION ALL when hive.ppd.remove.duplicatefilters=true and filter condition is type incompatible column

2015-09-28 Thread WangMeng (JIRA)


 [ 
https://issues.apache.org/jira/browse/HIVE-11880?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

WangMeng updated HIVE-11880:

Attachment: HIVE-11880.04.patch

> filter bug  of UNION ALL when hive.ppd.remove.duplicatefilters=true and 
> filter condition is type incompatible column 
> -
>
> Key: HIVE-11880
> URL: https://issues.apache.org/jira/browse/HIVE-11880
> Project: Hive
>  Issue Type: Bug
>  Components: Logical Optimizer
>Affects Versions: 1.2.1
>Reporter: WangMeng
>Assignee: WangMeng
> Attachments: HIVE-11880.01.patch, HIVE-11880.02.patch, 
> HIVE-11880.03.patch, HIVE-11880.04.patch
>
>
>For UNION ALL , when an union operator is constant column (such as '0L', 
> BIGINT Type)  and its corresponding column has incompatible type (such as INT 
> type). 
>   Query with filter condition on type incompatible column on this UNION ALL  
> will cause IndexOutOfBoundsException.
>  Such as TPC-H table "orders",in  the following query:
>  Type of 'orders'.'o_custkey' is INT normally,  while  the type of 
> corresponding constant column  "0" is BIGINT( `0L AS `o_custkey` ). 
>  This query (with filter "type incompatible column 'o_custkey' ")  will fail  
> with  java.lang.IndexOutOfBoundsException : 
> {code}
> SELECT Count(1)
> FROM   (
>   SELECT `o_orderkey` ,
>  `o_custkey`
>   FROM   `orders`
>   UNION ALL
>   SELECT `o_orderkey`,
>  0L  AS `o_custkey`
>   FROM   `orders`) `oo`
> WHERE  o_custkey<10 limit 4 ;
> {code}
> When 
> {code}
> set hive.ppd.remove.duplicatefilters=true
> {code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HIVE-11880) filter bug of UNION ALL when hive.ppd.remove.duplicatefilters=true and filter condition is type incompatible column

2015-09-28 Thread WangMeng (JIRA)


[ 
https://issues.apache.org/jira/browse/HIVE-11880?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14933230#comment-14933230
 ] 

WangMeng commented on HIVE-11880:
-

[~jpullokkaran]  this exception is as follows:
Caused by: java.lang.RuntimeException: Map operator initialization failed
..
Caused by: java.lang.IndexOutOfBoundsException: Index: 0, Size: 0
at java.util.ArrayList.rangeCheck(ArrayList.java:635)
at java.util.ArrayList.get(ArrayList.java:411)
at 
org.apache.hadoop.hive.ql.exec.UnionOperator.initializeOp(UnionOperator.java:83).
.
And I uploaded a new patch:HIVE-11880.04.patch, please check it again. Thanks.

> filter bug  of UNION ALL when hive.ppd.remove.duplicatefilters=true and 
> filter condition is type incompatible column 
> -
>
> Key: HIVE-11880
> URL: https://issues.apache.org/jira/browse/HIVE-11880
> Project: Hive
>  Issue Type: Bug
>  Components: Logical Optimizer
>Affects Versions: 1.2.1
>Reporter: WangMeng
>Assignee: WangMeng
> Attachments: HIVE-11880.01.patch, HIVE-11880.02.patch, 
> HIVE-11880.03.patch
>
>
>For UNION ALL , when an union operator is constant column (such as '0L', 
> BIGINT Type)  and its corresponding column has incompatible type (such as INT 
> type). 
>   Query with filter condition on type incompatible column on this UNION ALL  
> will cause IndexOutOfBoundsException.
>  Such as TPC-H table "orders",in  the following query:
>  Type of 'orders'.'o_custkey' is INT normally,  while  the type of 
> corresponding constant column  "0" is BIGINT( `0L AS `o_custkey` ). 
>  This query (with filter "type incompatible column 'o_custkey' ")  will fail  
> with  java.lang.IndexOutOfBoundsException : 
> {code}
> SELECT Count(1)
> FROM   (
>   SELECT `o_orderkey` ,
>  `o_custkey`
>   FROM   `orders`
>   UNION ALL
>   SELECT `o_orderkey`,
>  0L  AS `o_custkey`
>   FROM   `orders`) `oo`
> WHERE  o_custkey<10 limit 4 ;
> {code}
> When 
> {code}
> set hive.ppd.remove.duplicatefilters=true
> {code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (HIVE-11880) filter bug of UNION ALL when hive.ppd.remove.duplicatefilters=true and filter condition is type incompatible column

2015-09-25 Thread WangMeng (JIRA)


 [ 
https://issues.apache.org/jira/browse/HIVE-11880?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

WangMeng updated HIVE-11880:

Attachment: HIVE-11880.03.patch

> filter bug  of UNION ALL when hive.ppd.remove.duplicatefilters=true and 
> filter condition is type incompatible column 
> -
>
> Key: HIVE-11880
> URL: https://issues.apache.org/jira/browse/HIVE-11880
> Project: Hive
>  Issue Type: Bug
>  Components: Logical Optimizer
>Affects Versions: 1.2.1
>Reporter: WangMeng
>Assignee: WangMeng
> Attachments: HIVE-11880.01.patch, HIVE-11880.02.patch, 
> HIVE-11880.03.patch
>
>
>For UNION ALL , when an union operator is constant column (such as '0L', 
> BIGINT Type)  and its corresponding column has incompatible type (such as INT 
> type). 
>   Query with filter condition on type incompatible column on this UNION ALL  
> will cause IndexOutOfBoundsException.
>  Such as TPC-H table "orders",in  the following query:
>  Type of 'orders'.'o_custkey' is INT normally,  while  the type of 
> corresponding constant column  "0" is BIGINT( `0L AS `o_custkey` ). 
>  This query (with filter "type incompatible column 'o_custkey' ")  will fail  
> with  java.lang.IndexOutOfBoundsException : 
> {code}
> SELECT Count(1)
> FROM   (
>   SELECT `o_orderkey` ,
>  `o_custkey`
>   FROM   `orders`
>   UNION ALL
>   SELECT `o_orderkey`,
>  0L  AS `o_custkey`
>   FROM   `orders`) `oo`
> WHERE  o_custkey<10 limit 4 ;
> {code}
> When 
> {code}
> set hive.ppd.remove.duplicatefilters=true
> {code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HIVE-11880) filter bug of UNION ALL when hive.ppd.remove.duplicatefilters=true and filter condition is type incompatible column

2015-09-25 Thread WangMeng (JIRA)


[ 
https://issues.apache.org/jira/browse/HIVE-11880?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14907667#comment-14907667
 ] 

WangMeng commented on HIVE-11880:
-

[~xuefuz]I have rebased it and uploaded a new patch.
[~ashutoshc] I tried the patch of HIVE-11919 again after I rebased, it can not 
fix this bug also.
Please check it again . Thanks.

> filter bug  of UNION ALL when hive.ppd.remove.duplicatefilters=true and 
> filter condition is type incompatible column 
> -
>
> Key: HIVE-11880
> URL: https://issues.apache.org/jira/browse/HIVE-11880
> Project: Hive
>  Issue Type: Bug
>  Components: Logical Optimizer
>Affects Versions: 1.2.1
>Reporter: WangMeng
>Assignee: WangMeng
> Attachments: HIVE-11880.01.patch, HIVE-11880.02.patch, 
> HIVE-11880.03.patch
>
>
>For UNION ALL , when an union operator is constant column (such as '0L', 
> BIGINT Type)  and its corresponding column has incompatible type (such as INT 
> type). 
>   Query with filter condition on type incompatible column on this UNION ALL  
> will cause IndexOutOfBoundsException.
>  Such as TPC-H table "orders",in  the following query:
>  Type of 'orders'.'o_custkey' is INT normally,  while  the type of 
> corresponding constant column  "0" is BIGINT( `0L AS `o_custkey` ). 
>  This query (with filter "type incompatible column 'o_custkey' ")  will fail  
> with  java.lang.IndexOutOfBoundsException : 
> {code}
> SELECT Count(1)
> FROM   (
>   SELECT `o_orderkey` ,
>  `o_custkey`
>   FROM   `orders`
>   UNION ALL
>   SELECT `o_orderkey`,
>  0L  AS `o_custkey`
>   FROM   `orders`) `oo`
> WHERE  o_custkey<10 limit 4 ;
> {code}
> When 
> {code}
> set hive.ppd.remove.duplicatefilters=true
> {code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HIVE-11880) filter bug of UNION ALL when hive.ppd.remove.duplicatefilters=true and filter condition is type incompatible column

2015-09-24 Thread WangMeng (JIRA)


[ 
https://issues.apache.org/jira/browse/HIVE-11880?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14906211#comment-14906211
 ] 

WangMeng commented on HIVE-11880:
-

[~ashutoshc]   [~xuefuz] 
I feel it will wait a very long time until  [~hiveqa] runs tests in recent  
period. 
Does  the QA queue have been blocked ? Thanks.

> filter bug  of UNION ALL when hive.ppd.remove.duplicatefilters=true and 
> filter condition is type incompatible column 
> -
>
> Key: HIVE-11880
> URL: https://issues.apache.org/jira/browse/HIVE-11880
> Project: Hive
>  Issue Type: Bug
>  Components: Logical Optimizer
>Affects Versions: 1.2.1
>Reporter: WangMeng
>Assignee: WangMeng
> Attachments: HIVE-11880.01.patch, HIVE-11880.02.patch
>
>
>For UNION ALL , when an union operator is constant column (such as '0L', 
> BIGINT Type)  and its corresponding column has incompatible type (such as INT 
> type). 
>   Query with filter condition on type incompatible column on this UNION ALL  
> will cause IndexOutOfBoundsException.
>  Such as TPC-H table "orders",in  the following query:
>  Type of 'orders'.'o_custkey' is INT normally,  while  the type of 
> corresponding constant column  "0" is BIGINT( `0L AS `o_custkey` ). 
>  This query (with filter "type incompatible column 'o_custkey' ")  will fail  
> with  java.lang.IndexOutOfBoundsException : 
> {code}
> SELECT Count(1)
> FROM   (
>   SELECT `o_orderkey` ,
>  `o_custkey`
>   FROM   `orders`
>   UNION ALL
>   SELECT `o_orderkey`,
>  0L  AS `o_custkey`
>   FROM   `orders`) `oo`
> WHERE  o_custkey<10 limit 4 ;
> {code}
> When 
> {code}
> set hive.ppd.remove.duplicatefilters=true
> {code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HIVE-11149) Fix issue with sometimes HashMap in PerfLogger.java hangs

2015-09-24 Thread WangMeng (JIRA)


[ 
https://issues.apache.org/jira/browse/HIVE-11149?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14906213#comment-14906213
 ] 

WangMeng commented on HIVE-11149:
-

[~sershe]
I feel it will wait a very long time until  [~hiveqa] runs tests in recent  
period. 
Does  the QA queue have been blocked ? Thanks.

> Fix issue with sometimes HashMap in PerfLogger.java hangs 
> --
>
> Key: HIVE-11149
> URL: https://issues.apache.org/jira/browse/HIVE-11149
> Project: Hive
>  Issue Type: Bug
>  Components: Logging
>Affects Versions: 1.2.0
>Reporter: WangMeng
>Assignee: WangMeng
> Attachments: HIVE-11149.01.patch, HIVE-11149.02.patch, 
> HIVE-11149.03.patch
>
>
> In  Multi-thread environment,  sometimes the  HashMap in PerfLogger.java  
> will  casue massive Java Processes hang  and cost  large amounts of 
> unnecessary CPU and Memory.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (HIVE-11149) Fix issue with sometimes HashMap in PerfLogger.java hangs

2015-09-24 Thread WangMeng (JIRA)


 [ 
https://issues.apache.org/jira/browse/HIVE-11149?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

WangMeng updated HIVE-11149:

Affects Version/s: (was: 1.2.0)

> Fix issue with sometimes HashMap in PerfLogger.java hangs 
> --
>
> Key: HIVE-11149
> URL: https://issues.apache.org/jira/browse/HIVE-11149
> Project: Hive
>  Issue Type: Bug
>  Components: Logging
>Reporter: WangMeng
>Assignee: WangMeng
> Attachments: HIVE-11149.01.patch, HIVE-11149.02.patch, 
> HIVE-11149.03.patch
>
>
> In  Multi-thread environment,  sometimes the  HashMap in PerfLogger.java  
> will  casue massive Java Processes hang  and cost  large amounts of 
> unnecessary CPU and Memory.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (HIVE-11149) Fix issue with sometimes HashMap in PerfLogger.java hangs

2015-09-24 Thread WangMeng (JIRA)


 [ 
https://issues.apache.org/jira/browse/HIVE-11149?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

WangMeng updated HIVE-11149:

Affects Version/s: 1.2.1

> Fix issue with sometimes HashMap in PerfLogger.java hangs 
> --
>
> Key: HIVE-11149
> URL: https://issues.apache.org/jira/browse/HIVE-11149
> Project: Hive
>  Issue Type: Bug
>  Components: Logging
>Affects Versions: 1.2.1
>Reporter: WangMeng
>Assignee: WangMeng
> Attachments: HIVE-11149.01.patch, HIVE-11149.02.patch, 
> HIVE-11149.03.patch
>
>
> In  Multi-thread environment,  sometimes the  HashMap in PerfLogger.java  
> will  casue massive Java Processes hang  and cost  large amounts of 
> unnecessary CPU and Memory.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (HIVE-11880) filter bug of UNION ALL when hive.ppd.remove.duplicatefilters=true and filter condition is type incompatible column

2015-09-24 Thread WangMeng (JIRA)


 [ 
https://issues.apache.org/jira/browse/HIVE-11880?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

WangMeng updated HIVE-11880:

Description: 
   For UNION ALL , when an union operator is constant column (such as '0L', 
BIGINT Type)  and its corresponding column has incompatible type (such as INT 
type). 
  Query with filter condition on type incompatible column on this UNION ALL  
will cause IndexOutOfBoundsException.

 Such as TPC-H table "orders",in  the following query:
 Type of 'orders'.'o_custkey' is INT normally,  while  the type of 
corresponding constant column  "0" is BIGINT( `0L AS `o_custkey` ). 
 This query (with filter "type incompatible column 'o_custkey' ")  will fail  
with  java.lang.IndexOutOfBoundsException : 
{code}
SELECT Count(1)
FROM   (
  SELECT `o_orderkey` ,
 `o_custkey`
  FROM   `orders`
  UNION ALL
  SELECT `o_orderkey`,
 0L  AS `o_custkey`
  FROM   `orders`) `oo`
WHERE  o_custkey<10 limit 4 ;
{code}
When 
{code}
set hive.ppd.remove.duplicatefilters=true
{code}

  was:
   For UNION ALL , when an union operator is constant(column 'a' such as '0L')  
and it has incompatible type with the corresponding column A(INT type). 
  Query with filter condition on type incompatible column on this UNION ALL  
will cause IndexOutOfBoundsException.

 Such as TPC-H table "orders",in  the fllowing query:
 Type of 'o_custkey' is INT normally,  while  the type of corresponding column 
constant "0" is BIGINT( `0L AS `o_custkey` ). 
 This query (with filter " incompatible column 'o_custkey' ")  will fail  with  
java.lang.IndexOutOfBoundsException : 
{code}
SELECT Count(1)
FROM   (
  SELECT `o_orderkey` ,
 `o_custkey`
  FROM   `orders`
  UNION ALL
  SELECT `o_orderkey`,
 0L  AS `o_custkey`
  FROM   `rcfileorders`) `oo`
WHERE  o_custkey<10 limit 4 ;

{code}

When 
{code}
set hive.ppd.remove.duplicatefilters=true
{code}
 the fllowing query (with filter " incompatible column 'o_custkey' ")  will 
fail  with  java.lang.IndexOutOfBoundsException:
{code}   
 select count(1) from view_orders  where o_custkey<10
{code}


> filter bug  of UNION ALL when hive.ppd.remove.duplicatefilters=true and 
> filter condition is type incompatible column 
> -
>
> Key: HIVE-11880
> URL: https://issues.apache.org/jira/browse/HIVE-11880
> Project: Hive
>  Issue Type: Bug
>  Components: Logical Optimizer
>Affects Versions: 1.2.1
>Reporter: WangMeng
>Assignee: WangMeng
> Attachments: HIVE-11880.01.patch
>
>
>For UNION ALL , when an union operator is constant column (such as '0L', 
> BIGINT Type)  and its corresponding column has incompatible type (such as INT 
> type). 
>   Query with filter condition on type incompatible column on this UNION ALL  
> will cause IndexOutOfBoundsException.
>  Such as TPC-H table "orders",in  the following query:
>  Type of 'orders'.'o_custkey' is INT normally,  while  the type of 
> corresponding constant column  "0" is BIGINT( `0L AS `o_custkey` ). 
>  This query (with filter "type incompatible column 'o_custkey' ")  will fail  
> with  java.lang.IndexOutOfBoundsException : 
> {code}
> SELECT Count(1)
> FROM   (
>   SELECT `o_orderkey` ,
>  `o_custkey`
>   FROM   `orders`
>   UNION ALL
>   SELECT `o_orderkey`,
>  0L  AS `o_custkey`
>   FROM   `orders`) `oo`
> WHERE  o_custkey<10 limit 4 ;
> {code}
> When 
> {code}
> set hive.ppd.remove.duplicatefilters=true
> {code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HIVE-11880) IndexOutOfBoundsException when query with filter condition on type incompatible column of UNION ALL when hive.ppd.remove.duplicatefilters=true

2015-09-24 Thread WangMeng (JIRA)


[ 
https://issues.apache.org/jira/browse/HIVE-11880?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14905982#comment-14905982
 ] 

WangMeng commented on HIVE-11880:
-

[~ashutoshc] I tried the patch of HIVE-11919, it can not fix this bug.
The two bugs are different.I think HIVE-11919 focuses on type mismatch and 
HIVE-11880 focuses on filter bug when hive.ppd.remove.duplicatefilters=true

> IndexOutOfBoundsException when query with filter condition on type 
> incompatible column of UNION ALL when hive.ppd.remove.duplicatefilters=true
> --
>
> Key: HIVE-11880
> URL: https://issues.apache.org/jira/browse/HIVE-11880
> Project: Hive
>  Issue Type: Bug
>  Components: Logical Optimizer
>Affects Versions: 1.2.1
>Reporter: WangMeng
>Assignee: WangMeng
> Attachments: HIVE-11880.01.patch
>
>
>For Hive UNION ALL , when an union column is constant(column a such as 
> '0L')  and it has incompatible type with the corresponding column A(INT 
> Type). 
>   Query with filter condition on type incompatible column a on this UNION ALL 
>  will cause IndexOutOfBoundsException.
>  Such as TPC-H table "orders", we  create View  by : 
> {code}
>   CREATE VIEW `view_orders` AS
>   SELECT `oo`.`o_orderkey` ,
>  `oo`.`o_custkey` 
>   FROM   (   
>   SELECT`o_orderkey` , `0L AS `o_custkey`   
>   FROM   `rcfileorders`   
>   UNION ALL   
>   SELECT `o_orderkey` ,`o_custkey`   
>   FROM  `textfileorders`) `oo`.
> {code}
>   In VIEW view_orders , type of 'o_custkey' is INT normally, while  the type 
> of corresponding column constant "0" is BIGINT( `0L AS `o_custkey` ).
> When 
> {code}
> set hive.ppd.remove.duplicatefilters=true
> {code}
>  the fllowing query (with filter " incompatible column 'o_custkey' ")  will 
> fail  with  java.lang.IndexOutOfBoundsException:
> {code}   
>  select count(1) from view_orders  where o_custkey<10
> {code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HIVE-11880) IndexOutOfBoundsException when query with filter condition on type incompatible column of UNION ALL when hive.ppd.remove.duplicatefilters=true

2015-09-24 Thread WangMeng (JIRA)


[ 
https://issues.apache.org/jira/browse/HIVE-11880?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14905983#comment-14905983
 ] 

WangMeng commented on HIVE-11880:
-

[~ashutoshc] I tried the patch of HIVE-11919, it can not fix this bug.
The two bugs are different.I think HIVE-11919 focuses on type mismatch and 
HIVE-11880 focuses on filter bug when hive.ppd.remove.duplicatefilters=true

> IndexOutOfBoundsException when query with filter condition on type 
> incompatible column of UNION ALL when hive.ppd.remove.duplicatefilters=true
> --
>
> Key: HIVE-11880
> URL: https://issues.apache.org/jira/browse/HIVE-11880
> Project: Hive
>  Issue Type: Bug
>  Components: Logical Optimizer
>Affects Versions: 1.2.1
>Reporter: WangMeng
>Assignee: WangMeng
> Attachments: HIVE-11880.01.patch
>
>
>For Hive UNION ALL , when an union column is constant(column a such as 
> '0L')  and it has incompatible type with the corresponding column A(INT 
> Type). 
>   Query with filter condition on type incompatible column a on this UNION ALL 
>  will cause IndexOutOfBoundsException.
>  Such as TPC-H table "orders", we  create View  by : 
> {code}
>   CREATE VIEW `view_orders` AS
>   SELECT `oo`.`o_orderkey` ,
>  `oo`.`o_custkey` 
>   FROM   (   
>   SELECT`o_orderkey` , `0L AS `o_custkey`   
>   FROM   `rcfileorders`   
>   UNION ALL   
>   SELECT `o_orderkey` ,`o_custkey`   
>   FROM  `textfileorders`) `oo`.
> {code}
>   In VIEW view_orders , type of 'o_custkey' is INT normally, while  the type 
> of corresponding column constant "0" is BIGINT( `0L AS `o_custkey` ).
> When 
> {code}
> set hive.ppd.remove.duplicatefilters=true
> {code}
>  the fllowing query (with filter " incompatible column 'o_custkey' ")  will 
> fail  with  java.lang.IndexOutOfBoundsException:
> {code}   
>  select count(1) from view_orders  where o_custkey<10
> {code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (HIVE-11880) filter bug of UNION ALL when hive.ppd.remove.duplicatefilters=true and filter condition is type incompatible column

2015-09-24 Thread WangMeng (JIRA)


 [ 
https://issues.apache.org/jira/browse/HIVE-11880?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

WangMeng updated HIVE-11880:

Summary: filter bug  of UNION ALL when 
hive.ppd.remove.duplicatefilters=true and filter condition is type incompatible 
column   (was: IndexOutOfBoundsException when query with filter condition on 
type incompatible column of UNION ALL when 
hive.ppd.remove.duplicatefilters=true)

> filter bug  of UNION ALL when hive.ppd.remove.duplicatefilters=true and 
> filter condition is type incompatible column 
> -
>
> Key: HIVE-11880
> URL: https://issues.apache.org/jira/browse/HIVE-11880
> Project: Hive
>  Issue Type: Bug
>  Components: Logical Optimizer
>Affects Versions: 1.2.1
>Reporter: WangMeng
>Assignee: WangMeng
> Attachments: HIVE-11880.01.patch
>
>
>For Hive UNION ALL , when an union column is constant(column a such as 
> '0L')  and it has incompatible type with the corresponding column A(INT 
> Type). 
>   Query with filter condition on type incompatible column a on this UNION ALL 
>  will cause IndexOutOfBoundsException.
>  Such as TPC-H table "orders", we  create View  by : 
> {code}
>   CREATE VIEW `view_orders` AS
>   SELECT `oo`.`o_orderkey` ,
>  `oo`.`o_custkey` 
>   FROM   (   
>   SELECT`o_orderkey` , `0L AS `o_custkey`   
>   FROM   `rcfileorders`   
>   UNION ALL   
>   SELECT `o_orderkey` ,`o_custkey`   
>   FROM  `textfileorders`) `oo`.
> {code}
>   In VIEW view_orders , type of 'o_custkey' is INT normally, while  the type 
> of corresponding column constant "0" is BIGINT( `0L AS `o_custkey` ).
> When 
> {code}
> set hive.ppd.remove.duplicatefilters=true
> {code}
>  the fllowing query (with filter " incompatible column 'o_custkey' ")  will 
> fail  with  java.lang.IndexOutOfBoundsException:
> {code}   
>  select count(1) from view_orders  where o_custkey<10
> {code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (HIVE-11880) filter bug of UNION ALL when hive.ppd.remove.duplicatefilters=true and filter condition is type incompatible column

2015-09-24 Thread WangMeng (JIRA)


 [ 
https://issues.apache.org/jira/browse/HIVE-11880?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

WangMeng updated HIVE-11880:

Attachment: HIVE-11880.02.patch

> filter bug  of UNION ALL when hive.ppd.remove.duplicatefilters=true and 
> filter condition is type incompatible column 
> -
>
> Key: HIVE-11880
> URL: https://issues.apache.org/jira/browse/HIVE-11880
> Project: Hive
>  Issue Type: Bug
>  Components: Logical Optimizer
>Affects Versions: 1.2.1
>Reporter: WangMeng
>Assignee: WangMeng
> Attachments: HIVE-11880.01.patch, HIVE-11880.02.patch
>
>
>For UNION ALL , when an union operator is constant column (such as '0L', 
> BIGINT Type)  and its corresponding column has incompatible type (such as INT 
> type). 
>   Query with filter condition on type incompatible column on this UNION ALL  
> will cause IndexOutOfBoundsException.
>  Such as TPC-H table "orders",in  the following query:
>  Type of 'orders'.'o_custkey' is INT normally,  while  the type of 
> corresponding constant column  "0" is BIGINT( `0L AS `o_custkey` ). 
>  This query (with filter "type incompatible column 'o_custkey' ")  will fail  
> with  java.lang.IndexOutOfBoundsException : 
> {code}
> SELECT Count(1)
> FROM   (
>   SELECT `o_orderkey` ,
>  `o_custkey`
>   FROM   `orders`
>   UNION ALL
>   SELECT `o_orderkey`,
>  0L  AS `o_custkey`
>   FROM   `orders`) `oo`
> WHERE  o_custkey<10 limit 4 ;
> {code}
> When 
> {code}
> set hive.ppd.remove.duplicatefilters=true
> {code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HIVE-11880) filter bug of UNION ALL when hive.ppd.remove.duplicatefilters=true and filter condition is type incompatible column

2015-09-24 Thread WangMeng (JIRA)


[ 
https://issues.apache.org/jira/browse/HIVE-11880?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14906092#comment-14906092
 ] 

WangMeng commented on HIVE-11880:
-

[~ashutoshc]   Hi , I uploaded a new patch to add  notes  for this issue and 
alterd  jira title and description.
Please check it again . Thanks.

> filter bug  of UNION ALL when hive.ppd.remove.duplicatefilters=true and 
> filter condition is type incompatible column 
> -
>
> Key: HIVE-11880
> URL: https://issues.apache.org/jira/browse/HIVE-11880
> Project: Hive
>  Issue Type: Bug
>  Components: Logical Optimizer
>Affects Versions: 1.2.1
>Reporter: WangMeng
>Assignee: WangMeng
> Attachments: HIVE-11880.01.patch, HIVE-11880.02.patch
>
>
>For UNION ALL , when an union operator is constant column (such as '0L', 
> BIGINT Type)  and its corresponding column has incompatible type (such as INT 
> type). 
>   Query with filter condition on type incompatible column on this UNION ALL  
> will cause IndexOutOfBoundsException.
>  Such as TPC-H table "orders",in  the following query:
>  Type of 'orders'.'o_custkey' is INT normally,  while  the type of 
> corresponding constant column  "0" is BIGINT( `0L AS `o_custkey` ). 
>  This query (with filter "type incompatible column 'o_custkey' ")  will fail  
> with  java.lang.IndexOutOfBoundsException : 
> {code}
> SELECT Count(1)
> FROM   (
>   SELECT `o_orderkey` ,
>  `o_custkey`
>   FROM   `orders`
>   UNION ALL
>   SELECT `o_orderkey`,
>  0L  AS `o_custkey`
>   FROM   `orders`) `oo`
> WHERE  o_custkey<10 limit 4 ;
> {code}
> When 
> {code}
> set hive.ppd.remove.duplicatefilters=true
> {code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (HIVE-11880) IndexOutOfBoundsException when query with filter condition on type incompatible column of UNION ALL when hive.ppd.remove.duplicatefilters=true

2015-09-23 Thread WangMeng (JIRA)


 [ 
https://issues.apache.org/jira/browse/HIVE-11880?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

WangMeng updated HIVE-11880:

Component/s: (was: Query Processor)
 Logical Optimizer

> IndexOutOfBoundsException when query with filter condition on type 
> incompatible column of UNION ALL when hive.ppd.remove.duplicatefilters=true
> --
>
> Key: HIVE-11880
> URL: https://issues.apache.org/jira/browse/HIVE-11880
> Project: Hive
>  Issue Type: Bug
>  Components: Logical Optimizer
>Affects Versions: 1.2.1
>Reporter: WangMeng
>Assignee: WangMeng
> Attachments: HIVE-11880.01.patch
>
>
>For Hive UNION ALL , when an union column is constant(column a such as 
> '0L')  and it has incompatible type with the corresponding column A(INT 
> Type). 
>   Query with filter condition on type incompatible column a on this UNION ALL 
>  will cause IndexOutOfBoundsException.
>  Such as TPC-H table "orders", we  create View  by : 
> {code}
>   CREATE VIEW `view_orders` AS
>   SELECT `oo`.`o_orderkey` ,
>  `oo`.`o_custkey` 
>   FROM   (   
>   SELECT`o_orderkey` , `0L AS `o_custkey`   
>   FROM   `rcfileorders`   
>   UNION ALL   
>   SELECT `o_orderkey` ,`o_custkey`   
>   FROM  `textfileorders`) `oo`.
> {code}
>   In VIEW view_orders , type of 'o_custkey' is INT normally, while  the type 
> of corresponding column constant "0" is BIGINT( `0L AS `o_custkey` ).
> When 
> {code}
> set hive.ppd.remove.duplicatefilters=true
> {code}
>  the fllowing query (with filter " incompatible column 'o_custkey' ")  will 
> fail  with  java.lang.IndexOutOfBoundsException:
> {code}   
>  select count(1) from view_orders  where o_custkey<10
> {code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HIVE-11880) IndexOutOfBoundsException when query with filter condition on type incompatible column of UNION ALL when hive.ppd.remove.duplicatefilters=true

2015-09-22 Thread WangMeng (JIRA)


[ 
https://issues.apache.org/jira/browse/HIVE-11880?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14903977#comment-14903977
 ] 

WangMeng commented on HIVE-11880:
-

[~hiveqa]

> IndexOutOfBoundsException when query with filter condition on type 
> incompatible column of UNION ALL when hive.ppd.remove.duplicatefilters=true
> --
>
> Key: HIVE-11880
> URL: https://issues.apache.org/jira/browse/HIVE-11880
> Project: Hive
>  Issue Type: Bug
>  Components: Query Processor
>Affects Versions: 1.2.1
>Reporter: WangMeng
>Assignee: WangMeng
> Attachments: HIVE-11880.01.patch
>
>
>For Hive UNION ALL , when an union column is constant(column a such as 
> '0L')  and it has incompatible type with the corresponding column A(INT 
> Type). 
>   Query with filter condition on type incompatible column a on this UNION ALL 
>  will cause IndexOutOfBoundsException.
>  Such as TPC-H table "orders", we  create View  by : 
> {code}
>   CREATE VIEW `view_orders` AS
>   SELECT `oo`.`o_orderkey` ,
>  `oo`.`o_custkey` 
>   FROM   (   
>   SELECT`o_orderkey` , `0L AS `o_custkey`   
>   FROM   `rcfileorders`   
>   UNION ALL   
>   SELECT `o_orderkey` ,`o_custkey`   
>   FROM  `textfileorders`) `oo`.
> {code}
>   In VIEW view_orders , type of 'o_custkey' is INT normally, while  the type 
> of corresponding column constant "0" is BIGINT( `0L AS `o_custkey` ).
> When 
> {code}
> set hive.ppd.remove.duplicatefilters=true
> {code}
>  the fllowing query (with filter " incompatible column 'o_custkey' ")  will 
> fail  with  java.lang.IndexOutOfBoundsException:
> {code}   
>  select count(1) from view_orders  where o_custkey<10
> {code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (HIVE-11880) IndexOutOfBoundsException when execute query with filter condition on type incompatible column on data(generated by UNION ALL with an union column is constant and it h

2015-09-22 Thread WangMeng (JIRA)


 [ 
https://issues.apache.org/jira/browse/HIVE-11880?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

WangMeng updated HIVE-11880:

Attachment: HIVE-11880.01.patch

>IndexOutOfBoundsException when execute query with filter condition on type 
> incompatible column on data(generated by UNION ALL with an union column is 
> constant and it has incompatible type with corresponding column) 
> --
>
> Key: HIVE-11880
> URL: https://issues.apache.org/jira/browse/HIVE-11880
> Project: Hive
>  Issue Type: Bug
>  Components: Query Processor
>Affects Versions: 1.2.1
>Reporter: WangMeng
>Assignee: WangMeng
> Attachments: HIVE-11880.01.patch
>
>
>For Hive UNION ALL , when a union column is constant(column a such as 
> '0L')  and it has incompatible type with the corresponding column A. The 
> query with filter condition on type incompatible column a on this UNION-ALL 
> results  will cause IndexOutOfBoundsException
>   Such as TPC-H table "orders", we  CREATE VIEW `view_orders` AS select 
> `oo`.`o_orderkey` , `oo`.`o_custkey`  from (  select  `orders`.`o_orderkey` , 
> `rcfileorders`.`o_custkey` from `tpch270g`.`rcfileorders`   union all  select 
> `orcfileorders`.`o_orderkey` , 0L as `o_custkey`   from  
> `tpch270g`.`textfileorders`) `oo`.
>Type of 'o_custkey' is INT normally, while  the type of corresponding 
> column constant "0" is BIGINT.
>Then the fllowing query(with filter incompatible column 0_custkey)  will 
> fail  with  java.lang.IndexOutOfBoundsException:
> 'select count(1) from view_orders  where o_custkey<10 '.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (HIVE-11880) IndexOutOfBoundsException when query with filter condition on type incompatible column of UNION ALL

2015-09-22 Thread WangMeng (JIRA)


 [ 
https://issues.apache.org/jira/browse/HIVE-11880?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

WangMeng updated HIVE-11880:

Summary: IndexOutOfBoundsException when query with filter condition on type 
incompatible column of UNION ALL  (was:IndexOutOfBoundsException when 
execute query with filter condition on type incompatible column on 
data(generated by UNION ALL with an union column is constant and it has 
incompatible type with corresponding column) )

> IndexOutOfBoundsException when query with filter condition on type 
> incompatible column of UNION ALL
> ---
>
> Key: HIVE-11880
> URL: https://issues.apache.org/jira/browse/HIVE-11880
> Project: Hive
>  Issue Type: Bug
>  Components: Query Processor
>Affects Versions: 1.2.1
>Reporter: WangMeng
>Assignee: WangMeng
> Attachments: HIVE-11880.01.patch
>
>
>For Hive UNION ALL , when a union column is constant(column a such as 
> '0L')  and it has incompatible type with the corresponding column A. The 
> query with filter condition on type incompatible column a on this UNION-ALL 
> results  will cause IndexOutOfBoundsException
>   Such as TPC-H table "orders", we  CREATE VIEW `view_orders` AS select 
> `oo`.`o_orderkey` , `oo`.`o_custkey`  from (  select  `orders`.`o_orderkey` , 
> `rcfileorders`.`o_custkey` from `tpch270g`.`rcfileorders`   union all  select 
> `orcfileorders`.`o_orderkey` , 0L as `o_custkey`   from  
> `tpch270g`.`textfileorders`) `oo`.
>Type of 'o_custkey' is INT normally, while  the type of corresponding 
> column constant "0" is BIGINT.
>Then the fllowing query(with filter incompatible column 0_custkey)  will 
> fail  with  java.lang.IndexOutOfBoundsException:
> 'select count(1) from view_orders  where o_custkey<10 '.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (HIVE-11880) IndexOutOfBoundsException when query with filter condition on type incompatible column of UNION ALL when hive.ppd.remove.duplicatefilters=true

2015-09-22 Thread WangMeng (JIRA)


 [ 
https://issues.apache.org/jira/browse/HIVE-11880?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

WangMeng updated HIVE-11880:

Description: 
   For Hive UNION ALL , when an union column is constant(column a such as '0L') 
 and it has incompatible type with the corresponding column A(INT Type). 
  Query with filter condition on type incompatible column a on this UNION ALL  
will cause IndexOutOfBoundsException.

 Such as TPC-H table "orders", we  create View  by : 

  CREATE VIEW `view_orders` AS
  SELECT `oo`.`o_orderkey` ,
 `oo`.`o_custkey` 
  FROM   (   
  SELECT   `orders`.`o_orderkey` ,  
   `rcfileorders`.`o_custkey`   
  FROM  `rcfileorders`   
  UNION ALL   
  SELECT   `orcfileorders`.`o_orderkey` , 
0L AS `o_custkey`   
  FROM  `textfileorders`) `oo`.

  In view_orders , type of 'o_custkey' is INT normally, while  the type of 
corresponding column constant "0" is BIGINT.

  When hive.ppd.remove.duplicatefilters=true, the fllowing query (with filter " 
incompatible column 'o_custkey' ")  will fail  with  
java.lang.IndexOutOfBoundsException:
'select count(1) from view_orders  where o_custkey<10 '.

  was:
   For Hive UNION ALL , when an union column is constant(column a such as '0L') 
 and it has incompatible type with the corresponding column A. 
  Query with filter condition on type incompatible column a on this UNION ALL  
will cause IndexOutOfBoundsException.

 Such as TPC-H table "orders", we  create View  by : 

  CREATE VIEW `view_orders` AS
  SELECT `oo`.`o_orderkey` ,
 `oo`.`o_custkey` 
  FROM   (   
  SELECT   `orders`.`o_orderkey` ,  
   `rcfileorders`.`o_custkey`   
  FROM  `rcfileorders`   
  UNION ALL   
  SELECT   `orcfileorders`.`o_orderkey` , 
0L AS `o_custkey`   
  FROM  `textfileorders`) `oo`.

  In view_orders , type of 'o_custkey' is INT normally, while  the type of 
corresponding column constant "0" is BIGINT.

  Then the fllowing query(with filter " incompatible column 'o_custkey' ")  
will fail  with  java.lang.IndexOutOfBoundsException:
'select count(1) from view_orders  where o_custkey<10 '.


> IndexOutOfBoundsException when query with filter condition on type 
> incompatible column of UNION ALL when hive.ppd.remove.duplicatefilters=true
> --
>
> Key: HIVE-11880
> URL: https://issues.apache.org/jira/browse/HIVE-11880
> Project: Hive
>  Issue Type: Bug
>  Components: Query Processor
>Affects Versions: 1.2.1
>Reporter: WangMeng
>Assignee: WangMeng
> Attachments: HIVE-11880.01.patch
>
>
>For Hive UNION ALL , when an union column is constant(column a such as 
> '0L')  and it has incompatible type with the corresponding column A(INT 
> Type). 
>   Query with filter condition on type incompatible column a on this UNION ALL 
>  will cause IndexOutOfBoundsException.
>  Such as TPC-H table "orders", we  create View  by : 
>   CREATE VIEW `view_orders` AS
>   SELECT `oo`.`o_orderkey` ,
>  `oo`.`o_custkey` 
>   FROM   (   
>   SELECT   `orders`.`o_orderkey` ,  
>`rcfileorders`.`o_custkey`   
>   FROM  `rcfileorders`   
>   UNION ALL   
>   SELECT   `orcfileorders`.`o_orderkey` , 
> 0L AS `o_custkey`   
>   FROM  `textfileorders`) `oo`.
>   In view_orders , type of 'o_custkey' is INT normally, while  the type of 
> corresponding column constant "0" is BIGINT.
>   When hive.ppd.remove.duplicatefilters=true, the fllowing query (with filter 
> " incompatible column 'o_custkey' ")  will fail  with  
> java.lang.IndexOutOfBoundsException:
> 'select count(1) from view_orders  where o_custkey<10 '.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (HIVE-11880) IndexOutOfBoundsException when query with filter condition on type incompatible column of UNION ALL when hive.ppd.remove.duplicatefilters=true

2015-09-22 Thread WangMeng (JIRA)


 [ 
https://issues.apache.org/jira/browse/HIVE-11880?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

WangMeng updated HIVE-11880:

Summary: IndexOutOfBoundsException when query with filter condition on type 
incompatible column of UNION ALL when hive.ppd.remove.duplicatefilters=true  
(was: IndexOutOfBoundsException when query with filter condition on type 
incompatible column of UNION ALL)

> IndexOutOfBoundsException when query with filter condition on type 
> incompatible column of UNION ALL when hive.ppd.remove.duplicatefilters=true
> --
>
> Key: HIVE-11880
> URL: https://issues.apache.org/jira/browse/HIVE-11880
> Project: Hive
>  Issue Type: Bug
>  Components: Query Processor
>Affects Versions: 1.2.1
>Reporter: WangMeng
>Assignee: WangMeng
> Attachments: HIVE-11880.01.patch
>
>
>For Hive UNION ALL , when an union column is constant(column a such as 
> '0L')  and it has incompatible type with the corresponding column A. 
>   Query with filter condition on type incompatible column a on this UNION ALL 
>  will cause IndexOutOfBoundsException.
>  Such as TPC-H table "orders", we  create View  by : 
>   CREATE VIEW `view_orders` AS
>   SELECT `oo`.`o_orderkey` ,
>  `oo`.`o_custkey` 
>   FROM   (   
>   SELECT   `orders`.`o_orderkey` ,  
>`rcfileorders`.`o_custkey`   
>   FROM  `rcfileorders`   
>   UNION ALL   
>   SELECT   `orcfileorders`.`o_orderkey` , 
> 0L AS `o_custkey`   
>   FROM  `textfileorders`) `oo`.
>   In view_orders , type of 'o_custkey' is INT normally, while  the type of 
> corresponding column constant "0" is BIGINT.
>   Then the fllowing query(with filter " incompatible column 'o_custkey' ")  
> will fail  with  java.lang.IndexOutOfBoundsException:
> 'select count(1) from view_orders  where o_custkey<10 '.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (HIVE-11880) IndexOutOfBoundsException when query with filter condition on type incompatible column of UNION ALL

2015-09-22 Thread WangMeng (JIRA)


 [ 
https://issues.apache.org/jira/browse/HIVE-11880?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

WangMeng updated HIVE-11880:

Description: 
   For Hive UNION ALL , when an union column is constant(column a such as '0L') 
 and it has incompatible type with the corresponding column A. 
  Query with filter condition on type incompatible column a on this UNION ALL  
will cause IndexOutOfBoundsException.

 Such as TPC-H table "orders", we  create View  by : 

  CREATE VIEW `view_orders` AS
  SELECT `oo`.`o_orderkey` ,
 `oo`.`o_custkey` 
  FROM   (   
  SELECT   `orders`.`o_orderkey` ,  
   `rcfileorders`.`o_custkey`   
  FROM  `rcfileorders`   
  UNION ALL   
  SELECT   `orcfileorders`.`o_orderkey` , 
0L AS `o_custkey`   
  FROM  `textfileorders`) `oo`.

  In view_orders , type of 'o_custkey' is INT normally, while  the type of 
corresponding column constant "0" is BIGINT.

  Then the fllowing query(with filter " incompatible column 'o_custkey' ")  
will fail  with  java.lang.IndexOutOfBoundsException:
'select count(1) from view_orders  where o_custkey<10 '.

  was:
   For Hive UNION ALL , when a union column is constant(column a such as '0L')  
and it has incompatible type with the corresponding column A. The query with 
filter condition on type incompatible column a on this UNION-ALL results  will 
cause IndexOutOfBoundsException

  Such as TPC-H table "orders", we  CREATE VIEW `view_orders` AS select 
`oo`.`o_orderkey` , `oo`.`o_custkey`  from (  select  `orders`.`o_orderkey` , 
`rcfileorders`.`o_custkey` from `tpch270g`.`rcfileorders`   union all  select 
`orcfileorders`.`o_orderkey` , 0L as `o_custkey`   from  
`tpch270g`.`textfileorders`) `oo`.

   Type of 'o_custkey' is INT normally, while  the type of corresponding column 
constant "0" is BIGINT.

   Then the fllowing query(with filter incompatible column 0_custkey)  will 
fail  with  java.lang.IndexOutOfBoundsException:
'select count(1) from view_orders  where o_custkey<10 '.


> IndexOutOfBoundsException when query with filter condition on type 
> incompatible column of UNION ALL
> ---
>
> Key: HIVE-11880
> URL: https://issues.apache.org/jira/browse/HIVE-11880
> Project: Hive
>  Issue Type: Bug
>  Components: Query Processor
>Affects Versions: 1.2.1
>Reporter: WangMeng
>Assignee: WangMeng
> Attachments: HIVE-11880.01.patch
>
>
>For Hive UNION ALL , when an union column is constant(column a such as 
> '0L')  and it has incompatible type with the corresponding column A. 
>   Query with filter condition on type incompatible column a on this UNION ALL 
>  will cause IndexOutOfBoundsException.
>  Such as TPC-H table "orders", we  create View  by : 
>   CREATE VIEW `view_orders` AS
>   SELECT `oo`.`o_orderkey` ,
>  `oo`.`o_custkey` 
>   FROM   (   
>   SELECT   `orders`.`o_orderkey` ,  
>`rcfileorders`.`o_custkey`   
>   FROM  `rcfileorders`   
>   UNION ALL   
>   SELECT   `orcfileorders`.`o_orderkey` , 
> 0L AS `o_custkey`   
>   FROM  `textfileorders`) `oo`.
>   In view_orders , type of 'o_custkey' is INT normally, while  the type of 
> corresponding column constant "0" is BIGINT.
>   Then the fllowing query(with filter " incompatible column 'o_custkey' ")  
> will fail  with  java.lang.IndexOutOfBoundsException:
> 'select count(1) from view_orders  where o_custkey<10 '.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HIVE-11880) IndexOutOfBoundsException when query with filter condition on type incompatible column of UNION ALL when hive.ppd.remove.duplicatefilters=true

2015-09-22 Thread WangMeng (JIRA)


[ 
https://issues.apache.org/jira/browse/HIVE-11880?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14903975#comment-14903975
 ] 

WangMeng commented on HIVE-11880:
-

[~xuefuz]
I uploaded a new patch for this issue.
Please check it. Thanks.

> IndexOutOfBoundsException when query with filter condition on type 
> incompatible column of UNION ALL when hive.ppd.remove.duplicatefilters=true
> --
>
> Key: HIVE-11880
> URL: https://issues.apache.org/jira/browse/HIVE-11880
> Project: Hive
>  Issue Type: Bug
>  Components: Query Processor
>Affects Versions: 1.2.1
>Reporter: WangMeng
>Assignee: WangMeng
> Attachments: HIVE-11880.01.patch
>
>
>For Hive UNION ALL , when an union column is constant(column a such as 
> '0L')  and it has incompatible type with the corresponding column A(INT 
> Type). 
>   Query with filter condition on type incompatible column a on this UNION ALL 
>  will cause IndexOutOfBoundsException.
>  Such as TPC-H table "orders", we  create View  by : 
> {code}
>   CREATE VIEW `view_orders` AS
>   SELECT `oo`.`o_orderkey` ,
>  `oo`.`o_custkey` 
>   FROM   (   
>   SELECT`o_orderkey` , `0L AS `o_custkey`   
>   FROM   `rcfileorders`   
>   UNION ALL   
>   SELECT `o_orderkey` ,`o_custkey`   
>   FROM  `textfileorders`) `oo`.
> {code}
>   In VIEW view_orders , type of 'o_custkey' is INT normally, while  the type 
> of corresponding column constant "0" is BIGINT( `0L AS `o_custkey` ).
> When 
> {code}
> set hive.ppd.remove.duplicatefilters=true
> {code}
>  the fllowing query (with filter " incompatible column 'o_custkey' ")  will 
> fail  with  java.lang.IndexOutOfBoundsException:
> {code}   
>  select count(1) from view_orders  where o_custkey<10
> {code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (HIVE-11880) IndexOutOfBoundsException when query with filter condition on type incompatible column of UNION ALL when hive.ppd.remove.duplicatefilters=true

2015-09-22 Thread WangMeng (JIRA)


 [ 
https://issues.apache.org/jira/browse/HIVE-11880?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

WangMeng updated HIVE-11880:

Description: 
   For Hive UNION ALL , when an union column is constant(column a such as '0L') 
 and it has incompatible type with the corresponding column A(INT Type). 
  Query with filter condition on type incompatible column a on this UNION ALL  
will cause IndexOutOfBoundsException.

 Such as TPC-H table "orders", we  create View  by : 
{code}
  CREATE VIEW `view_orders` AS
  SELECT `oo`.`o_orderkey` ,
 `oo`.`o_custkey` 
  FROM   (   
  SELECT`o_orderkey` , `0L AS `o_custkey`   
  FROM   `rcfileorders`   
  UNION ALL   
  SELECT `o_orderkey` ,`o_custkey`   
  FROM  `textfileorders`) `oo`.
{code}
  In VIEW view_orders , type of 'o_custkey' is INT normally, while  the type of 
corresponding column constant "0" is BIGINT( `0L AS `o_custkey` ).

When 
{code}
set hive.ppd.remove.duplicatefilters=true
{code}
 the fllowing query (with filter " incompatible column 'o_custkey' ")  will 
fail  with  java.lang.IndexOutOfBoundsException:
{code}   
 select count(1) from view_orders  where o_custkey<10
{code}

  was:
   For Hive UNION ALL , when an union column is constant(column a such as '0L') 
 and it has incompatible type with the corresponding column A(INT Type). 
  Query with filter condition on type incompatible column a on this UNION ALL  
will cause IndexOutOfBoundsException.

 Such as TPC-H table "orders", we  create View  by : 

  CREATE VIEW `view_orders` AS
  SELECT `oo`.`o_orderkey` ,
 `oo`.`o_custkey` 
  FROM   (   
  SELECT   `orders`.`o_orderkey` ,  
   `rcfileorders`.`o_custkey`   
  FROM  `rcfileorders`   
  UNION ALL   
  SELECT   `orcfileorders`.`o_orderkey` , 
0L AS `o_custkey`   
  FROM  `textfileorders`) `oo`.

  In view_orders , type of 'o_custkey' is INT normally, while  the type of 
corresponding column constant "0" is BIGINT.

  When hive.ppd.remove.duplicatefilters=true, the fllowing query (with filter " 
incompatible column 'o_custkey' ")  will fail  with  
java.lang.IndexOutOfBoundsException:
'select count(1) from view_orders  where o_custkey<10 '.


> IndexOutOfBoundsException when query with filter condition on type 
> incompatible column of UNION ALL when hive.ppd.remove.duplicatefilters=true
> --
>
> Key: HIVE-11880
> URL: https://issues.apache.org/jira/browse/HIVE-11880
> Project: Hive
>  Issue Type: Bug
>  Components: Query Processor
>Affects Versions: 1.2.1
>Reporter: WangMeng
>Assignee: WangMeng
> Attachments: HIVE-11880.01.patch
>
>
>For Hive UNION ALL , when an union column is constant(column a such as 
> '0L')  and it has incompatible type with the corresponding column A(INT 
> Type). 
>   Query with filter condition on type incompatible column a on this UNION ALL 
>  will cause IndexOutOfBoundsException.
>  Such as TPC-H table "orders", we  create View  by : 
> {code}
>   CREATE VIEW `view_orders` AS
>   SELECT `oo`.`o_orderkey` ,
>  `oo`.`o_custkey` 
>   FROM   (   
>   SELECT`o_orderkey` , `0L AS `o_custkey`   
>   FROM   `rcfileorders`   
>   UNION ALL   
>   SELECT `o_orderkey` ,`o_custkey`   
>   FROM  `textfileorders`) `oo`.
> {code}
>   In VIEW view_orders , type of 'o_custkey' is INT normally, while  the type 
> of corresponding column constant "0" is BIGINT( `0L AS `o_custkey` ).
> When 
> {code}
> set hive.ppd.remove.duplicatefilters=true
> {code}
>  the fllowing query (with filter " incompatible column 'o_custkey' ")  will 
> fail  with  java.lang.IndexOutOfBoundsException:
> {code}   
>  select count(1) from view_orders  where o_custkey<10
> {code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HIVE-11149) Fix issue with sometimes HashMap in PerfLogger.java hangs

2015-09-21 Thread WangMeng (JIRA)


[ 
https://issues.apache.org/jira/browse/HIVE-11149?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14900430#comment-14900430
 ] 

WangMeng commented on HIVE-11149:
-

[~hiveqa]

> Fix issue with sometimes HashMap in PerfLogger.java hangs 
> --
>
> Key: HIVE-11149
> URL: https://issues.apache.org/jira/browse/HIVE-11149
> Project: Hive
>  Issue Type: Bug
>  Components: Logging
>Affects Versions: 1.2.0
>Reporter: WangMeng
>Assignee: WangMeng
> Attachments: HIVE-11149.01.patch, HIVE-11149.02.patch, 
> HIVE-11149.03.patch
>
>
> In  Multi-thread environment,  sometimes the  HashMap in PerfLogger.java  
> will  casue massive Java Processes hang  and cost  large amounts of 
> unnecessary CPU and Memory.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (HIVE-11149) Fix issue with Thread unsafe Class HashMap in PerfLogger.java hangs in Multi-thread environment

2015-09-18 Thread WangMeng (JIRA)


 [ 
https://issues.apache.org/jira/browse/HIVE-11149?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

WangMeng updated HIVE-11149:

Attachment: HIVE-11149.03.patch

> Fix issue with Thread unsafe Class  HashMap in PerfLogger.java  hangs  in  
> Multi-thread environment
> ---
>
> Key: HIVE-11149
> URL: https://issues.apache.org/jira/browse/HIVE-11149
> Project: Hive
>  Issue Type: Bug
>  Components: Logging
>Affects Versions: 1.2.0
>Reporter: WangMeng
>Assignee: WangMeng
> Attachments: HIVE-11149.01.patch, HIVE-11149.02.patch, 
> HIVE-11149.03.patch
>
>
> In  Multi-thread environment,  the Thread unsafe Class HashMap in 
> PerfLogger.java  will  casue massive Java Processes hang  and cost  large 
> amounts of unnecessary CPU and Memory.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (HIVE-11149) Fix issue with sometimes HashMap in PerfLogger.java hangs

2015-09-18 Thread WangMeng (JIRA)


 [ 
https://issues.apache.org/jira/browse/HIVE-11149?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

WangMeng updated HIVE-11149:

Summary: Fix issue with sometimes HashMap in PerfLogger.java hangs   (was: 
Fix issue with HashMap in PerfLogger.java hangs )

> Fix issue with sometimes HashMap in PerfLogger.java hangs 
> --
>
> Key: HIVE-11149
> URL: https://issues.apache.org/jira/browse/HIVE-11149
> Project: Hive
>  Issue Type: Bug
>  Components: Logging
>Affects Versions: 1.2.0
>Reporter: WangMeng
>Assignee: WangMeng
> Attachments: HIVE-11149.01.patch, HIVE-11149.02.patch, 
> HIVE-11149.03.patch
>
>
> In  Multi-thread environment,  the  HashMap in PerfLogger.java  will  casue 
> massive Java Processes hang  and cost  large amounts of unnecessary CPU and 
> Memory.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (HIVE-11149) Fix issue with sometimes HashMap in PerfLogger.java hangs

2015-09-18 Thread WangMeng (JIRA)


 [ 
https://issues.apache.org/jira/browse/HIVE-11149?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

WangMeng updated HIVE-11149:

Description: In  Multi-thread environment,  sometimes the  HashMap in 
PerfLogger.java  will  casue massive Java Processes hang  and cost  large 
amounts of unnecessary CPU and Memory.  (was: In  Multi-thread environment,  
the  HashMap in PerfLogger.java  will  casue massive Java Processes hang  and 
cost  large amounts of unnecessary CPU and Memory.)

> Fix issue with sometimes HashMap in PerfLogger.java hangs 
> --
>
> Key: HIVE-11149
> URL: https://issues.apache.org/jira/browse/HIVE-11149
> Project: Hive
>  Issue Type: Bug
>  Components: Logging
>Affects Versions: 1.2.0
>Reporter: WangMeng
>Assignee: WangMeng
> Attachments: HIVE-11149.01.patch, HIVE-11149.02.patch, 
> HIVE-11149.03.patch
>
>
> In  Multi-thread environment,  sometimes the  HashMap in PerfLogger.java  
> will  casue massive Java Processes hang  and cost  large amounts of 
> unnecessary CPU and Memory.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (HIVE-11149) Fix issue with HashMap in PerfLogger.java hangs

2015-09-18 Thread WangMeng (JIRA)


 [ 
https://issues.apache.org/jira/browse/HIVE-11149?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

WangMeng updated HIVE-11149:

Description: In  Multi-thread environment,  the  HashMap in PerfLogger.java 
 will  casue massive Java Processes hang  and cost  large amounts of 
unnecessary CPU and Memory.  (was: In  Multi-thread environment,  the Thread 
unsafe Class HashMap in PerfLogger.java  will  casue massive Java Processes 
hang  and cost  large amounts of unnecessary CPU and Memory.)

> Fix issue with HashMap in PerfLogger.java hangs 
> 
>
> Key: HIVE-11149
> URL: https://issues.apache.org/jira/browse/HIVE-11149
> Project: Hive
>  Issue Type: Bug
>  Components: Logging
>Affects Versions: 1.2.0
>Reporter: WangMeng
>Assignee: WangMeng
> Attachments: HIVE-11149.01.patch, HIVE-11149.02.patch, 
> HIVE-11149.03.patch
>
>
> In  Multi-thread environment,  the  HashMap in PerfLogger.java  will  casue 
> massive Java Processes hang  and cost  large amounts of unnecessary CPU and 
> Memory.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (HIVE-11149) Fix issue with HashMap in PerfLogger.java hangs

2015-09-18 Thread WangMeng (JIRA)


 [ 
https://issues.apache.org/jira/browse/HIVE-11149?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

WangMeng updated HIVE-11149:

Summary: Fix issue with HashMap in PerfLogger.java hangs   (was: Fix issue 
with Thread unsafe Class  HashMap in PerfLogger.java  hangs  in  Multi-thread 
environment)

> Fix issue with HashMap in PerfLogger.java hangs 
> 
>
> Key: HIVE-11149
> URL: https://issues.apache.org/jira/browse/HIVE-11149
> Project: Hive
>  Issue Type: Bug
>  Components: Logging
>Affects Versions: 1.2.0
>Reporter: WangMeng
>Assignee: WangMeng
> Attachments: HIVE-11149.01.patch, HIVE-11149.02.patch, 
> HIVE-11149.03.patch
>
>
> In  Multi-thread environment,  the Thread unsafe Class HashMap in 
> PerfLogger.java  will  casue massive Java Processes hang  and cost  large 
> amounts of unnecessary CPU and Memory.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HIVE-11149) Fix issue with Thread unsafe Class HashMap in PerfLogger.java hangs in Multi-thread environment

2015-09-18 Thread WangMeng (JIRA)


[ 
https://issues.apache.org/jira/browse/HIVE-11149?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14805164#comment-14805164
 ] 

WangMeng commented on HIVE-11149:
-

[~xuefuz]  [~sershe]  I uploaded a new patch for this issue.
Please check it again. Thanks.

> Fix issue with Thread unsafe Class  HashMap in PerfLogger.java  hangs  in  
> Multi-thread environment
> ---
>
> Key: HIVE-11149
> URL: https://issues.apache.org/jira/browse/HIVE-11149
> Project: Hive
>  Issue Type: Bug
>  Components: Logging
>Affects Versions: 1.2.0
>Reporter: WangMeng
>Assignee: WangMeng
> Attachments: HIVE-11149.01.patch, HIVE-11149.02.patch, 
> HIVE-11149.03.patch
>
>
> In  Multi-thread environment,  the Thread unsafe Class HashMap in 
> PerfLogger.java  will  casue massive Java Processes hang  and cost  large 
> amounts of unnecessary CPU and Memory.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HIVE-11149) Fix issue with sometimes HashMap in PerfLogger.java hangs

2015-09-18 Thread WangMeng (JIRA)


[ 
https://issues.apache.org/jira/browse/HIVE-11149?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14876805#comment-14876805
 ] 

WangMeng commented on HIVE-11149:
-

[~sershe] According to my understanding,it may not occur the problem you 
mentioned above.
According to code logic in Hive,each thread  accessing the session owns the 
initial default value"resetPerfLogger == true" when it begin initialization and 
then it registered as ThreadLocal thread by perfLogger.set() method.
Do I understand  your problem? Thanks.


//Reset the perf logger in Driver.runInternal()
PerfLogger perfLogger = PerfLogger.getPerfLogger(true);

//PerfLogger.java
 public static PerfLogger getPerfLogger(boolean resetPerfLogger) {
  if (SessionState.get() == null) {
 if (perfLogger.get() == null || resetPerfLogger) {
   perfLogger.set(new PerfLogger());
  }
return perfLogger.get();
  } else {
perfLogger.set(SessionState.get().getPerfLogger(resetPerfLogger));
return perfLogger.get();
}
}

> Fix issue with sometimes HashMap in PerfLogger.java hangs 
> --
>
> Key: HIVE-11149
> URL: https://issues.apache.org/jira/browse/HIVE-11149
> Project: Hive
>  Issue Type: Bug
>  Components: Logging
>Affects Versions: 1.2.0
>Reporter: WangMeng
>Assignee: WangMeng
> Attachments: HIVE-11149.01.patch, HIVE-11149.02.patch, 
> HIVE-11149.03.patch
>
>
> In  Multi-thread environment,  sometimes the  HashMap in PerfLogger.java  
> will  casue massive Java Processes hang  and cost  large amounts of 
> unnecessary CPU and Memory.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (HIVE-11880) IndexOutOfBoundsException when execute query with filter condition on type incompatible column(A) on data(composed by UNION ALL when a union column is constant and it

2015-09-18 Thread WangMeng (JIRA)


 [ 
https://issues.apache.org/jira/browse/HIVE-11880?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

WangMeng updated HIVE-11880:

Description: 
   For Hive UNION ALL , when a union column is constant(column a such as '0L')  
and it has incompatible type with the corresponding column A. The query with 
filter condition on type incompatible column a on this UNION-ALL results  will 
cause IndexOutOfBoundsException

  Such as TPC-H table "orders", we  CREATE VIEW `view_orders` AS select 
`oo`.`o_orderkey` , `oo`.`o_custkey`  from (  select  `orders`.`o_orderkey` , 
`rcfileorders`.`o_custkey` from `tpch270g`.`rcfileorders`   union all  select 
`orcfileorders`.`o_orderkey` , 0L as `o_custkey`   from  
`tpch270g`.`textfileorders`) `oo`.

   Type of "o_custkey" is INT,  the type of corresponding constant "0" is 
BIGINT.

   Then the fllowing query(with filter incompatible column 0_custkey)  will 
fail  with  java.lang.IndexOutOfBoundsException:
"select count(1) from view_orders  where o_custkey<10 ".

  was:
For Hive UNION ALL , when a union column is constant(column a) and it has 
incompatible type with the corresponding column A. The query with filter 
condition on type incompatible column a on this UNION-ALL results  will cause 
IndexOutOfBoundsException

such as TPC-H table orders:
CREATE VIEW `view_orders` AS select `oo`.`o_orderkey` , `oo`.`o_custkey`  from 
(  select  `orders`.`o_orderkey` , `rcfileorders`.`o_custkey` from 
`tpch270g`.`rcfileorders`   union all  select `orcfileorders`.`o_orderkey` , 0L 
as `o_custkey`   from  `tpch270g`.`textfileorders`) `oo`.

Type of "o_custkey" is INT,  the type of corresponding constant column 0 is 
BIGINT.
Then the fllowing query(with filter incompatible column 0_custkey)  will fail:
select count(1) from view_orders  where o_custkey<10 with  
java.lang.IndexOutOfBoundsException.


>IndexOutOfBoundsException when execute query with filter condition on type 
> incompatible column(A) on data(composed by UNION ALL when a union column is 
> constant and it has incompatible type with  corresponding column) 
> 
>
> Key: HIVE-11880
> URL: https://issues.apache.org/jira/browse/HIVE-11880
> Project: Hive
>  Issue Type: Bug
>  Components: Query Processor
>Affects Versions: 1.2.1
>Reporter: WangMeng
>Assignee: WangMeng
>
>For Hive UNION ALL , when a union column is constant(column a such as 
> '0L')  and it has incompatible type with the corresponding column A. The 
> query with filter condition on type incompatible column a on this UNION-ALL 
> results  will cause IndexOutOfBoundsException
>   Such as TPC-H table "orders", we  CREATE VIEW `view_orders` AS select 
> `oo`.`o_orderkey` , `oo`.`o_custkey`  from (  select  `orders`.`o_orderkey` , 
> `rcfileorders`.`o_custkey` from `tpch270g`.`rcfileorders`   union all  select 
> `orcfileorders`.`o_orderkey` , 0L as `o_custkey`   from  
> `tpch270g`.`textfileorders`) `oo`.
>Type of "o_custkey" is INT,  the type of corresponding constant "0" is 
> BIGINT.
>Then the fllowing query(with filter incompatible column 0_custkey)  will 
> fail  with  java.lang.IndexOutOfBoundsException:
> "select count(1) from view_orders  where o_custkey<10 ".



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (HIVE-11880) IndexOutOfBoundsException when execute query with filter condition on type incompatible column on data(generated by UNION ALL with an union column is constant and it h

2015-09-18 Thread WangMeng (JIRA)


 [ 
https://issues.apache.org/jira/browse/HIVE-11880?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

WangMeng updated HIVE-11880:

Summary:IndexOutOfBoundsException when execute query with filter 
condition on type incompatible column on data(generated by UNION ALL with an 
union column is constant and it has incompatible type with corresponding 
column)   (was:IndexOutOfBoundsException when execute query with filter 
condition on type incompatible column(A) on data(composed by UNION ALL when a 
union column is constant and it has incompatible type with  corresponding 
column) )

>IndexOutOfBoundsException when execute query with filter condition on type 
> incompatible column on data(generated by UNION ALL with an union column is 
> constant and it has incompatible type with corresponding column) 
> --
>
> Key: HIVE-11880
> URL: https://issues.apache.org/jira/browse/HIVE-11880
> Project: Hive
>  Issue Type: Bug
>  Components: Query Processor
>Affects Versions: 1.2.1
>Reporter: WangMeng
>Assignee: WangMeng
>
>For Hive UNION ALL , when a union column is constant(column a such as 
> '0L')  and it has incompatible type with the corresponding column A. The 
> query with filter condition on type incompatible column a on this UNION-ALL 
> results  will cause IndexOutOfBoundsException
>   Such as TPC-H table "orders", we  CREATE VIEW `view_orders` AS select 
> `oo`.`o_orderkey` , `oo`.`o_custkey`  from (  select  `orders`.`o_orderkey` , 
> `rcfileorders`.`o_custkey` from `tpch270g`.`rcfileorders`   union all  select 
> `orcfileorders`.`o_orderkey` , 0L as `o_custkey`   from  
> `tpch270g`.`textfileorders`) `oo`.
>Type of "o_custkey" is INT,  the type of corresponding constant "0" is 
> BIGINT.
>Then the fllowing query(with filter incompatible column 0_custkey)  will 
> fail  with  java.lang.IndexOutOfBoundsException:
> "select count(1) from view_orders  where o_custkey<10 ".



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HIVE-11695) If user have no permission to create LOCAL DIRECTORY ，the Hql does not throw any exception and fail silently.

2015-09-18 Thread WangMeng (JIRA)


[ 
https://issues.apache.org/jira/browse/HIVE-11695?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14852764#comment-14852764
 ] 

WangMeng commented on HIVE-11695:
-

[~ashutoshc]  and [~xuefuz] :The above failures are about hcatalog ,this patch 
may have little associatition with Hcatlog. Thanks.

> If user have no permission to  create LOCAL DIRECTORY ，the Hql does not throw 
> any exception and fail silently.
> --
>
> Key: HIVE-11695
> URL: https://issues.apache.org/jira/browse/HIVE-11695
> Project: Hive
>  Issue Type: Bug
>  Components: Query Processor
>Affects Versions: 0.13.0, 0.14.0, 1.0.0, 1.2.0, 1.1.0, 1.2.1
>Reporter: WangMeng
>Assignee: WangMeng
> Attachments: HIVE-11695.01.patch
>
>
> If user have no permission to  create LOCAL DIRECTORY such as   
> "/data/wangmeng/hiveserver2"  ,the query does not throw any exception and 
> fail silently.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (HIVE-11880) IndexOutOfBoundsException when execute query with filter condition on type incompatible column on data(generated by UNION ALL with an union column is constant and it h

2015-09-18 Thread WangMeng (JIRA)


 [ 
https://issues.apache.org/jira/browse/HIVE-11880?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

WangMeng updated HIVE-11880:

Description: 
   For Hive UNION ALL , when a union column is constant(column a such as '0L')  
and it has incompatible type with the corresponding column A. The query with 
filter condition on type incompatible column a on this UNION-ALL results  will 
cause IndexOutOfBoundsException

  Such as TPC-H table "orders", we  CREATE VIEW `view_orders` AS select 
`oo`.`o_orderkey` , `oo`.`o_custkey`  from (  select  `orders`.`o_orderkey` , 
`rcfileorders`.`o_custkey` from `tpch270g`.`rcfileorders`   union all  select 
`orcfileorders`.`o_orderkey` , 0L as `o_custkey`   from  
`tpch270g`.`textfileorders`) `oo`.

   Type of 'o_custkey' is INT normally, while  the type of corresponding column 
constant "0" is BIGINT.

   Then the fllowing query(with filter incompatible column 0_custkey)  will 
fail  with  java.lang.IndexOutOfBoundsException:
'select count(1) from view_orders  where o_custkey<10 '.

  was:
   For Hive UNION ALL , when a union column is constant(column a such as '0L')  
and it has incompatible type with the corresponding column A. The query with 
filter condition on type incompatible column a on this UNION-ALL results  will 
cause IndexOutOfBoundsException

  Such as TPC-H table "orders", we  CREATE VIEW `view_orders` AS select 
`oo`.`o_orderkey` , `oo`.`o_custkey`  from (  select  `orders`.`o_orderkey` , 
`rcfileorders`.`o_custkey` from `tpch270g`.`rcfileorders`   union all  select 
`orcfileorders`.`o_orderkey` , 0L as `o_custkey`   from  
`tpch270g`.`textfileorders`) `oo`.

   Type of 'o_custkey' is INT,  the type of corresponding constant "0" is 
BIGINT.

   Then the fllowing query(with filter incompatible column 0_custkey)  will 
fail  with  java.lang.IndexOutOfBoundsException:
'select count(1) from view_orders  where o_custkey<10 '.


>IndexOutOfBoundsException when execute query with filter condition on type 
> incompatible column on data(generated by UNION ALL with an union column is 
> constant and it has incompatible type with corresponding column) 
> --
>
> Key: HIVE-11880
> URL: https://issues.apache.org/jira/browse/HIVE-11880
> Project: Hive
>  Issue Type: Bug
>  Components: Query Processor
>Affects Versions: 1.2.1
>Reporter: WangMeng
>Assignee: WangMeng
>
>For Hive UNION ALL , when a union column is constant(column a such as 
> '0L')  and it has incompatible type with the corresponding column A. The 
> query with filter condition on type incompatible column a on this UNION-ALL 
> results  will cause IndexOutOfBoundsException
>   Such as TPC-H table "orders", we  CREATE VIEW `view_orders` AS select 
> `oo`.`o_orderkey` , `oo`.`o_custkey`  from (  select  `orders`.`o_orderkey` , 
> `rcfileorders`.`o_custkey` from `tpch270g`.`rcfileorders`   union all  select 
> `orcfileorders`.`o_orderkey` , 0L as `o_custkey`   from  
> `tpch270g`.`textfileorders`) `oo`.
>Type of 'o_custkey' is INT normally, while  the type of corresponding 
> column constant "0" is BIGINT.
>Then the fllowing query(with filter incompatible column 0_custkey)  will 
> fail  with  java.lang.IndexOutOfBoundsException:
> 'select count(1) from view_orders  where o_custkey<10 '.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (HIVE-11880) IndexOutOfBoundsException when execute query with filter condition on type incompatible column on data(generated by UNION ALL with an union column is constant and it h

2015-09-18 Thread WangMeng (JIRA)


 [ 
https://issues.apache.org/jira/browse/HIVE-11880?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

WangMeng updated HIVE-11880:

Description: 
   For Hive UNION ALL , when a union column is constant(column a such as '0L')  
and it has incompatible type with the corresponding column A. The query with 
filter condition on type incompatible column a on this UNION-ALL results  will 
cause IndexOutOfBoundsException

  Such as TPC-H table "orders", we  CREATE VIEW `view_orders` AS select 
`oo`.`o_orderkey` , `oo`.`o_custkey`  from (  select  `orders`.`o_orderkey` , 
`rcfileorders`.`o_custkey` from `tpch270g`.`rcfileorders`   union all  select 
`orcfileorders`.`o_orderkey` , 0L as `o_custkey`   from  
`tpch270g`.`textfileorders`) `oo`.

   Type of 'o_custkey' is INT,  the type of corresponding constant "0" is 
BIGINT.

   Then the fllowing query(with filter incompatible column 0_custkey)  will 
fail  with  java.lang.IndexOutOfBoundsException:
'select count(1) from view_orders  where o_custkey<10 '.

  was:
   For Hive UNION ALL , when a union column is constant(column a such as '0L')  
and it has incompatible type with the corresponding column A. The query with 
filter condition on type incompatible column a on this UNION-ALL results  will 
cause IndexOutOfBoundsException

  Such as TPC-H table "orders", we  CREATE VIEW `view_orders` AS select 
`oo`.`o_orderkey` , `oo`.`o_custkey`  from (  select  `orders`.`o_orderkey` , 
`rcfileorders`.`o_custkey` from `tpch270g`.`rcfileorders`   union all  select 
`orcfileorders`.`o_orderkey` , 0L as `o_custkey`   from  
`tpch270g`.`textfileorders`) `oo`.

   Type of "o_custkey" is INT,  the type of corresponding constant "0" is 
BIGINT.

   Then the fllowing query(with filter incompatible column 0_custkey)  will 
fail  with  java.lang.IndexOutOfBoundsException:
"select count(1) from view_orders  where o_custkey<10 ".


>IndexOutOfBoundsException when execute query with filter condition on type 
> incompatible column on data(generated by UNION ALL with an union column is 
> constant and it has incompatible type with corresponding column) 
> --
>
> Key: HIVE-11880
> URL: https://issues.apache.org/jira/browse/HIVE-11880
> Project: Hive
>  Issue Type: Bug
>  Components: Query Processor
>Affects Versions: 1.2.1
>Reporter: WangMeng
>Assignee: WangMeng
>
>For Hive UNION ALL , when a union column is constant(column a such as 
> '0L')  and it has incompatible type with the corresponding column A. The 
> query with filter condition on type incompatible column a on this UNION-ALL 
> results  will cause IndexOutOfBoundsException
>   Such as TPC-H table "orders", we  CREATE VIEW `view_orders` AS select 
> `oo`.`o_orderkey` , `oo`.`o_custkey`  from (  select  `orders`.`o_orderkey` , 
> `rcfileorders`.`o_custkey` from `tpch270g`.`rcfileorders`   union all  select 
> `orcfileorders`.`o_orderkey` , 0L as `o_custkey`   from  
> `tpch270g`.`textfileorders`) `oo`.
>Type of 'o_custkey' is INT,  the type of corresponding constant "0" is 
> BIGINT.
>Then the fllowing query(with filter incompatible column 0_custkey)  will 
> fail  with  java.lang.IndexOutOfBoundsException:
> 'select count(1) from view_orders  where o_custkey<10 '.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (HIVE-11149) Fix issue with Thread unsafe Class HashMap in PerfLogger.java hangs in Multi-thread environment

2015-09-18 Thread WangMeng (JIRA)


 [ 
https://issues.apache.org/jira/browse/HIVE-11149?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

WangMeng updated HIVE-11149:

Attachment: (was: HIVE-11149.03.patch)

> Fix issue with Thread unsafe Class  HashMap in PerfLogger.java  hangs  in  
> Multi-thread environment
> ---
>
> Key: HIVE-11149
> URL: https://issues.apache.org/jira/browse/HIVE-11149
> Project: Hive
>  Issue Type: Bug
>  Components: Logging
>Affects Versions: 1.2.0
>Reporter: WangMeng
>Assignee: WangMeng
> Attachments: HIVE-11149.01.patch, HIVE-11149.02.patch, 
> HIVE-11149.03.patch
>
>
> In  Multi-thread environment,  the Thread unsafe Class HashMap in 
> PerfLogger.java  will  casue massive Java Processes hang  and cost  large 
> amounts of unnecessary CPU and Memory.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (HIVE-11149) Fix issue with Thread unsafe Class HashMap in PerfLogger.java hangs in Multi-thread environment

2015-09-18 Thread WangMeng (JIRA)


 [ 
https://issues.apache.org/jira/browse/HIVE-11149?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

WangMeng updated HIVE-11149:

Attachment: HIVE-11149.03.patch

> Fix issue with Thread unsafe Class  HashMap in PerfLogger.java  hangs  in  
> Multi-thread environment
> ---
>
> Key: HIVE-11149
> URL: https://issues.apache.org/jira/browse/HIVE-11149
> Project: Hive
>  Issue Type: Bug
>  Components: Logging
>Affects Versions: 1.2.0
>Reporter: WangMeng
>Assignee: WangMeng
> Attachments: HIVE-11149.01.patch, HIVE-11149.02.patch, 
> HIVE-11149.03.patch
>
>
> In  Multi-thread environment,  the Thread unsafe Class HashMap in 
> PerfLogger.java  will  casue massive Java Processes hang  and cost  large 
> amounts of unnecessary CPU and Memory.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HIVE-11695) If user have no permission to create LOCAL DIRECTORY ，the Hql does not throw any exception and fail silently.

2015-09-17 Thread WangMeng (JIRA)


[ 
https://issues.apache.org/jira/browse/HIVE-11695?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14804863#comment-14804863
 ] 

WangMeng commented on HIVE-11695:
-

[~hiveqa]

> If user have no permission to  create LOCAL DIRECTORY ，the Hql does not throw 
> any exception and fail silently.
> --
>
> Key: HIVE-11695
> URL: https://issues.apache.org/jira/browse/HIVE-11695
> Project: Hive
>  Issue Type: Bug
>  Components: Query Processor
>Affects Versions: 0.13.0, 0.14.0, 1.0.0, 1.2.0, 1.1.0, 1.2.1
>Reporter: WangMeng
>Assignee: WangMeng
> Attachments: HIVE-11695.01.patch
>
>
> If user have no permission to  create LOCAL DIRECTORY such as   
> "/data/wangmeng/hiveserver2"  ,the query does not throw any exception and 
> fail silently.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (HIVE-11695) If user have no permission to create LOCAL DIRECTORY ，the Hql does not throw any exception and fail silently.

2015-09-17 Thread WangMeng (JIRA)


 [ 
https://issues.apache.org/jira/browse/HIVE-11695?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

WangMeng updated HIVE-11695:

Fix Version/s: (was: 1.2.1)
   (was: 1.3.0)

> If user have no permission to  create LOCAL DIRECTORY ，the Hql does not throw 
> any exception and fail silently.
> --
>
> Key: HIVE-11695
> URL: https://issues.apache.org/jira/browse/HIVE-11695
> Project: Hive
>  Issue Type: Bug
>  Components: Query Processor
>Affects Versions: 0.13.0, 0.14.0, 1.0.0, 1.2.0, 1.1.0, 1.2.1
>Reporter: WangMeng
>Assignee: WangMeng
> Attachments: HIVE-11695.01.patch
>
>
> If user have no permission to  create LOCAL DIRECTORY such as   
> "/data/wangmeng/hiveserver2"  ,the query does not throw any exception and 
> fail silently.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HIVE-11149) Fix issue with Thread unsafe Class HashMap in PerfLogger.java hangs in Multi-thread environment

2015-09-17 Thread WangMeng (JIRA)


[ 
https://issues.apache.org/jira/browse/HIVE-11149?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14804885#comment-14804885
 ] 

WangMeng commented on HIVE-11149:
-

[~sershe] I know PerfLogger is threadlocal, but in current environment, some 
times Hive Client's  Java Process will hang at " 
java.util.HashMap.put(HashMap.java:494) "and never exit.
such as the following:
"main" prio=10 tid=0x7f938c014800 nid=0x4e1a runnable [0x7f9392f3c000]
java.lang.Thread.State: RUNNABLE
at java.util.HashMap.put(HashMap.java:494)
at org.apache.hadoop.hive.ql.log.PerfLogger.PerfLogBegin(PerfLogger.java:109)
at org.apache.hadoop.hive.ql.Driver.execute(Driver.java:1282)
at org.apache.hadoop.hive.ql.Driver.runInternal(Driver.java:1101)
at org.apache.hadoop.hive.ql.Driver.run(Driver.java:924)
at org.apache.hadoop.hive.ql.Driver.run(Driver.java:914)
at org.apache.hadoop.hive.cli.CliDriver.processLocalCmd(CliDriver.java:269)
at org.apache.hadoop.hive.cli.CliDriver.processCmd(CliDriver.java:221)
at org.apache.hadoop.hive.cli.CliDriver.processLine(CliDriver.java:431)
at org.apache.hadoop.hive.cli.CliDriver.processLine(CliDriver.java:367)
at org.apache.hadoop.hive.cli.CliDriver.executeDriver(CliDriver.java:750)
at org.apache.hadoop.hive.cli.CliDriver.run(CliDriver.java:694)
at org.apache.hadoop.hive.cli.CliDriver.main(CliDriver.java:633)
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
at 
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
at java.lang.reflect.Method.invoke(Method.java:606)
at org.apache.hadoop.util.RunJar.main(RunJar.java:212)

> Fix issue with Thread unsafe Class  HashMap in PerfLogger.java  hangs  in  
> Multi-thread environment
> ---
>
> Key: HIVE-11149
> URL: https://issues.apache.org/jira/browse/HIVE-11149
> Project: Hive
>  Issue Type: Bug
>  Components: Logging
>Affects Versions: 1.2.0
>Reporter: WangMeng
>Assignee: WangMeng
> Attachments: HIVE-11149.01.patch, HIVE-11149.02.patch
>
>
> In  Multi-thread environment,  the Thread unsafe Class HashMap in 
> PerfLogger.java  will  casue massive Java Processes hang  and cost  large 
> amounts of unnecessary CPU and Memory.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (HIVE-11695) If user have no permission to create LOCAL DIRECTORY ，the Hql does not throw any exception and fail silently.

2015-09-16 Thread WangMeng (JIRA)


 [ 
https://issues.apache.org/jira/browse/HIVE-11695?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

WangMeng updated HIVE-11695:

Summary: If user have no permission to  create LOCAL DIRECTORY ，the Hql 
does not throw any exception and fail silently.  (was: If user have no 
permission to  create LOCAL DIRECTORY such as   "/data/wangmeng/hiveserver2"  
,the Hql does not throw any exception and fail silently.)

> If user have no permission to  create LOCAL DIRECTORY ，the Hql does not throw 
> any exception and fail silently.
> --
>
> Key: HIVE-11695
> URL: https://issues.apache.org/jira/browse/HIVE-11695
> Project: Hive
>  Issue Type: Bug
>  Components: Query Processor
>Affects Versions: 0.13.0, 0.14.0, 1.0.0, 1.2.0, 1.1.0, 1.2.1
>Reporter: WangMeng
>Assignee: WangMeng
> Fix For: 1.3.0, 1.2.1
>
> Attachments: HIVE-11695.01.patch
>
>
> If user have no permission to  create LOCAL DIRECTORY such as   
> "/data/wangmeng/hiveserver2"  ,the query does not throw any exception and 
> fail silently.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (HIVE-11695) If user have no permission to create LOCAL DIRECTORY such as "/data/wangmeng/hiveserver2" ,the Hql does not throw any exception and fail silently.

2015-09-16 Thread WangMeng (JIRA)


 [ 
https://issues.apache.org/jira/browse/HIVE-11695?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

WangMeng updated HIVE-11695:

Summary: If user have no permission to  create LOCAL DIRECTORY such as   
"/data/wangmeng/hiveserver2"  ,the Hql does not throw any exception and fail 
silently.  (was:  Hql "insert  overwrite  LOCAL DIRECTORY " does not throw 
exception if Hive user does not have write-permission of the DIRECTORY)

> If user have no permission to  create LOCAL DIRECTORY such as   
> "/data/wangmeng/hiveserver2"  ,the Hql does not throw any exception and fail 
> silently.
> --
>
> Key: HIVE-11695
> URL: https://issues.apache.org/jira/browse/HIVE-11695
> Project: Hive
>  Issue Type: Bug
>  Components: Query Processor
>Affects Versions: 0.13.0, 0.14.0, 1.0.0, 1.2.0, 1.1.0, 1.2.1
>Reporter: WangMeng
>Assignee: WangMeng
> Fix For: 1.3.0, 1.2.1
>
> Attachments: HIVE-11695.01.patch
>
>
> If user have no permission to  create LOCAL DIRECTORY such as   
> "/data/wangmeng/hiveserver2"  ,the query does not throw any exception and 
> fail silently.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (HIVE-11695) Hql "insert overwrite LOCAL DIRECTORY " does not throw exception if Hive user does not have write-permission of the DIRECTORY

2015-09-16 Thread WangMeng (JIRA)


 [ 
https://issues.apache.org/jira/browse/HIVE-11695?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

WangMeng updated HIVE-11695:

Attachment: HIVE-11695.01.patch

>  Hql "insert  overwrite  LOCAL DIRECTORY " does not throw exception if Hive 
> user does not have write-permission of the DIRECTORY
> 
>
> Key: HIVE-11695
> URL: https://issues.apache.org/jira/browse/HIVE-11695
> Project: Hive
>  Issue Type: Bug
>  Components: Query Processor
>Affects Versions: 0.13.0, 0.14.0, 1.0.0, 1.2.0, 1.1.0, 1.2.1
>Reporter: WangMeng
>Assignee: WangMeng
> Attachments: HIVE-11695.01.patch
>
>
> If user have no write permission to LOCAL DIRECTORY such as   
> "/data/wangmeng/"  ,the query "insert  overwrite LOCAL  DIRECTORY  
> "/data/wangmeng/hiveserver2" does not throw any exception and fail silently.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HIVE-11695) Hql "insert overwrite LOCAL DIRECTORY " does not throw exception if Hive user does not have write-permission of the DIRECTORY

2015-09-16 Thread WangMeng (JIRA)


[ 
https://issues.apache.org/jira/browse/HIVE-11695?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14791586#comment-14791586
 ] 

WangMeng commented on HIVE-11695:
-

if  hive user  can not  create local target directory  and return false, this 
should throw an exception rather than fail sliently.
[~ashutoshc] Can you please take a look at this small patch? Thanks.


>  Hql "insert  overwrite  LOCAL DIRECTORY " does not throw exception if Hive 
> user does not have write-permission of the DIRECTORY
> 
>
> Key: HIVE-11695
> URL: https://issues.apache.org/jira/browse/HIVE-11695
> Project: Hive
>  Issue Type: Bug
>  Components: Query Processor
>Affects Versions: 0.13.0, 0.14.0, 1.0.0, 1.2.0, 1.1.0, 1.2.1
>Reporter: WangMeng
>Assignee: WangMeng
> Fix For: 1.3.0, 1.2.1
>
> Attachments: HIVE-11695.01.patch
>
>
> If user have no write permission to LOCAL DIRECTORY such as   
> "/data/wangmeng/"  ,the query "insert  overwrite LOCAL  DIRECTORY  
> "/data/wangmeng/hiveserver2" does not throw any exception and fail silently.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (HIVE-11695) Hql "insert overwrite LOCAL DIRECTORY " does not throw exception if Hive user does not have write-permission of the DIRECTORY

2015-09-16 Thread WangMeng (JIRA)


 [ 
https://issues.apache.org/jira/browse/HIVE-11695?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

WangMeng updated HIVE-11695:

Description: If user have no permission to  create LOCAL DIRECTORY such as  
 "/data/wangmeng/hiveserver2"  ,the query does not throw any exception and fail 
silently.  (was: If user have no write permission to LOCAL DIRECTORY such as   
"/data/wangmeng/"  ,the query "insert  overwrite LOCAL  DIRECTORY  
"/data/wangmeng/hiveserver2" does not throw any exception and fail silently.)

>  Hql "insert  overwrite  LOCAL DIRECTORY " does not throw exception if Hive 
> user does not have write-permission of the DIRECTORY
> 
>
> Key: HIVE-11695
> URL: https://issues.apache.org/jira/browse/HIVE-11695
> Project: Hive
>  Issue Type: Bug
>  Components: Query Processor
>Affects Versions: 0.13.0, 0.14.0, 1.0.0, 1.2.0, 1.1.0, 1.2.1
>Reporter: WangMeng
>Assignee: WangMeng
> Fix For: 1.3.0, 1.2.1
>
> Attachments: HIVE-11695.01.patch
>
>
> If user have no permission to  create LOCAL DIRECTORY such as   
> "/data/wangmeng/hiveserver2"  ,the query does not throw any exception and 
> fail silently.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (HIVE-11695) Hql "insert overwrite to LOCAL DIRECTORY " can not throw exception when Hive user does not have write-permission of the DIRECTORY

2015-08-31 Thread WangMeng (JIRA)


 [ 
https://issues.apache.org/jira/browse/HIVE-11695?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

WangMeng updated HIVE-11695:

Summary:  Hql "insert  overwrite to LOCAL DIRECTORY " can not throw 
exception when Hive user does not have write-permission of the DIRECTORY  (was: 
 Hql "write to LOCAL DIRECTORY " can not throw exception when Hive user does 
not have write-permission of the DIRECTORY)

>  Hql "insert  overwrite to LOCAL DIRECTORY " can not throw exception when 
> Hive user does not have write-permission of the DIRECTORY
> ---
>
> Key: HIVE-11695
> URL: https://issues.apache.org/jira/browse/HIVE-11695
> Project: Hive
>  Issue Type: Bug
>  Components: Query Processor
>Affects Versions: 0.13.0, 0.14.0, 1.0.0, 1.2.0, 1.1.0, 1.2.1
>Reporter: WangMeng
>Assignee: WangMeng
>
> If user have no write permission to LOCAL DIRECTORY such as   
> "/data/wangmeng/"  ,the query "insert  overwrite LOCAL  DIRECTORY  
> "/data/wangmeng/hiveserver2" does not throw any exception and fails silently.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (HIVE-11695) Hql "insert overwrite LOCAL DIRECTORY " can not throw exception when Hive user does not have write-permission of the DIRECTORY

2015-08-31 Thread WangMeng (JIRA)


 [ 
https://issues.apache.org/jira/browse/HIVE-11695?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

WangMeng updated HIVE-11695:

Summary:  Hql "insert  overwrite  LOCAL DIRECTORY " can not throw exception 
when Hive user does not have write-permission of the DIRECTORY  (was:  Hql 
"insert  overwrite to LOCAL DIRECTORY " can not throw exception when Hive user 
does not have write-permission of the DIRECTORY)

>  Hql "insert  overwrite  LOCAL DIRECTORY " can not throw exception when Hive 
> user does not have write-permission of the DIRECTORY
> -
>
> Key: HIVE-11695
> URL: https://issues.apache.org/jira/browse/HIVE-11695
> Project: Hive
>  Issue Type: Bug
>  Components: Query Processor
>Affects Versions: 0.13.0, 0.14.0, 1.0.0, 1.2.0, 1.1.0, 1.2.1
>Reporter: WangMeng
>Assignee: WangMeng
>
> If user have no write permission to LOCAL DIRECTORY such as   
> "/data/wangmeng/"  ,the query "insert  overwrite LOCAL  DIRECTORY  
> "/data/wangmeng/hiveserver2" does not throw any exception and fails silently.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (HIVE-11695) Hql "write to LOCAL DIRECTORY " can not throw exception when Hive user does not have write-permission of the DIRECTORY

2015-08-31 Thread WangMeng (JIRA)


 [ 
https://issues.apache.org/jira/browse/HIVE-11695?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

WangMeng updated HIVE-11695:

Description: If user have no write permission to LOCAL DIRECTORY such as   
"/data/wangmeng/"  ,the query "insert  overwrite LOCAL  DIRECTORY  
"/data/wangmeng/hiveserver2" does not throw any exception and fails silently.  
(was:  For Hive user who dose not have write permission of  LOCAL DIRECTORY 
such as   "/data/wangmeng/"  , when the user executes Hql "insert  overwrite 
LOCAL  DIRECTORY  "/data/wangmeng/hiveserver2" ,this query can not throw any 
exception  and pretend to have finished successfully.)

>  Hql "write to LOCAL DIRECTORY " can not throw exception when Hive user does 
> not have write-permission of the DIRECTORY
> ---
>
> Key: HIVE-11695
> URL: https://issues.apache.org/jira/browse/HIVE-11695
> Project: Hive
>  Issue Type: Bug
>  Components: Query Processor
>Affects Versions: 0.13.0, 0.14.0, 1.0.0, 1.2.0, 1.1.0, 1.2.1
>Reporter: WangMeng
>Assignee: WangMeng
>
> If user have no write permission to LOCAL DIRECTORY such as   
> "/data/wangmeng/"  ,the query "insert  overwrite LOCAL  DIRECTORY  
> "/data/wangmeng/hiveserver2" does not throw any exception and fails silently.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (HIVE-11695) Hql "insert overwrite LOCAL DIRECTORY " does not throw exception if Hive user does not have write-permission of the DIRECTORY

2015-08-31 Thread WangMeng (JIRA)


 [ 
https://issues.apache.org/jira/browse/HIVE-11695?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

WangMeng updated HIVE-11695:

Summary:  Hql "insert  overwrite  LOCAL DIRECTORY " does not throw 
exception if Hive user does not have write-permission of the DIRECTORY  (was:  
Hql "insert  overwrite  LOCAL DIRECTORY " does not throw exception when Hive 
user does not have write-permission of the DIRECTORY)

>  Hql "insert  overwrite  LOCAL DIRECTORY " does not throw exception if Hive 
> user does not have write-permission of the DIRECTORY
> 
>
> Key: HIVE-11695
> URL: https://issues.apache.org/jira/browse/HIVE-11695
> Project: Hive
>  Issue Type: Bug
>  Components: Query Processor
>Affects Versions: 0.13.0, 0.14.0, 1.0.0, 1.2.0, 1.1.0, 1.2.1
>Reporter: WangMeng
>Assignee: WangMeng
>
> If user have no write permission to LOCAL DIRECTORY such as   
> "/data/wangmeng/"  ,the query "insert  overwrite LOCAL  DIRECTORY  
> "/data/wangmeng/hiveserver2" does not throw any exception and fail silently.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (HIVE-11695) Hql "write to LOCAL DIRECTORY " can not throw exception when Hive user does not have write-promotion of the DIRECTORY

2015-08-31 Thread WangMeng (JIRA)


 [ 
https://issues.apache.org/jira/browse/HIVE-11695?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

WangMeng updated HIVE-11695:

Summary:  Hql "write to LOCAL DIRECTORY " can not throw exception when Hive 
user does not have write-promotion of the DIRECTORY  (was:  Hql "write to LOCAL 
DIRECTORY " can not throws exception when Hive user does not have 
write-promotion of the DIRECTORY)

>  Hql "write to LOCAL DIRECTORY " can not throw exception when Hive user does 
> not have write-promotion of the DIRECTORY
> --
>
> Key: HIVE-11695
> URL: https://issues.apache.org/jira/browse/HIVE-11695
> Project: Hive
>  Issue Type: Bug
>  Components: Query Processor
>Affects Versions: 0.13.0, 0.14.0, 1.0.0, 1.2.0, 1.1.0, 1.2.1
>Reporter: WangMeng
>Assignee: WangMeng
>
>  For Hive user who dose not have write promotion of  LOCAL DIRECTORY such as  
>  "/data/wangmeng/"  , when the user executes Hql "insert  overwrite LOCAL  
> DIRECTORY  "/data/wangmeng/hiveserver2" ,this query can not throw any 
> exception  and pretend to have finished successfully.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (HIVE-11695) Hql "insert overwrite LOCAL DIRECTORY " does not throw exception when Hive user does not have write-permission of the DIRECTORY

2015-08-31 Thread WangMeng (JIRA)


 [ 
https://issues.apache.org/jira/browse/HIVE-11695?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

WangMeng updated HIVE-11695:

Description: If user have no write permission to LOCAL DIRECTORY such as   
"/data/wangmeng/"  ,the query "insert  overwrite LOCAL  DIRECTORY  
"/data/wangmeng/hiveserver2" does not throw any exception and fail silently.  
(was: If user have no write permission to LOCAL DIRECTORY such as   
"/data/wangmeng/"  ,the query "insert  overwrite LOCAL  DIRECTORY  
"/data/wangmeng/hiveserver2" does not throw any exception and fails silently.)

>  Hql "insert  overwrite  LOCAL DIRECTORY " does not throw exception when Hive 
> user does not have write-permission of the DIRECTORY
> --
>
> Key: HIVE-11695
> URL: https://issues.apache.org/jira/browse/HIVE-11695
> Project: Hive
>  Issue Type: Bug
>  Components: Query Processor
>Affects Versions: 0.13.0, 0.14.0, 1.0.0, 1.2.0, 1.1.0, 1.2.1
>Reporter: WangMeng
>Assignee: WangMeng
>
> If user have no write permission to LOCAL DIRECTORY such as   
> "/data/wangmeng/"  ,the query "insert  overwrite LOCAL  DIRECTORY  
> "/data/wangmeng/hiveserver2" does not throw any exception and fail silently.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (HIVE-11695) Hql "insert overwrite LOCAL DIRECTORY " does not throw exception when Hive user does not have write-permission of the DIRECTORY

2015-08-31 Thread WangMeng (JIRA)


 [ 
https://issues.apache.org/jira/browse/HIVE-11695?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

WangMeng updated HIVE-11695:

Summary:  Hql "insert  overwrite  LOCAL DIRECTORY " does not throw 
exception when Hive user does not have write-permission of the DIRECTORY  (was: 
 Hql "insert  overwrite  LOCAL DIRECTORY " can not throw exception when Hive 
user does not have write-permission of the DIRECTORY)

>  Hql "insert  overwrite  LOCAL DIRECTORY " does not throw exception when Hive 
> user does not have write-permission of the DIRECTORY
> --
>
> Key: HIVE-11695
> URL: https://issues.apache.org/jira/browse/HIVE-11695
> Project: Hive
>  Issue Type: Bug
>  Components: Query Processor
>Affects Versions: 0.13.0, 0.14.0, 1.0.0, 1.2.0, 1.1.0, 1.2.1
>Reporter: WangMeng
>Assignee: WangMeng
>
> If user have no write permission to LOCAL DIRECTORY such as   
> "/data/wangmeng/"  ,the query "insert  overwrite LOCAL  DIRECTORY  
> "/data/wangmeng/hiveserver2" does not throw any exception and fails silently.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (HIVE-11149) Fix issue with Thread unsafe Class HashMap in PerfLogger.java hangs in Multi-thread environment

2015-06-30 Thread WangMeng (JIRA)


 [ 
https://issues.apache.org/jira/browse/HIVE-11149?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

WangMeng updated HIVE-11149:

Description: In  Multi-thread environment,  the Thread unsafe Class HashMap 
in PerfLogger.java  will  casue massive Java Processes hang  and cost  large 
amounts of unnecessary CPU and Memory.  (was: In  Multi-thread environment,  
the Thread unsafe Class HashMap in PerfLogger.java  will hang  and cost  large 
amounts of unnecessary CPU and Memory.)

 Fix issue with Thread unsafe Class  HashMap in PerfLogger.java  hangs  in  
 Multi-thread environment
 ---

 Key: HIVE-11149
 URL: https://issues.apache.org/jira/browse/HIVE-11149
 Project: Hive
  Issue Type: Bug
  Components: Logging
Affects Versions: 1.2.0
Reporter: WangMeng
Assignee: WangMeng
 Fix For: 1.2.0

 Attachments: HIVE-11149.01.patch


 In  Multi-thread environment,  the Thread unsafe Class HashMap in 
 PerfLogger.java  will  casue massive Java Processes hang  and cost  large 
 amounts of unnecessary CPU and Memory.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (HIVE-11149) Fix issue with Thread unsafe Class HashMap in PerfLogger.java hangs in Multi-thread environment

2015-06-30 Thread WangMeng (JIRA)


 [ 
https://issues.apache.org/jira/browse/HIVE-11149?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

WangMeng updated HIVE-11149:

Attachment: HIVE-11149.01.patch

 Fix issue with Thread unsafe Class  HashMap in PerfLogger.java  hangs  in  
 Multi-thread environment
 ---

 Key: HIVE-11149
 URL: https://issues.apache.org/jira/browse/HIVE-11149
 Project: Hive
  Issue Type: Bug
  Components: Logging
Affects Versions: 1.2.0
Reporter: WangMeng
Assignee: WangMeng
 Fix For: 1.2.0

 Attachments: HIVE-11149.01.patch


 In  Multi-thread environment,  the Thread unsafe Class HashMap in 
 PerfLogger.java  will hang  and cost  large amounts of unnecessary CPU and 
 Memory.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (HIVE-10971) count(*) with count(distinct) gives wrong results when hive.groupby.skewindata=true

2015-06-09 Thread wangmeng (JIRA)


 [ 
https://issues.apache.org/jira/browse/HIVE-10971?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

wangmeng updated HIVE-10971:

Attachment: HIVE-10971.01.patch

 count(*) with count(distinct) gives wrong results when 
 hive.groupby.skewindata=true
 ---

 Key: HIVE-10971
 URL: https://issues.apache.org/jira/browse/HIVE-10971
 Project: Hive
  Issue Type: Bug
  Components: Logical Optimizer
Affects Versions: 1.2.0
Reporter: wangmeng
Assignee: wangmeng
 Attachments: HIVE-10971.01.patch


 When hive.groupby.skewindata=true, the following query based on TPC-H gives 
 wrong results:
 {code}
 set hive.groupby.skewindata=true;
 select l_returnflag, count(*), count(distinct l_linestatus)
 from lineitem
 group by l_returnflag
 limit 10;
 {code}
 The query plan shows that it generates only one MapReduce job instead of two 
 theoretically, which is dictated by hive.groupby.skewindata=true.
 The problem arises only when {noformat}count(*){noformat} and 
 {noformat}count(distinct){noformat} exist together.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (HIVE-10971) count(*) with count(distinct) gives wrong results when hive.groupby.skewindata=true

2015-06-09 Thread wangmeng (JIRA)


 [ 
https://issues.apache.org/jira/browse/HIVE-10971?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

wangmeng updated HIVE-10971:

Component/s: (was: Hive)
 Logical Optimizer

 count(*) with count(distinct) gives wrong results when 
 hive.groupby.skewindata=true
 ---

 Key: HIVE-10971
 URL: https://issues.apache.org/jira/browse/HIVE-10971
 Project: Hive
  Issue Type: Bug
  Components: Logical Optimizer
Affects Versions: 1.2.0
Reporter: wangmeng
Assignee: wangmeng

 When hive.groupby.skewindata=true, the following query based on TPC-H gives 
 wrong results:
 {code}
 set hive.groupby.skewindata=true;
 select l_returnflag, count(*), count(distinct l_linestatus)
 from lineitem
 group by l_returnflag
 limit 10;
 {code}
 The query plan shows that it generates only one MapReduce job instead of two 
 theoretically, which is dictated by hive.groupby.skewindata=true.
 The problem arises only when {noformat}count(*){noformat} and 
 {noformat}count(distinct){noformat} exist together.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (HIVE-10971) count(*) with count(distinct) gives wrong results when hive.groupby.skewindata=true

2015-06-09 Thread wangmeng (JIRA)


 [ 
https://issues.apache.org/jira/browse/HIVE-10971?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

wangmeng updated HIVE-10971:

Attachment: HIVE-10971.01.patch

 count(*) with count(distinct) gives wrong results when 
 hive.groupby.skewindata=true
 ---

 Key: HIVE-10971
 URL: https://issues.apache.org/jira/browse/HIVE-10971
 Project: Hive
  Issue Type: Bug
  Components: Logical Optimizer
Affects Versions: 1.2.0
Reporter: wangmeng
Assignee: wangmeng
 Attachments: HIVE-10971.01.patch, HIVE-10971.01.patch


 When hive.groupby.skewindata=true, the following query based on TPC-H gives 
 wrong results:
 {code}
 set hive.groupby.skewindata=true;
 select l_returnflag, count(*), count(distinct l_linestatus)
 from lineitem
 group by l_returnflag
 limit 10;
 {code}
 The query plan shows that it generates only one MapReduce job instead of two 
 theoretically, which is dictated by hive.groupby.skewindata=true.
 The problem arises only when {noformat}count(*){noformat} and 
 {noformat}count(distinct){noformat} exist together.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (HIVE-10971) count(*) with count(distinct) gives wrong results when hive.groupby.skewindata=true

2015-06-09 Thread wangmeng (JIRA)


 [ 
https://issues.apache.org/jira/browse/HIVE-10971?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

wangmeng updated HIVE-10971:

Attachment: (was: HIVE-10971.01.patch)

 count(*) with count(distinct) gives wrong results when 
 hive.groupby.skewindata=true
 ---

 Key: HIVE-10971
 URL: https://issues.apache.org/jira/browse/HIVE-10971
 Project: Hive
  Issue Type: Bug
  Components: Logical Optimizer
Affects Versions: 1.2.0
Reporter: wangmeng
Assignee: wangmeng
 Attachments: HIVE-10971.01.patch


 When hive.groupby.skewindata=true, the following query based on TPC-H gives 
 wrong results:
 {code}
 set hive.groupby.skewindata=true;
 select l_returnflag, count(*), count(distinct l_linestatus)
 from lineitem
 group by l_returnflag
 limit 10;
 {code}
 The query plan shows that it generates only one MapReduce job instead of two 
 theoretically, which is dictated by hive.groupby.skewindata=true.
 The problem arises only when {noformat}count(*){noformat} and 
 {noformat}count(distinct){noformat} exist together.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HIVE-10971) count(*) with count(distinct) gives wrong results when hive.groupby.skewindata=true

2015-06-09 Thread wangmeng (JIRA)


[ 
https://issues.apache.org/jira/browse/HIVE-10971?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14578602#comment-14578602
 ] 

wangmeng commented on HIVE-10971:
-

{code}
hive set  hive.groupby.skewindata=true;
hive explain select l_returnflag,count(*),count(distinct  l_linestatus) from 
lineitem  group by l_returnflag  limit 10;
OK
STAGE DEPENDENCIES:
  Stage-1 is a root stage
  Stage-0 is a root stage

STAGE PLANS:
  Stage: Stage-1
Map Reduce
  Map Operator Tree:
  TableScan
alias: lineitem
Statistics: Num rows: 1008537518 Data size: 201707503616 Basic 
stats: COMPLETE Column stats: NONE
Select Operator
  expressions: l_returnflag (type: string), l_linestatus (type: 
string)
  outputColumnNames: l_returnflag, l_linestatus
  Statistics: Num rows: 1008537518 Data size: 201707503616 Basic 
stats: COMPLETE Column stats: NONE
  Group By Operator
aggregations: count(), count(DISTINCT l_linestatus)
keys: l_returnflag (type: string), l_linestatus (type: string)
mode: hash
outputColumnNames: _col0, _col1, _col2, _col3
Statistics: Num rows: 1008537518 Data size: 201707503616 Basic 
stats: COMPLETE Column stats: NONE
Reduce Output Operator
  key expressions: _col0 (type: string), _col1 (type: string)
  sort order: ++
  Map-reduce partition columns: _col0 (type: string)
  Statistics: Num rows: 1008537518 Data size: 201707503616 
Basic stats: COMPLETE Column stats: NONE
  value expressions: _col2 (type: bigint)
  Reduce Operator Tree:
Group By Operator
  aggregations: count(VALUE._col0), count(DISTINCT KEY._col1:0._col0)
  keys: KEY._col0 (type: string)
  mode: complete
  outputColumnNames: _col0, _col1, _col2
  Statistics: Num rows: 504268759 Data size: 100853751808 Basic stats: 
COMPLETE Column stats: NONE
  Select Operator
expressions: _col0 (type: string), _col1 (type: bigint), _col2 
(type: bigint)
outputColumnNames: _col0, _col1, _col2
Statistics: Num rows: 504268759 Data size: 100853751808 Basic 
stats: COMPLETE Column stats: NONE
Limit
  Number of rows: 10
  Statistics: Num rows: 10 Data size: 2000 Basic stats: COMPLETE 
Column stats: NONE
  File Output Operator
compressed: true
Statistics: Num rows: 10 Data size: 2000 Basic stats: COMPLETE 
Column stats: NONE
table:
input format: org.apache.hadoop.mapred.TextInputFormat
output format: 
org.apache.hadoop.hive.ql.io.HiveIgnoreKeyTextOutputFormat
serde: org.apache.hadoop.hive.serde2.lazy.LazySimpleSerDe

  Stage: Stage-0
Fetch Operator
  limit: 10
{code}

When hive.groupby.skewindata=false, the Group By operator has mode 
mergepartial, which gives the correct results.

 count(*) with count(distinct) gives wrong results when 
 hive.groupby.skewindata=true
 ---

 Key: HIVE-10971
 URL: https://issues.apache.org/jira/browse/HIVE-10971
 Project: Hive
  Issue Type: Bug
  Components: Hive
Affects Versions: 1.2.0
Reporter: wangmeng
Assignee: wangmeng

 When hive.groupby.skewindata=true, the following query based on TPC-H gives 
 wrong results:
 {code}
 set hive.groupby.skewindata=true;
 select l_returnflag, count(*), count(distinct l_linestatus)
 from lineitem
 group by l_returnflag
 limit 10;
 {code}
 The query plan shows that it generates only one MapReduce job instead of two 
 theoretically, which is dictated by hive.groupby.skewindata=true.
 The problem arises only when {noformat}count(*){noformat} and 
 {noformat}count(distinct){noformat} exist together.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (HIVE-10971) count(*) with count(distinct) gives wrong results when hive.groupby.skewindata=true

2015-06-09 Thread wangmeng (JIRA)


 [ 
https://issues.apache.org/jira/browse/HIVE-10971?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

wangmeng updated HIVE-10971:

Description: 
When hive.groupby.skewindata=true, the following query based on TPC-H gives 
wrong results:

{code}
set hive.groupby.skewindata=true;

select l_returnflag, count(*), count(distinct l_linestatus)
from lineitem
group by l_returnflag
limit 10;
{code}

The query plan shows that it generates only one MapReduce job instead of two 
theoretically, which is dictated by hive.groupby.skewindata=true.

The problem arises only when {noformat}count(*){noformat} and 
{noformat}count(distinct){noformat} exist together.

  was:
When hive.groupby.skewindata=true, the following query based on TPC-H gives 
wrong results:

{code}
set hive.groupby.skewindata=true;

select l_returnflag, count(*), count(distinct l_linestatus)
from lineitem
group by l_returnflag
limit 10;
{code}

The query plan shows that it generates only one MapReduce job instead of two, 
which is dictated by hive.groupby.skewindata=true.

The problem arises only when {noformat}count(*){noformat} and 
{noformat}count(distinct){noformat} exist together.


 count(*) with count(distinct) gives wrong results when 
 hive.groupby.skewindata=true
 ---

 Key: HIVE-10971
 URL: https://issues.apache.org/jira/browse/HIVE-10971
 Project: Hive
  Issue Type: Bug
  Components: Hive
Affects Versions: 1.2.0
Reporter: wangmeng
Assignee: wangmeng

 When hive.groupby.skewindata=true, the following query based on TPC-H gives 
 wrong results:
 {code}
 set hive.groupby.skewindata=true;
 select l_returnflag, count(*), count(distinct l_linestatus)
 from lineitem
 group by l_returnflag
 limit 10;
 {code}
 The query plan shows that it generates only one MapReduce job instead of two 
 theoretically, which is dictated by hive.groupby.skewindata=true.
 The problem arises only when {noformat}count(*){noformat} and 
 {noformat}count(distinct){noformat} exist together.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HIVE-10478) resolved

2015-06-09 Thread wangmeng (JIRA)


[ 
https://issues.apache.org/jira/browse/HIVE-10478?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14579978#comment-14579978
 ] 

wangmeng commented on HIVE-10478:
-

Hi, I also encountered the same problem ,how did you solve it ?  SET 
hive.exec.parallel=false?  Thanks.

 resolved
 

 Key: HIVE-10478
 URL: https://issues.apache.org/jira/browse/HIVE-10478
 Project: Hive
  Issue Type: Task
  Components: Hive
Reporter: anna ken
  Labels: hadoop, hive, hue, kryo

 resolved



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

74 matches

Mail list logo