ListBucketing feature does not support uppercase string.

2016-05-05 Thread Jim Green
Hi Team, I found that ListBucketing does not work when the skew value is an uppercase string. https://issues.apache.org/jira/browse/HIVE-13697 has been filed. For example: 1. This works: CREATE TABLE testskew (id INT, a STRING) SKEWED BY (a) ON ('abc', 'xyz') STORED AS DIRECTORIES; set

Anyway to show current user name on beeline

2016-02-11 Thread Jim Green
Hi Team, I could not find a way to show the currently logged-on user in beeline. Is there any way to show that? Something like: Show current_user; ? -- Thanks, www.openkb.info (Open KnowledgeBase for Hadoop/Database/OS/Network/Tool)
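For reference, a minimal sketch of one way to do this, assuming Hive 1.2.0 or later, where the built-in current_user() function is available (older releases do not have it). The name it returns comes from the configured authenticator, so with some authenticators it may differ from the name used to connect.

```sql
-- Run from a beeline session connected to HiveServer2.
-- Assumption: Hive 1.2.0+; current_user() is not available in earlier releases.
SELECT current_user();
```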

Difference between hive.mapjoin.smalltable.filesize and hive.auto.convert.join.noconditionaltask.size

2016-01-31 Thread Jim Green
Sharing an article about the difference between hive.mapjoin.smalltable.filesize and hive.auto.convert.join.noconditionaltask.size. Both of them control map join behavior, but they are not the same. http://www.openkb.info/2016/01/difference-between-hivemapjoinsmalltabl.html -- Thanks, www.openkb.info
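As a rough sketch of how the two parameters are usually set, based on my reading of the Hive configuration documentation rather than on the linked article: hive.mapjoin.smalltable.filesize is the small-table threshold when the conversion goes through a conditional task, while hive.auto.convert.join.noconditionaltask.size only takes effect when hive.auto.convert.join.noconditionaltask is on. The values shown are the documented defaults, in bytes.

```sql
-- Enable automatic conversion of common joins to map joins.
set hive.auto.convert.join=true;

-- Case 1: conversion through a conditional task.
-- The small table's file size is compared against this threshold.
set hive.auto.convert.join.noconditionaltask=false;
set hive.mapjoin.smalltable.filesize=25000000;

-- Case 2: direct conversion with no conditional task.
-- For an n-way join, the combined size of n-1 tables must fit under this size.
set hive.auto.convert.join.noconditionaltask=true;
set hive.auto.convert.join.noconditionaltask.size=10000000;
```

Only one of the two cases is active in a given session; the second pair of settings overrides the first if the whole block is run as written.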

Which user should start the local task if Hive impersonation is enabled

2015-11-10 Thread Jim Green
Hi Team, I am trying to understand what the expected behavior is when Hive impersonation is enabled. Say the HiveServer2 process is running as userA, and userB is connecting through beeline. If userB creates a table, the table file should be owned by userB because impersonation is enabled. However If userB is

Table level stats are not shown after insert starting in Hive 0.13?

2015-09-03 Thread Jim Green
*Hive 0.12:* After insert SQL: Partition default.mytablepar{id=111} stats: [num_files: 1, num_rows: 0, total_size: 4, raw_data_size: 0] Table default.mytablepar stats: [num_partitions: 1, num_files: 1, num_rows: 0, total_size: 4, raw_data_size: 0] *Hive 0.13:* After insert SQL: Partition

Re: Table level stats are not shown after insert starting in Hive 0.13?

2015-09-03 Thread Jim Green
Adding Dev user list. Could somebody help take a look? On Thu, Sep 3, 2015 at 12:25 PM, Jim Green <openkbi...@gmail.com> wrote: > Also tried Hive 1.0, and the result is the same as Hive 0.13. > Is there any reason why we do not print the table level stats for > partition table?

Re: Wrong results from join query in Hive 0.13 and also 1.0 with reproduce.

2015-09-02 Thread Jim Green
criteria. You can avoid this bug if you > rewrite your query such that it violates one of the requirement (listed > above) to trigger the bug. > > Ashutosh > > > On Wed, Sep 2, 2015 at 10:19 AM, Jim Green <openkbi...@gmail.com> wrote: > >> Hi Ashutosh, >>

Wrong results from join query in Hive 0.13 and also 1.0 with reproduce.

2015-09-01 Thread Jim Green
Hi Team, Below is the minimum reproduce of wrong results in Hive 0.13: *1. Create 4 tables* CREATE EXTERNAL TABLE testjoin1( joincol string ); CREATE EXTERNAL TABLE testjoin2( anothercol string , joincol string); CREATE EXTERNAL TABLE testjoin3( anothercol string); CREATE EXTERNAL TABLE

Re: Wrong results from join query in Hive 0.13 and also 1.0 with reproduce.

2015-09-01 Thread Jim Green
It seems Hive 1.2 fixed this issue, but I am not sure which JIRA is related. Is it possible to backport this fix to Hive 0.13? On Tue, Sep 1, 2015 at 5:35 PM, Jim Green <openkbi...@gmail.com> wrote: > Hi Team, > > Below is the minimum reproduce of wrong results in Hive 0.13:

Re: Reading tables will lock entire database in Hive 0.13

2015-08-31 Thread Jim Green
not have this issue. > > Thanks > > On Thu, Feb 26, 2015 at 1:52 PM, Jim Green <openkbi...@gmail.com> wrote: > >> Hi Chaoyu, >> >> Thanks for the information. >> Actually by default, if we set hive.support.con

Re: Reading tables will lock entire database in Hive 0.13

2015-08-31 Thread Jim Green
I found the answer…hive.txn.manager is added starting in hive 0.13. Thanks all. On Mon, Aug 31, 2015 at 3:54 PM, Jim Green <openkbi...@gmail.com> wrote: > Hi Team, > > This issue only happens on hive 0.13, not in Hive 0.12. > Does anybody know what is the change between 0

Tez: Any way to avoid creating subdirectories by "Insert with union all"?

2015-08-19 Thread Jim Green
Hi Team, The insert with union all below creates subdirectories: set hive.execution.engine=tez; create table h1_passwords_target like h1_passwords; insert overwrite table h1_passwords_target select * from (select * from h1_passwords limit 1 union all select * from h1_passwords limit 2 )

UDTF fails with java.lang.ClassCastException

2015-08-06 Thread Jim Green
Hi Team, One UDTF fails in Hive 1.0 with below stacktrace: Caused by: java.lang.ClassCastException: org.apache.hadoop.hive.serde2.lazy.LazyString cannot be cast to org.apache.hadoop.io.Text at

Re: Hive on Tez query failed with "wrong key class"

2015-07-27 Thread Jim Green
IFile should be specific to Tez. Hive does not have that code to check the key class and value class. Does anyone know why Tez checks the key and value class when sorting? Thanks. On Tue, Jul 21, 2015 at 5:26 PM, Jim Green openkbi...@gmail.com wrote: Sample stacktrace is : [Error

Hive on Tez query failed with "wrong key class"

2015-07-21 Thread Jim Green
Hi Team, Env: Hive 1.0 on Tez 0.5.3 Query is a simple group-by on top of sequence table. It fails with below error on tez mode: *java.lang.RuntimeException: org.apache.hadoop.hive.ql.metadata.HiveException: * *java.io.IOException: java.io.IOException: wrong key class:

Re: Hive on Tez query failed with "wrong key class"

2015-07-21 Thread Jim Green
(HiveContextAwareRecordReader.java:358) ... 21 more ], On Tue, Jul 21, 2015 at 11:26 AM, Bikas Saha bi...@hortonworks.com wrote: A full stack trace would help determine if this is a Tez issue or a Hive issue. *From:* Jim Green [mailto:openkbi...@gmail.com] *Sent:* Tuesday, July 21

Hive Tez support matrix

2015-07-07 Thread Jim Green
Hi Team, Is there any Hive - Tez support matrix? For example, which version of Tez should Hive 1.2 run on? Which versions of Hive does Tez 0.5.3 support? etc… My understanding is that the specific combination of Hive and Tez versions does not matter. -- Thanks, www.openkb.info (Open KnowledgeBase

Re: Hive Tez support matrix

2015-07-07 Thread Jim Green
and Tez-0.6.x are compatible with Tez-0.5.x. I believe Hive 0.13 is compatible only with Tez 0.4. For Hive 0.14 onwards ( including the Hive-1.x. releases ), they should work with anything in the range of Tez versions: 0.5.2 <= x <= 0.7.x . thanks -- Hitesh On Jul 7, 2015, at 10:12 AM, Jim Green

Re: Hive Tez support matrix

2015-07-07 Thread Jim Green
the release pom.xml :) Bikas *From:* Jim Green [mailto:openkbi...@gmail.com] *Sent:* Tuesday, July 07, 2015 2:58 PM *To:* user@hive.apache.org *Cc:* u...@tez.apache.org *Subject:* Re: Hive Tez support matrix Do you know where is the hive documentation about

Re: Is hive 0.13 index working fine on partition tables?

2015-07-06 Thread Jim Green
Does anyone know the JIRAs related to this issue? On Mon, Jun 29, 2015 at 2:35 PM, Jim Green openkbi...@gmail.com wrote: Hi Team, On hive 0.13, I have a minimum reproduce for index on partition table issue: CREATE TABLE test_partition_index( id1 bigint, id2 bigint, id3 bigint) PARTITIONED

Hive 1.0 on Tez 0.5.3 reproduced HIVE-8204

2015-07-01 Thread Jim Green
Hi Team, Is HIVE-8204 really fixed or not? Below test can reproduce exactly the same stacktrace when dynamic partition pruning is enabled. *1. Create test tables* create table h1_fact(id int, id2 string) PARTITIONED BY (mmdd string) row format delimited fields terminated by ','; create table

Is hive 0.13 index working fine on partition tables?

2015-06-29 Thread Jim Green
Hi Team, On hive 0.13, I have a minimum reproduce for index on partition table issue: CREATE TABLE test_partition_index( id1 bigint, id2 bigint, id3 bigint) PARTITIONED BY ( dt string) row format delimited fields terminated by ','; cat sampledata 111,222,333 LOAD DATA LOCAL INPATH 'sampledata'

Hive transaction feature in Hive 1.0

2015-06-12 Thread Jim Green
Hi Team, Sharing the article which explains the Hive transaction features in Hive 1.0: Hive transaction feature in Hive 1.0 http://www.openkb.info/2015/06/hive-transaction-feature-in-hive-10.html -- Thanks, www.openkb.info (Open KnowledgeBase for Hadoop/Database/OS/Network/Tool)
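For readers who want a starting point, here is a minimal sketch of using the transaction feature, based on the Hive transactions documentation as I understand it and not on the linked article. The table name acid_demo and its columns are hypothetical. It assumes the metastore side also has compaction enabled (hive.compactor.initiator.on=true and hive.compactor.worker.threads greater than 0).

```sql
-- Client-side settings documented as required for transactions in Hive 1.0.
set hive.support.concurrency=true;
set hive.enforce.bucketing=true;
set hive.exec.dynamic.partition.mode=nonstrict;
set hive.txn.manager=org.apache.hadoop.hive.ql.lockmgr.DbTxnManager;

-- Hypothetical table: transactional tables must be bucketed, stored as ORC,
-- and marked with the 'transactional' table property.
CREATE TABLE acid_demo (id INT, name STRING)
CLUSTERED BY (id) INTO 2 BUCKETS
STORED AS ORC
TBLPROPERTIES ('transactional'='true');

-- Row-level operations available on transactional tables.
INSERT INTO TABLE acid_demo VALUES (1, 'a'), (2, 'b');
UPDATE acid_demo SET name = 'c' WHERE id = 1;
DELETE FROM acid_demo WHERE id = 2;
```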

Understanding Hive Outer Join Behavior

2015-06-05 Thread Jim Green
Sharing one article about how to understand Hive Outer Join Behavior http://www.openkb.info/2015/06/understanding-hive-outer-join-behavior.html . -- Thanks, www.openkb.info (Open KnowledgeBase for Hadoop/Database/OS/Network/Tool)
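One commonly confusing case, sketched here with hypothetical tables orders and customers, is where a filter on the right-hand table is placed. This illustrates general SQL outer join semantics and is an assumption about the kind of behavior the article covers, not a summary of it.

```sql
-- Hypothetical tables: orders(id, cust_id) and customers(cust_id, region).

-- Filter in the ON clause: every row of orders is kept;
-- orders without a matching 'EU' customer get NULL for c.region.
SELECT o.id, c.region
FROM orders o
LEFT OUTER JOIN customers c
  ON o.cust_id = c.cust_id AND c.region = 'EU';

-- Filter in the WHERE clause: rows where c.region is NULL are dropped
-- after the join, so the result matches an inner join.
SELECT o.id, c.region
FROM orders o
LEFT OUTER JOIN customers c
  ON o.cust_id = c.cust_id
WHERE c.region = 'EU';
```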

Re: Why hive 0.13 will initialize derby database if the metastore parameters are not set in hive-site.xml?

2015-03-06 Thread Jim Green
metastore_db Thanks. On Fri, Mar 6, 2015 at 6:17 PM, Edward Capriolo edlinuxg...@gmail.com wrote: Make sure hive autogather stats is false. Or set up the stats db On Friday, March 6, 2015, Jim Green openkbi...@gmail.com wrote: Hi Team, Starting from hive 0.13, if the metastore parameters

Re: Why hive 0.13 will initialize derby database if the metastore parameters are not set in hive-site.xml?

2015-03-06 Thread Jim Green
Also, starting from Hive 0.13, hive.stats.dbclass=fs by default, so I think autostats may not be related here. Some other feature may need Derby…. On Fri, Mar 6, 2015 at 6:37 PM, Jim Green openkbi...@gmail.com wrote: Hi Edward, No matter if I set hive.stats.autogather=false in hive

Why hive 0.13 will initialize derby database if the metastore parameters are not set in hive-site.xml?

2015-03-06 Thread Jim Green
Hi Team, Starting from Hive 0.13, if the metastore parameters are not set in hive-site.xml but are set in .hiverc, Hive will try to initialize a Derby database in the current working directory. This behavior did not exist in Hive 0.12. Is it a known bug or a behavior change? I have the repro as below:

Re: Reading tables will lock entire database in Hive 0.13

2015-02-26 Thread Jim Green
Chaoyu On Wed, Feb 25, 2015 at 11:57 PM, Alan Gates alanfga...@gmail.com wrote: What lock manager are you using? Alan. Jim Green openkbi...@gmail.com February 25, 2015 at 17:24 Hi Team, Here is old thread about this issue: http://qnalist.com/questions/5091386/doubt-about-locking

Reading tables will lock entire database in Hive 0.13

2015-02-25 Thread Jim Green
Hi Team, Here is an old thread about this issue: http://qnalist.com/questions/5091386/doubt-about-locking-mechanism-in-hive I hit the same issue: when reading one table in session A, we cannot create a table in the same database in session B. Basically it causes database level lock in Hive

Regarding configuration of Storage based authorization in Hive 0.12

2015-01-16 Thread Jim Green
Hi Team, Per the documentation for Hive Storage based authorization in Hive, it has 2 uses: 1. Metastore side authorization 2. Client side authorization I have some questions because after enabling client side authorization by adding below configurations in hive-site.xml on client side, I got

Fwd: Question on ORC file stripe size.

2014-12-02 Thread Jim Green
Hi Team, I am creating this table: CREATE TABLE IF NOT EXISTS orctest2 ( id string, id2 string, id3 string, id4 string ) STORED AS ORC tblproperties (orc.stripe.size=1048576,orc.row.index.stride=”); The stripe size is set to 1MB. After loading data, the table file is about 60MB: -rwxr-xr-x