Re: Is YSmart integrated into Hive on tez ?

2015-08-31 Thread Jeff Zhang
Hi Pengcheng, Is there reason why the correlation optimization disabled in tez ? And even when I change the code to enable the correlation optimization in tez. I still get the same query plan. >>> Vertex dependency in root stage >>> Reducer 2 <- Map 1 (SIMPLE_EDGE), Map 4 (SIMPLE_EDGE) >>> Reduc

Re: Serializing dags

2015-08-31 Thread canan chen
Hi Raajay, Regarding how hive build DAG, you can check https://github.com/apache/hive/blob/72f97fc7760134465333983fc40766e9e864e643/ql/src/java/org/apache/hadoop/hive/ql/exec/tez/TezTask.java Check method TezTask#build Do you mind share what kind of changes you want to make on DAGs ? On Tue, Se

When use HiveServer2 to execute "write Sql result to LOCAL DIRECTORY " ,it write result to HiveServer2 Host rather than the Hive Client which connects HiveServer2

2015-08-31 Thread LifeLearner
Unlike Hive CLI , When I connected to HiveServer2 and then execute " insert overwrite LOCAL DIRECTORY "/data/wangmeng/hiveserver2" select * from textfileorders limit 10" . This query will write results to HiveServer2 Host rather than My Hive Client Host. Is there any way can se

Re: Reading tables will lock entire database in Hive 0.13

2015-08-31 Thread Jim Green
I found the answer…hive.txn.manager is added starting in hive 0.13. Thanks all. On Mon, Aug 31, 2015 at 3:54 PM, Jim Green wrote: > Hi Team, > > This issue only happens on hive 0.13, not in Hive 0.12. > Does anybody know what is the change between 0.12 and 0.13 which caused > this issue? > > Tha

Re: Reading tables will lock entire database in Hive 0.13

2015-08-31 Thread Jim Green
Hi Team, This issue only happens on hive 0.13, not in Hive 0.12. Does anybody know what is the change between 0.12 and 0.13 which caused this issue? Thanks. On Sat, Feb 28, 2015 at 3:47 AM, Chaoyu Tang wrote: > It was fixed for DummyTxnManager. DbTxnManager should not have this issue. > > Than

Re: Is it possible to set the data schema on a per-partition basis?

2015-08-31 Thread Dominik Choma
thanks for reply Im assuming I can't use avro without reprocess all my stored data(delimited,lzo-compressed text), Am I right ? So I'm wonder if I can use statement *ALTER TABLE xxx PARTITION (**) CHANGE COLUMN (...)* to create diffrent HCatalog schema for parttions based on datafiles with chan

Hive User Group Meeting Singapore

2015-08-31 Thread Xuefu Zhang
Dear Hive users, Hive community is considering a user group meeting during Hadoop World that will be held in Singarpore [1] Dec 1-3, 2015. As I understand, this will be the first time that this meeting ever happens in Asia Pacific even though there is a large user base in that region. As another g

Re: Is it possible to set the data schema on a per-partition basis?

2015-08-31 Thread Edward Capriolo
Yes. Specifically the avro ser-de like avro support "evolving schema". On Mon, Aug 31, 2015 at 5:15 PM, Dominik Choma wrote: > I have external hcat structures over lzo-compressed datafiles , data is > partitioned by date string > Is it possible to handle schema changes by setting diffrent schema

Is it possible to set the data schema on a per-partition basis?

2015-08-31 Thread Dominik Choma
I have external hcat structures over lzo-compressed datafiles , data is partitioned by date string Is it possible to handle schema changes by setting diffrent schema(column names & datatypes) per-partition? Thanks, Dominik.

Re: Hive on Spark

2015-08-31 Thread Xuefu Zhang
What you described isn't part of the functionality of Hive on Spark. Rather, Spark is used here as a general purpose engine similar to MR but without intemediate stages. It's batch origientated. Keeping 100T data in memory is hardly beneficial unless you know that that dataset is going to be used

Serializing dags

2015-08-31 Thread Raajay
Hello, Currently, I am running Hive on Tez. I wish to make some changes to the DAGs generated by HIve before running on Tez/Yarn. Which data structure should i serialize ? DAG or DagPlan ? - Raajay

Re: Is YSmart integrated into Hive on tez ?

2015-08-31 Thread Pengcheng Xiong
Hi Jeff, From code base point of view, YSmart is integrated into Hive on Tez because it is one of the optimization of the current Hive. However, from the execution point of view, it is now disabled when Hive is running on Tez. You may take look at the source code of Hive Optimizer.java, L17

RE: python libraries to execute or call hive queries

2015-08-31 Thread rakesh sharma
Hi Gopal Have you tried pyhs2 libraryIt has many useful functions to retrieve the data thanksrakesh > Date: Fri, 28 Aug 2015 11:53:20 -0700 > Subject: Re: python libraries to execute or call hive queries > From: gop...@apache.org > To: user@hive.apache.org > > > > Can anyone suggest any python l