Re: HIVE:1.2, Query taking huge time

2015-08-20 Thread Nishant Aggarwal
Thanks for the reply Noam. I have already tried the later point of dividing the query. But the challenge comes during the joining of the table. Thanks and Regards Nishant Aggarwal, PMP Cell No:- +91 99588 94305 On Thu, Aug 20, 2015 at 2:19 PM, Noam Hasson noam.has...@kenshoo.com wrote: Hi,

Re: query behaviors with subquery in clause

2015-08-20 Thread Nitin Pawar
any help guys ? On Thu, Aug 13, 2015 at 2:52 PM, Nitin Pawar nitinpawar...@gmail.com wrote: Hi, right now hive does not support the equality clause in sub-queries. for ex: select * from A where date = (select max(date) from B) It though supports IN clause select * from A where date in

Re: HIVE:1.2, Query taking huge time

2015-08-20 Thread Noam Hasson
Hi, Have you look at counters in Hadoop side? It's possible you are dealing with a bad join which causes multiplication of items, if you see huge number of record input/output in map/reduce phase and keeps increasing that's probably the case. Another thing I would try is to divide the job into

Re: HiveServer2 Kerberos

2015-08-20 Thread Loïc Chanel
Indeed, I don't need the password, but why is Beeline asking me for one ? To what does it correspond ? Thanks again, Loïc Loïc CHANEL Engineering student at TELECOM Nancy Trainee at Worldline - Villeurbanne 2015-08-19 18:22 GMT+02:00 Jary Du jary...@gmail.com: Correct me if I am wrong, my

HIVE:1.2, Query taking huge time

2015-08-20 Thread Nishant Aggarwal
Dear Hive Users, I am in process of running over a poc to one of my customer demonstrating the huge performance benefits of Hadoop BigData using Hive. Following is the problem statement i am stuck with. I have generate a large table with 28 columns( all are double). Table size on disk is 70GB

Re: HIVE:1.2, Query taking huge time

2015-08-20 Thread Jörn Franke
Additionally, although it is a PoC you should have a realistic data model. Furthermore, following good data modeling practices should be taken into account. Joining on a double is not one of them. It should be int. Furthermore, double is a type that is in most scenarios rarely used. In the

Re: hiveserver2 hangs

2015-08-20 Thread kulkarni.swar...@gmail.com
Sanjeev, One possibility is that you are running into[1] which affects hive 0.13. Is it possible for you to apply the patch on [1] and see if it fixes your problem? [1] https://issues.apache.org/jira/browse/HIVE-10410 On Thu, Aug 20, 2015 at 6:12 PM, Sanjeev Verma sanjeev.verm...@gmail.com

Re: query behaviors with subquery in clause

2015-08-20 Thread Noam Hasson
I observed in other situation, when ever you run queries where you don't specify statistics partitions, Hive doesn't pre-compute which one to take so it will take all the table. I would suggest implementing the max date by code in a separate query. On Thu, Aug 20, 2015 at 12:16 PM, Nitin Pawar

Re: hiveserver2 hangs

2015-08-20 Thread Noam Hasson
We had a case of retrieving a record which is bigger than the GC limit, for example a column with Array or Map type that has 1M cells. On Wed, Aug 19, 2015 at 9:35 PM, Sanjeev Verma sanjeev.verm...@gmail.com wrote: Can somebody gives me some pointer to looked upon? On Wed, Aug 19, 2015 at

Re: hiveserver2 hangs

2015-08-20 Thread Sanjeev Verma
We are using hive-0.13 with hadoop1. On Thu, Aug 20, 2015 at 11:49 AM, kulkarni.swar...@gmail.com kulkarni.swar...@gmail.com wrote: Sanjeev, Can you tell me more details about your hive version/hadoop version etc. On Wed, Aug 19, 2015 at 1:35 PM, Sanjeev Verma sanjeev.verm...@gmail.com

Query doubt

2015-08-20 Thread rakesh sharma
Hi All I am trying to get all the names starting with r.My query looks like select * from emp where name like 'r~%'; But it doesn't result anything; Any help thanksrakesh

Re: hiveserver2 hangs

2015-08-20 Thread kulkarni.swar...@gmail.com
Sanjeev, Can you tell me more details about your hive version/hadoop version etc. On Wed, Aug 19, 2015 at 1:35 PM, Sanjeev Verma sanjeev.verm...@gmail.com wrote: Can somebody gives me some pointer to looked upon? On Wed, Aug 19, 2015 at 9:26 AM, Sanjeev Verma sanjeev.verm...@gmail.com

FileNotFound Exception

2015-08-20 Thread rakesh sharma
Hi All Please help hive insert into emp values (1, rakesh);Query ID = rakesh_20150820223622_e538e1d0-26b0-4747-a553-e68a96d58954Total jobs = 3Launching Job 1 out of 3Number of reduce tasks is set to 0 since there's no reduce operatorjava.io.FileNotFoundException: File does not exist:

Re: Running python UDF in hive

2015-08-20 Thread Manjee, Sunile
Did you test your python script stand alone to verify it works as expected? From: rakesh sharma rakeshsharm...@hotmail.commailto:rakeshsharm...@hotmail.com Reply-To: user@hive.apache.orgmailto:user@hive.apache.org user@hive.apache.orgmailto:user@hive.apache.org Date: Thursday, August 20, 2015

Running python UDF in hive

2015-08-20 Thread rakesh sharma
Hi all I am running a python UDF in hive. I am getting the following error. hive select transform(id) using 'python transform_value.py' as (id string) from test;Query ID = 19659_20150820175050_ccb3b5e2-7e45-44a6-b16f-c4a4ad59e8f2Total jobs = 1Launching Job 1 out of 1 Status: Running

Re: HIVE:1.2, Query taking huge time

2015-08-20 Thread Xuefu Zhang
Please check out HIVE-11502. For your poc, you can simply get around using other data types instead of double. On Thu, Aug 20, 2015 at 2:08 AM, Nishant Aggarwal nishant@gmail.com wrote: Thanks for the reply Noam. I have already tried the later point of dividing the query. But the challenge

Re: query behaviors with subquery in clause

2015-08-20 Thread Nitin Pawar
Thanks Noam. As we are doing this via oozie, it will be either EL Action of something else I will just get around with a temp table and do a join with temp table with date column On Thu, Aug 20, 2015 at 5:27 PM, Noam Hasson noam.has...@kenshoo.com wrote: I observed in other situation, when

Re: HiveServer2 Kerberos

2015-08-20 Thread Jary Du
How does Beeline ask you? What happens if you just press enter? On Aug 20, 2015, at 12:15 AM, Loïc Chanel loic.cha...@telecomnancy.net wrote: Indeed, I don't need the password, but why is Beeline asking me for one ? To what does it correspond ? Thanks again, Loïc Loïc CHANEL

Re: HiveServer2 Kerberos

2015-08-20 Thread Loïc Chanel
!connect jdbc:hive2:// 192.168.6.210:1/db;principal=hive/hiveh...@westeros.wl org.apache.hive.jdbc.HiveDriver scan complete in 13ms Connecting to jdbc:hive2:// 192.168.6.210:1/db;principal=hive/hiveh...@westeros.wl Enter password for jdbc:hive2://

RE: Running python UDF in hive

2015-08-20 Thread Ryan Harris
remember that transform scripts in hive should receive data from STDIN and return results to STDOUT. So, to properly test your transform script try this: hive -e select id from test limit 10 testout.txt cat testout.txt | python transform_value.py if your transform script is working correctly,

Re: Query doubt

2015-08-20 Thread Pengcheng Xiong
Hi Rakesh, If you would like to get all the names starting with r, you can use select * from emp where name like 'r%'; or select * from emp where name rlike '^r'; or select * from emp where name regexp '^r'; Thanks. Pengcheng On Thu, Aug 20, 2015 at 11:40 AM, rakesh sharma

Partitioning hive

2015-08-20 Thread Jeetendra G
I all I have a folder structure like /housing/events/event1/date=20150820 stored in HDFS. I wanted to create the partition based on year month and day how can I do this during creating or loading data time? Regards Jeet

Hive over JDBC disable task conversion

2015-08-20 Thread Emil Berglind
I’m running a Hive query over JDBC in a Java app that I wrote. I want to be able to turn off task conversion as I am looking to stream the data back. I thought I could do that by using the following JDBC URL: jdbc:hive2:// 192.168.132.128:1/default?hive.fetch.task.conversion=none. My SQL

Re: HiveServer2 Kerberos

2015-08-20 Thread Jary Du
My understanding is that it will always ask you user/password even though you don’t need them. It is just the way how hive is setup. On Aug 20, 2015, at 8:28 AM, Loïc Chanel loic.cha...@telecomnancy.net wrote: !connect

Re: Partitioning hive

2015-08-20 Thread Vishal Dhavale
You can create table like Create table_name (colname type ) partitioned by (date string ) On 20-Aug-2015 9:39 PM, Jeetendra G jeetendr...@housing.com wrote: I all I have a folder structure like /housing/events/event1/date=20150820 stored in HDFS. I wanted to create the partition based on year