Thanks for the reply Noam. I have already tried the later point of dividing
the query. But the challenge comes during the joining of the table.
Thanks and Regards
Nishant Aggarwal, PMP
Cell No:- +91 99588 94305
On Thu, Aug 20, 2015 at 2:19 PM, Noam Hasson noam.has...@kenshoo.com
wrote:
Hi,
any help guys ?
On Thu, Aug 13, 2015 at 2:52 PM, Nitin Pawar nitinpawar...@gmail.com
wrote:
Hi,
right now hive does not support the equality clause in sub-queries.
for ex: select * from A where date = (select max(date) from B)
It though supports IN clause
select * from A where date in
Hi,
Have you look at counters in Hadoop side? It's possible you are dealing
with a bad join which causes multiplication of items, if you see huge
number of record input/output in map/reduce phase and keeps increasing
that's probably the case.
Another thing I would try is to divide the job into
Indeed, I don't need the password, but why is Beeline asking me for one ?
To what does it correspond ?
Thanks again,
Loïc
Loïc CHANEL
Engineering student at TELECOM Nancy
Trainee at Worldline - Villeurbanne
2015-08-19 18:22 GMT+02:00 Jary Du jary...@gmail.com:
Correct me if I am wrong, my
Dear Hive Users,
I am in process of running over a poc to one of my customer demonstrating
the huge performance benefits of Hadoop BigData using Hive.
Following is the problem statement i am stuck with.
I have generate a large table with 28 columns( all are double). Table size
on disk is 70GB
Additionally, although it is a PoC you should have a realistic data model.
Furthermore, following good data modeling practices should be taken into
account. Joining on a double is not one of them. It should be int.
Furthermore, double is a type that is in most scenarios rarely used. In the
Sanjeev,
One possibility is that you are running into[1] which affects hive 0.13. Is
it possible for you to apply the patch on [1] and see if it fixes your
problem?
[1] https://issues.apache.org/jira/browse/HIVE-10410
On Thu, Aug 20, 2015 at 6:12 PM, Sanjeev Verma sanjeev.verm...@gmail.com
I observed in other situation, when ever you run queries where you don't
specify statistics partitions, Hive doesn't pre-compute which one to take
so it will take all the table.
I would suggest implementing the max date by code in a separate query.
On Thu, Aug 20, 2015 at 12:16 PM, Nitin Pawar
We had a case of retrieving a record which is bigger than the GC limit, for
example a column with Array or Map type that has 1M cells.
On Wed, Aug 19, 2015 at 9:35 PM, Sanjeev Verma sanjeev.verm...@gmail.com
wrote:
Can somebody gives me some pointer to looked upon?
On Wed, Aug 19, 2015 at
We are using hive-0.13 with hadoop1.
On Thu, Aug 20, 2015 at 11:49 AM, kulkarni.swar...@gmail.com
kulkarni.swar...@gmail.com wrote:
Sanjeev,
Can you tell me more details about your hive version/hadoop version etc.
On Wed, Aug 19, 2015 at 1:35 PM, Sanjeev Verma sanjeev.verm...@gmail.com
Hi All
I am trying to get all the names starting with r.My query looks like select *
from emp where name like 'r~%';
But it doesn't result anything;
Any help
thanksrakesh
Sanjeev,
Can you tell me more details about your hive version/hadoop version etc.
On Wed, Aug 19, 2015 at 1:35 PM, Sanjeev Verma sanjeev.verm...@gmail.com
wrote:
Can somebody gives me some pointer to looked upon?
On Wed, Aug 19, 2015 at 9:26 AM, Sanjeev Verma sanjeev.verm...@gmail.com
Hi All
Please help
hive insert into emp values (1, rakesh);Query ID =
rakesh_20150820223622_e538e1d0-26b0-4747-a553-e68a96d58954Total jobs =
3Launching Job 1 out of 3Number of reduce tasks is set to 0 since there's no
reduce operatorjava.io.FileNotFoundException: File does not exist:
Did you test your python script stand alone to verify it works as expected?
From: rakesh sharma
rakeshsharm...@hotmail.commailto:rakeshsharm...@hotmail.com
Reply-To: user@hive.apache.orgmailto:user@hive.apache.org
user@hive.apache.orgmailto:user@hive.apache.org
Date: Thursday, August 20, 2015
Hi all
I am running a python UDF in hive. I am getting the following error.
hive select transform(id) using 'python transform_value.py' as (id string)
from test;Query ID =
19659_20150820175050_ccb3b5e2-7e45-44a6-b16f-c4a4ad59e8f2Total jobs =
1Launching Job 1 out of 1
Status: Running
Please check out HIVE-11502. For your poc, you can simply get around using
other data types instead of double.
On Thu, Aug 20, 2015 at 2:08 AM, Nishant Aggarwal nishant@gmail.com
wrote:
Thanks for the reply Noam. I have already tried the later point of
dividing the query. But the challenge
Thanks Noam.
As we are doing this via oozie, it will be either EL Action of something
else
I will just get around with a temp table and do a join with temp table with
date column
On Thu, Aug 20, 2015 at 5:27 PM, Noam Hasson noam.has...@kenshoo.com
wrote:
I observed in other situation, when
How does Beeline ask you? What happens if you just press enter?
On Aug 20, 2015, at 12:15 AM, Loïc Chanel loic.cha...@telecomnancy.net
wrote:
Indeed, I don't need the password, but why is Beeline asking me for one ? To
what does it correspond ?
Thanks again,
Loïc
Loïc CHANEL
!connect jdbc:hive2://
192.168.6.210:1/db;principal=hive/hiveh...@westeros.wl
org.apache.hive.jdbc.HiveDriver
scan complete in 13ms
Connecting to jdbc:hive2://
192.168.6.210:1/db;principal=hive/hiveh...@westeros.wl
Enter password for jdbc:hive2://
remember that transform scripts in hive should receive data from STDIN and
return results to STDOUT. So, to properly test your transform script try this:
hive -e select id from test limit 10 testout.txt
cat testout.txt | python transform_value.py
if your transform script is working correctly,
Hi Rakesh,
If you would like to get all the names starting with r, you can use
select * from emp where name like 'r%';
or
select * from emp where name rlike '^r';
or
select * from emp where name regexp '^r';
Thanks.
Pengcheng
On Thu, Aug 20, 2015 at 11:40 AM, rakesh sharma
I all I have a folder structure like /housing/events/event1/date=20150820
stored in HDFS.
I wanted to create the partition based on year month and day how can I do
this during creating or loading data time?
Regards
Jeet
I’m running a Hive query over JDBC in a Java app that I wrote. I want to be
able to turn off task conversion as I am looking to stream the data back. I
thought I could do that by using the following JDBC URL: jdbc:hive2://
192.168.132.128:1/default?hive.fetch.task.conversion=none. My SQL
My understanding is that it will always ask you user/password even though you
don’t need them. It is just the way how hive is setup.
On Aug 20, 2015, at 8:28 AM, Loïc Chanel loic.cha...@telecomnancy.net wrote:
!connect
You can create table like
Create table_name (colname type ) partitioned by (date string )
On 20-Aug-2015 9:39 PM, Jeetendra G jeetendr...@housing.com wrote:
I all I have a folder structure like /housing/events/event1/date=20150820
stored in HDFS.
I wanted to create the partition based on year
25 matches
Mail list logo