Re: Hive footprint

2016-04-19 Thread Mich Talebzadeh
The issue is that Hive has indexes (not index store) but they don't work so there we go. May be in later releases we can make use of these indexes for faster queries. Hive allows even bitmap indexes on Fact table but they are never used by COB. show indexes on sales; +---+

Re: Hive footprint

2016-04-19 Thread Mich Talebzadeh
BTW what is the situation with Impala? Dr Mich Talebzadeh LinkedIn * https://www.linkedin.com/profile/view?id=AAEWh2gBxianrbJd6zP6AcPCCdOABUrV8Pw * http://talebzadehmich.wordpress.com On 19 April 2016 a

Re: Hive footprint

2016-04-19 Thread Amey Barve
Hi Peyman, You say: "you can use Hive storage handler to read data from HBase the performance would be lower than reading from HDFS directly for analytic." Why is it so? Is it slow as compared to ORC, Parquet, and even Text file format? Regards, Amey On Tue, Apr 19, 2016 at 4:32 AM, Peyman Mohaj

Question on Implementing CASE in Hive Join

2016-04-19 Thread Kishore A
Hi, I have a scenario to implement to cases in Hive Joins. I need to implement case on the value on which join condition to be applied. Table A Code// Type// Indicator// Value// A 1 XYZ John B 1 PQR Smith C 2 XYZ John C 2 PQR

RE: Question on Implementing CASE in Hive Join

2016-04-19 Thread Markovitz, Dudu
Before dealing with the technical aspect, can you please explain what is the point of using LEFT JOIN without selecting any field from table A? Thanks Dudu From: Kishore A [mailto:kishore.atmak...@gmail.com] Sent: Tuesday, April 19, 2016 2:29 PM To: user@hive.apache.org Subject: Question on Imp

Re: Question on Implementing CASE in Hive Join

2016-04-19 Thread Kishore A
Hi Dudu, Actually we use both fields from left and right tables, I mentioned right table just for my convenience to check whether ALL from right table can be pulled as per join condition match. One more reason why we use left join is we should not have extra columns after join. Kishore On Tue

RE: Question on Implementing CASE in Hive Join

2016-04-19 Thread Markovitz, Dudu
Please try the following two options. Option 2 might be better, performance wise (depending of the data volume and characteristics). P.s. I didn’t understand the explanation about the LEFT JOIN Dudu 1. select b.code ,b.value fromb left join

Re: Hive footprint

2016-04-19 Thread Peyman Mohajerian
Hi Amey, It is about seek vs scan. HBase is great in case a rowkey or a range of rowkeys is part of the where clause, then you do a seek and ORC/Parquest reading off HDFS would not do better in absence of an index. However for Data Warehouse that is generally not what you do, you mostly do scan, e

Data_encyption(rdbms_to_hive)

2016-04-19 Thread Ajay Chander
Hi Everyone, I am just trying to understand if there is any default 'data encryption/decryption' involved when we sqoop the data from rdbms to hive. If there is so, can someone point me to a material which reads about it? Thanks for your time! Regards, Aj

Re: Data_encyption(rdbms_to_hive)

2016-04-19 Thread Ajay Chander
This is to understand if sqoop has the capability to encrypt the data in transit from Man in the Middle attacks? Any pointers are appreciated. Thanks On Tuesday, April 19, 2016, Ajay Chander wrote: > Hi Everyone, > > I am just trying to understand if there is any default 'data > encryption/decr

Writing HFiles using Hive for an HBase bulk load - possible bug?

2016-04-19 Thread Tim Robertson
Hi folks, I am trying to create HFiles from a Hive table to bulk load into HBase and am following the HWX [1] tutorial. It creates the HFiles correctly but then fails when closing the RecordWriter with the following stack trace. Error: java.lang.RuntimeException: Hive Runtime Error while closing

Re: [VOTE] Bylaws change to allow some commits without review

2016-04-19 Thread Lars Francke
Thanks everyone! Vote runs for at least one more day. I'd appreciate it if you could ping/bump your colleagues to chime in here. I'm not entirely sure how many PMC members are active and how many votes we need but I think a few more are probably needed. On Mon, Apr 18, 2016 at 8:02 PM, Thejas Nai

Re: [VOTE] Bylaws change to allow some commits without review

2016-04-19 Thread Alpesh Patel
+1 On Tue, Apr 19, 2016 at 1:29 PM, Lars Francke wrote: > Thanks everyone! Vote runs for at least one more day. I'd appreciate it if > you could ping/bump your colleagues to chime in here. > > I'm not entirely sure how many PMC members are active and how many votes > we need but I think a few mo

Re: Standard Deviation in Hive 2 is still incorrect

2016-04-19 Thread Alan Gates
Have you filed a JIRA ticket for this? If not, please do so we can track it and fix it. Patches are welcomed as well. :) Alan. > On Apr 4, 2016, at 15:27, Mich Talebzadeh wrote: > > > Hi, > > I reported back in April 2015 that what Hive calls Standard Deviation > Function STDDEV is a poi

Re: [VOTE] Bylaws change to allow some commits without review

2016-04-19 Thread Jimmy Xiang
+1 On Tue, Apr 19, 2016 at 2:58 PM, Alpesh Patel wrote: > +1 > > On Tue, Apr 19, 2016 at 1:29 PM, Lars Francke > wrote: >> >> Thanks everyone! Vote runs for at least one more day. I'd appreciate it if >> you could ping/bump your colleagues to chime in here. >> >> I'm not entirely sure how many P

Re: [VOTE] Bylaws change to allow some commits without review

2016-04-19 Thread Tim Robertson
+1 On Wed, Apr 20, 2016 at 1:24 AM, Jimmy Xiang wrote: > +1 > > On Tue, Apr 19, 2016 at 2:58 PM, Alpesh Patel > wrote: > > +1 > > > > On Tue, Apr 19, 2016 at 1:29 PM, Lars Francke > > wrote: > >> > >> Thanks everyone! Vote runs for at least one more day. I'd appreciate it > if > >> you could p

Re: Hive footprint

2016-04-19 Thread Amey Barve
Thanks Peyman, Is running and evaluating TPCH queries with HBaseStorageHandler vs Hive's Text format comparable? What is the standard set of queries generally used for performance comparison, What queries did you use above? Regards, Amey On Tue, Apr 19, 2016 at 7:28 PM, Peyman Mohajerian wrot

Re: Standard Deviation in Hive 2 is still incorrect

2016-04-19 Thread Mich Talebzadeh
Will do thanks Dr Mich Talebzadeh LinkedIn * https://www.linkedin.com/profile/view?id=AAEWh2gBxianrbJd6zP6AcPCCdOABUrV8Pw * http://talebzadehmich.wordpress.com On 19 April 2016 at 23:33, Alan Gates wro