Hive-Hbase vs Phoenix-Hbase

2016-05-05 Thread Mich Talebzadeh
Hi, on this topic, the conclusion all along seems to be (quote): 1. Hive is batch oriented (aka slow); it translates the SQL query into MapReduce jobs and is mostly used in offline data processing. 2. Phoenix is a SQL layer between applications and HBase; it provides ad-hoc queries in

ListBucketing feature does not support uppercase string.

2016-05-05 Thread Jim Green
Hi Team, I found that when the skew value is an uppercase string, ListBucketing does not work. https://issues.apache.org/jira/browse/HIVE-13697 has been filed. For example: 1. This works: CREATE TABLE testskew (id INT, a STRING) SKEWED BY (a) ON ('abc', 'xyz') STORED AS DIRECTORIES; set

RE: Spark Streaming, Batch interval, Windows length and Sliding Interval settings

2016-05-05 Thread Ryan Harris
This is really outside the scope of Hive and would probably be better addressed by the Spark community; however, I can say that this very much depends on your use case. Take a look at this discussion if you haven't already:

Re: multiple selects on a left join give incorrect result

2016-05-05 Thread Jesus Camacho Rodriguez
I created https://issues.apache.org/jira/browse/HIVE-13693 to track the issue. I have already submitted a fix. Thanks, Jesús From: Frank Luo > Reply-To: "user@hive.apache.org"

Re: Predicates for 'like' and 'between' operators to custom storage handler.

2016-05-05 Thread Mich Talebzadeh
Right. What is the underlying Hive table format? Is it Parquet, Avro, ORC...? Also, do you store your time as raw time in the Hive table? For example, this is the way I store the timestamp in an ORC table: TO_DATE(FROM_UNIXTIME(UNIX_TIMESTAMP(TransactionDate,'dd/MM/'),'-MM-dd')) AS

Re: Predicates for 'like' and 'between' operators to custom storage handler.

2016-05-05 Thread Amey Barve
Hi, "Do you have the equivalent of that operation in pure SQL?" ---> This is my Hive query: *select count(*) from u_data where unixtime like '%888904884%'* The query evaluates and the results are correct. But the point is that Hive does not pass the LIKE operator during predicate pushdown to the custom storage

Re: Predicates for 'like' and 'between' operators to custom storage handler.

2016-05-05 Thread Mich Talebzadeh
Hi, Do you have the equivalent of that operation in pure SQL? Also, have you tried the Spark query tool with the Hive table? I gather you are doing this through Java? Dr Mich Talebzadeh LinkedIn * https://www.linkedin.com/profile/view?id=AAEWh2gBxianrbJd6zP6AcPCCdOABUrV8Pw

Re: Predicates for 'like' and 'between' operators to custom storage handler.

2016-05-05 Thread Amey Barve
Thanks Mich, It does work, but with operators other than '=', '!=', '<', '<=', '>' and '>=', my code with the custom storage handler gets a null expression. *final String expression = conf.get(TableScanDesc.FILTER_EXPR_CONF_STR); // expression is null for a Hive query that has the LIKE operator* Why does

Re: Predicates for 'like' and 'between' operators to custom storage handler.

2016-05-05 Thread Mich Talebzadeh
On a normal query using SQL in *Hive 2*, the LIKE predicate works fine. Case in point: in a 1-billion-row table with the column random_string of type varchar(50), I have one row that satisfies the following

Predicates for 'like' and 'between' operators to custom storage handler.

2016-05-05 Thread Amey Barve
Hi All, I have implemented a custom storage handler and am able to get predicates from Hive for the '=', '!=', '<', '<=', '>' and '>=' operators. But I cannot get predicates from Hive for the 'like' and 'between' operators. Here's my code: *final String expression =

Re: Spark Streaming, Batch interval, Windows length and Sliding Interval settings

2016-05-05 Thread Mich Talebzadeh
Any ideas/experience on this? On 4 May 2016 at 21:45,