Re: Parquet support for Timestamp in 0.14

2015-02-03 Thread Szehon Ho
Hi Yang, I saw you posted this question in several places. I answered the timestamp query in HIVE-6394, as I saw that one first. I can't speak about date support, as it's not something I know about. Thanks, Szehon On Mon, Feb 2, 2015 at 4:15 PM, Yang tedd...@gmail.com wrote: the

Re: Which [open-source] SQL engine atop Hadoop?

2015-02-03 Thread Samuel Marks
Thanks Devopam, In my initial post I did mention Presto, with this review: can query Hive, Cassandra http://cassandra.apache.org/, relational DBs etc. Doesn't seem to be designed for low-latency responses across small clusters, or to support UPDATE operations. It is optimized for data warehousing or

Hive - regexp_replace function for multiple strings

2015-02-03 Thread Viral Parikh
Hi Everyone, I am using Hive 0.13! I want to find multi-word tokens like "hip hop" and "rock music" in my data and replace them with "hiphop" and "rockmusic", that is, the same tokens without the white space. I have used the regexp_replace function in Hive. Below is my query and it works great for above 2

failed to create an external hive table on parquet files (hive 0.14)

2015-02-03 Thread Yang
We want to use the Timestamp feature, so we have to use 0.14 according to this: https://cwiki.apache.org/confluence/display/Hive/Parquet For 0.13 and above: CREATE TABLE parquet_test ( x int ) STORED AS PARQUET location '/tmp/myparquet_files'; Creating the table works fine, but select * from

Re: Hive - regexp_replace function for multiple strings

2015-02-03 Thread Pradeep Gollakota
I don't think this is doable using the out-of-the-box regexp_replace() UDF. The way I would do it is to use a file to create a mapping between each regexp and its replacement, and write a custom UDF that loads this file and applies all the regular expressions to the input. Hope this helps. On Tue, Feb
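
The mapping-file idea in this reply can be sketched outside Hive. The snippet below is only an illustration in Python of the logic such a custom UDF might carry, not the UDF itself; the tab-separated file layout, the example patterns, and the names load_rules and apply_rules are assumptions and do not come from the thread.

```python
import re

def load_rules(lines):
    """Parse 'pattern<TAB>replacement' lines into (compiled regex, replacement) pairs.

    The tab-separated layout is an assumed format for the mapping file.
    """
    rules = []
    for line in lines:
        line = line.rstrip("\n")
        if not line or line.startswith("#"):
            continue  # skip blank lines and comments
        pattern, replacement = line.split("\t", 1)
        rules.append((re.compile(pattern), replacement))
    return rules

def apply_rules(text, rules):
    """Apply every rule to the text, in the order the rules were loaded."""
    for regex, replacement in rules:
        text = regex.sub(replacement, text)
    return text

# Hypothetical mapping-file contents, inlined here so the sketch is self-contained.
MAPPING_LINES = [
    "hip\\s+hop\thiphop",
    "rock\\s+music\trockmusic",
]
RULES = load_rules(MAPPING_LINES)

if __name__ == "__main__":
    print(apply_rules("I like hip hop and rock music", RULES))
```

Rules are applied sequentially, so a later pattern sees the output of earlier ones; the order of lines in the mapping file therefore matters when patterns can overlap.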

Re: Which [open-source] SQL engine atop Hadoop?

2015-02-03 Thread Devopam Mittra
Hi Samuel, Apologies for the delay in responding, as well as for overlooking the Presto mention in your initial post. #IMHO: Presto is lightweight, and easy to install and configure. It does not support UPDATE .. hmm, I don't need updates in Big Data analytics where I can have a temp / intermediate

Re: Hive - regexp_replace function for multiple strings

2015-02-03 Thread Devopam Mittra
Hi Viral, Unless you are strictly bound to change the text to achieve your objectives... you may actually wish to explore ngrams and context_ngram combinations to uniquely identify the patterns you want to explore, and move them to a new table for further processing. Better do it at file level on