how to enable serde property hive.serialization.extend.nesting.levels for CTAS statment

2015-05-14 Thread Jie Zhang
Hi, experts, My application uses a CTAS query to create a result table in hive, the source table has deeply nested struct column (7 levels). CTAS query fails with the following exception. jdbc:hive2://localhost:1/default CREATE TABLE IF NOT EXISTS reporting.test1 AS select row_number()

Re: ACID ORC file reader issue with uncompacted data

2015-05-14 Thread Alan Gates
Ok, I think I understand now. I also get why OrcSplit.getPath returns just up to the partition keys and not the delta directories. In most cases there will be more than one delta directory, so which one would it pick? It seems you already know the file type you are working on before you

Re: HCatInputFormat combine splits

2015-05-14 Thread Pradeep Gollakota
The following property has been to no effect. mapreduce.input.fileinputformat.split.maxsize = 67108864 I'm still getting 1 Mapper per file. On Thu, May 14, 2015 at 10:27 AM, Ankit Bhatnagar ank...@yahoo-inc.com wrote: you can explicitly set the split size On Wednesday, May 13, 2015

Re: HCatInputFormat combine splits

2015-05-14 Thread Ankit Bhatnagar
try thesemapred.max.split.size= mapred.min.split.size=   mapreduce.input.fileinputformat.split.maxsize= mapreduce.input.fileinputformat.split.minsize=   On Thursday, May 14, 2015 11:04 AM, Pradeep Gollakota pradeep...@gmail.com wrote: The following property has been to no effect.

hive job not making progress due to Number of reduce tasks is set to 0 since there's no reduce operator

2015-05-14 Thread Bhagwan S. Soni
Hi Hive Users, I'm using Cloudera distribution and Hive's 13th version on my cluster. I came across a problem where job is not making any progress after writing log line - *Number of reduce tasks is set to 0 since there's no reduce operator* Below is the log for the same, could you help me what

Re: Unable to create table in Hive

2015-05-14 Thread Ibrar Ahmed
Hive : 0.13 Hbase: 1.0.1 On Fri, May 15, 2015 at 1:26 AM, kulkarni.swar...@gmail.com kulkarni.swar...@gmail.com wrote: Hi Ibrar, It seems like your hive and hbase versions are incompatible. What version of hive and hbase are you on? On Thu, May 14, 2015 at 3:21 PM, Ibrar Ahmed

Re: Unable to create table in Hive

2015-05-14 Thread kulkarni.swar...@gmail.com
Yeah. 0.13 isn't compatible with 1.0 HBase. We haven't made the jump the HBase 1.0 yet. But Hive 1.1 is on HBase 0.98. And from what I know, there aren't many breaking changes from 0.98 to 1.0 so you might give that a shot a see if it works. On Thu, May 14, 2015 at 3:30 PM, Ibrar Ahmed

Unable to create table in Hive

2015-05-14 Thread Ibrar Ahmed
Hi, While creating a table in Hive I am getting this error message. CREATE TABLE abcd(key int, value string) STORED BY 'org.apache.hadoop.hive.hbase.HBaseStorageHandler' WITH SERDEPROPERTIES (hbase.columns.mapping = :key,cf1:val) TBLPROPERTIES (hbase.table.name = xyz); [Hive Error]: Query

Re: Unable to create table in Hive

2015-05-14 Thread Ibrar Ahmed
I have also tried ADD FILE /usr/local/hbase/conf/hbase-site.xml; ADD JAR /usr/local/hive/lib/zookeeper-3.4.5.jar; ADD JAR /usr/local/hive/lib/hive-hbase-handler-0.13.0.jar; ADD JAR /usr/local/hive/lib/guava-11.0.2.jar; ADD JAR /usr/local/hbase/lib/hbase-client-1.0.1.jar; ADD JAR

Re: Unable to create table in Hive

2015-05-14 Thread kulkarni.swar...@gmail.com
Hi Ibrar, It seems like your hive and hbase versions are incompatible. What version of hive and hbase are you on? On Thu, May 14, 2015 at 3:21 PM, Ibrar Ahmed ibrar.ah...@gmail.com wrote: Hi, While creating a table in Hive I am getting this error message. CREATE TABLE abcd(key int, value

Partition Columns

2015-05-14 Thread Appan Thirumaligai
Hi, I have a question on Hive Optimizer. I have a table with partition columns eg.,Sales partitioned by year, month, day. Assume that I have two years worth of data on this table. I'm running two queries on this table. Query 1: Select * from Sales where year=2015 and month = 5 and day between 1

Re: Partition Columns

2015-05-14 Thread Viral Bajaria
Hi Appan, In my experience I have seen that Query 2 does not use partition pruning because it's not a straight up filtering and involves using functions (aka UDFs). What version of Hive are you using ? Thanks, Viral On Thu, May 14, 2015 at 1:48 PM, Appan Thirumaligai appanhiv...@gmail.com

Re: Partition Columns

2015-05-14 Thread Mungeol Heo
Hi, Appan. you can just simply check the amount of data your query reads from the table. or the number of the mapper for running that query. then, you can know whether it filtering or scanning all table. Of course, it is a lazy approach. but, you can give a try. I think query 1 should work fine.

Re: Partition Columns

2015-05-14 Thread Appan Thirumaligai
Mungeol, I did check the # of mappers and that did not change between the two queries but when I ran a count(*) query the total execution time reduced significantly for Query1 vs Query2. Also, the amount data the query reads does change when the where clause changes. I still can't explain why one

HCatInputFormat combine splits

2015-05-14 Thread Pradeep Gollakota
Hi All, I'm writing an MR job to read data using HCatInputFormat... however, the job is generating too many splits. I don't have this problem when running queries in Hive since it combines splits by default. Is there an equivalent in MR so that I'm not generating thousands of mappers? Thanks,

Re: Hive/Hbase Integration issue

2015-05-14 Thread Ibrar Ahmed
Now my hbase is working fine now, but i am still getting the same error [127.0.0.1:1] hive CREATE TABLE hbase_table_1(key int, value string) STORED BY 'org.apache.hadoop.hive.hbase. HBaseStorageHandler' WITH SERDEPROPERTIES (hbase.columns.mapping

user matching query does not exist

2015-05-14 Thread amit kumar
Hi, After re-create my account in Hue, i receives “User matching query does not exist” when attempting to perform hive query. The query is succeed in hive command line. Please suggest on this, Thanks you Amit