RE: CombineHiveInputFormat not working

2015-09-30 Thread Ryan Harris
what are your values for: mapred.min.split.size mapred.max.split.size hive.hadoop.supports.splittable.combineinputformat From: Pradeep Gollakota [mailto:pradeep...@gmail.com] Sent: Wednesday, September 30, 2015 2:20 PM To: user@hive.apache.org Subject: CombineHiveInputFormat not working Hi all

CombineHiveInputFormat not working

2015-09-30 Thread Pradeep Gollakota
Hi all, I have an external table with the following DDL. ``` DROP TABLE IF EXISTS raw_events; CREATE EXTERNAL TABLE IF NOT EXISTS raw_events ( raw_event_string string) PARTITIONED BY (dc string, community string, dt string) STORED AS TEXTFILE LOCATION

RE: CombineHiveInputFormat not working

2015-09-30 Thread Ryan Harris
Also... mapreduce.input.fileinputformat.split.maxsize and, what is the size of your input files? From: Ryan Harris Sent: Wednesday, September 30, 2015 2:37 PM To: 'user@hive.apache.org' Subject: RE: CombineHiveInputFormat not working what are your values for: mapred.min.split.size

Re: CombineHiveInputFormat not working

2015-09-30 Thread Pradeep Gollakota
com> wrote: > Also... > > mapreduce.input.fileinputformat.split.maxsize > > > > and, what is the size of your input files? > > > > *From:* Ryan Harris > *Sent:* Wednesday, September 30, 2015 2:37 PM > *To:* 'user@hive.apache.org' > *Subject:* RE: CombineH

RE: CombineHiveInputFormat not working

2015-09-30 Thread Ryan Harris
: CombineHiveInputFormat not working mapred.min.split.size = mapreduce.input.fileinputformat.split.minsize = 1 mapred.max.split.size = mapreduce.input.fileinputformat.split.maxsize = 134217728 hive.hadoop.supports.splittable.combineinputformat = false My average file size is pretty small... it's

Re: CombineHiveInputFormat not working

2015-09-30 Thread Pradeep Gollakota
..@gmail.com] > *Sent:* Wednesday, September 30, 2015 3:33 PM > *To:* user@hive.apache.org > *Subject:* Re: CombineHiveInputFormat not working > > > > mapred.min.split.size = mapreduce.input.fileinputformat.split.minsize = 1 > mapred.max.split.size = mapreduce.input.fileinput
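
For anyone retracing this thread, the values Ryan asks about can usually be read back from a Hive session by issuing `SET` with a property name and no value. The sketch below only prints settings and changes nothing. Including `hive.input.format` is an assumption on my part (the thread subject suggests it is relevant, but the posters do not mention it in the excerpts above).

```sql
-- Print the current value of each property named in the thread.
-- SET <name>; with no "=value" only reads the setting.
SET mapred.min.split.size;
SET mapred.max.split.size;
SET mapreduce.input.fileinputformat.split.minsize;
SET mapreduce.input.fileinputformat.split.maxsize;
SET hive.hadoop.supports.splittable.combineinputformat;

-- Assumption: also worth checking which input format the session uses.
SET hive.input.format;
```

The deprecated `mapred.*` names and the newer `mapreduce.input.fileinputformat.*` names refer to the same minimum and maximum split sizes, so printing both forms shows whether they agree in a given session.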