subject:"What are all the factors that go into the number of mappers - ORC"

Re: What are all the factors that go into the number of mappers - ORC

2014-02-03 Thread John Omernik

No, the size is closer to 10GB; the difference between the tables is only around 2000 bytes. I am traveling right now, but I will try to get you exact numbers and better data to work with shortly. Thanks!

On Mon, Feb 3, 2014 at 12:22 AM, Prasanth Jayachandran < pjayachand...@

Re: What are all the factors that go into the number of mappers - ORC

2014-02-02 Thread Prasanth Jayachandran

Hi John,

The number of mappers is equal to the number of splits generated. The following factors go into split generation:
1) HDFS block size
2) Max split size

A split is cut when:
1) the cumulative size of all adjacent stripes is greater than the HDFS block size
2) the cumulative size of all adj
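To make the split rule in the reply above concrete, here is a minimal Python sketch, not Hive's actual implementation. The first cut condition comes from the message. The second condition is cut off in the archived snippet, so the sketch assumes (as a guess based on the listed factors) that it compares the same cumulative stripe size against the max split size. The function name `count_splits`, the stripe sizes, and the block and split sizes are all hypothetical.

```python
def count_splits(stripe_sizes, block_size, max_split_size):
    """Count splits (and so mappers) for one file's stripes; sizes in bytes."""
    splits = 0
    current = 0  # cumulative size of adjacent stripes in the open split
    for size in stripe_sizes:
        current += size
        # Rule 1 (from the message): cumulative size exceeds the HDFS block size.
        # Rule 2 (ASSUMED; truncated in the source): exceeds the max split size.
        if current > block_size or current > max_split_size:
            splits += 1
            current = 0
    if current > 0:
        splits += 1  # leftover stripes form a final split
    return splits


MB = 1024 * 1024
stripes = [64 * MB] * 10  # hypothetical file: ten 64 MB stripes

# Same data, different block size: the split count changes.
print(count_splits(stripes, block_size=128 * MB, max_split_size=256 * MB))
print(count_splits(stripes, block_size=256 * MB, max_split_size=256 * MB))
```

Under these assumptions, identical data yields a different number of splits when only the block size differs, which is the kind of setting worth comparing between two clusters that assign different mapper counts to the same query.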

What are all the factors that go into the number of mappers - ORC

2014-02-02 Thread John Omernik

I have two clusters, both small dev clusters, and I loaded the same dataset into both of them. The data sizes on disk are within 2000 bytes of each other. Both are ORC; one is Hive 11 and one is Hive 12. One is allocating about 8 more mappers to the exact same query. I am just curious what settings would change


3 matches
