Hi, In our job we need to process the data in small chunks, to avoid GC pressure and related issues. For this we are using the old Hadoop API, since it lets us specify a parameter like minPartitions.
Does anyone know if there is a way to do the same via the new Hadoop API as well, and how that approach would differ from the old one? I am somewhat aware of the split-size settings, but I am not sure whether they actually guarantee that a minimum number of partitions is produced. Any pointers would be helpful. Thanks, Twinkle
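For context, here is a sketch of the two approaches as I understand them, assuming Spark's Scala API. The path and split size below are hypothetical, and the exact split-size property name depends on the Hadoop version (`mapred.max.split.size` on older Hadoop, `mapreduce.input.fileinputformat.split.maxsize` on Hadoop 2+), so please treat this as an untested illustration rather than a verified recipe:

```scala
import org.apache.hadoop.conf.Configuration
import org.apache.hadoop.io.{LongWritable, Text}

// Old (mapred) API: minPartitions is passed through to
// InputFormat.getSplits as a hint for the number of splits.
val oldRdd = sc.hadoopFile(
  "hdfs:///data/input",                       // hypothetical path
  classOf[org.apache.hadoop.mapred.TextInputFormat],
  classOf[LongWritable],
  classOf[Text],
  minPartitions = 100)

// New (mapreduce) API: newAPIHadoopFile has no minPartitions argument.
// Instead, cap the split size in the Hadoop Configuration so that
// FileInputFormat produces roughly ceil(fileSize / maxSplitSize)
// splits per file.
val conf = new Configuration(sc.hadoopConfiguration)
conf.set("mapreduce.input.fileinputformat.split.maxsize",  // Hadoop 2+ name
         (64L * 1024 * 1024).toString)        // e.g. 64 MB per split
val newRdd = sc.newAPIHadoopFile(
  "hdfs:///data/input",
  classOf[org.apache.hadoop.mapreduce.lib.input.TextInputFormat],
  classOf[LongWritable],
  classOf[Text],
  conf)
```

Note the difference in guarantees: minPartitions in the old API is only a hint to getSplits, whereas capping the max split size effectively gives a lower bound on the number of partitions for splittable files, since no split can exceed the cap. (This sketch requires a live SparkContext, so it is not runnable standalone.)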