Just for reference of others who might see this thread. Jira corresponding to parameter on reduce input limit is MAPREDUCE-2324
On 7/14/12, Harsh J <ha...@cloudera.com> wrote: > Subir, > > On Sat, Jul 14, 2012 at 5:30 PM, Subir S <subir.sasiku...@gmail.com> wrote: >> Harsh, Thanks I think this is what I was looking for. I have 3 related >> questions. >> >> 1.) Will this work in 0.20.2-cdh3u3 > > Yes, will work. (Btw, best to ask CDH-specific questions on the > cdh-u...@cloudera.org lists) > >> 2.) What is the hard limit that you mean? > > If a reducer gets more data than this value, due to the map's outputs > growing large (for any partition), the job will begin to fail. > >> 3.)Can this be applied for streaming? > > Yes, streaming is still MR and this property is for MR (applied during > scheduling, so not streaming/java specific). > > -- > Harsh J >