I think, they are not overridden every times. If you do not give any configuration in your source code, the hadoop-site.xml will helps you configure the framework. At the same time, you will not configure all the parameters of hadoop framework in your program, then hadoop-site.xml helps.
On Tue, May 19, 2009 at 9:46 AM, Foss User <foss...@gmail.com> wrote: > On Tue, May 19, 2009 at 8:04 PM, He Chen <air...@gmail.com> wrote: > > change following parameter > > mapred.reduce.max.attempts 4 > > mapred.reduce.tasks 1To > > mapred.reduce.max.attempts 2 > > mapred.reduce.tasks 2 > > In your program source code! > > If these parameters in hadoop-site.xml is always going to be > overridden, then what is the use of having these properties in > hadoop-site.xml? > > I don't want to put these properties in my program source code so that > the program can be run in any cluster, large size or medium size. I > want these configurations to be mentioned in the cluster. > > Can someone tell me why the job XML has different properties than what > I have specified in hadoop-site.xml? Who overrides it and why? > -- Chen He RCF CSE Dept. University of Nebraska-Lincoln US