Hey, I have a MapReduce job (a transactions loader), and its main problem is the "reduce->copy" and "reduce->sort" phases, which take all the I/O and use all the disk resources. What are the possible ways to reduce this load? My cloud settings are:
ioSortFactor=80, ioSortMb=800 (mapredChildJavaOpts=Xmx1152m). I can lower those settings, but what else can I tweak? Thanks, Marek M.