Hello all,
I am getting some odd behavior from hadoop which seems like a bug. I
have created a custom input format, and I am observing that my
"getSplits" method is being called twice. Each call is on a different
instance of the input format. The job, however, is only run once,
using the result from the second call to getSplits. The first call
receives the numSplits hint as expected, while in the second call that
value is overriden to 1. I am running hadoop in standalone mode. Does
anyone know anything about this issue?
Thanks,
Nathan Marz
Rapleaf