Hi hadoop devs,

I'm implementing a custom input format and want to understand how to make
use of data locality.

AFAIU, only file input format makes use of data locality since the job
tracker picks data locality based on the block location defined in the file
input split.

So, the job tracker code is partly responsible for this. So providing data
locality for a custom input format would be to either either extend file
input format or modify job tracker code (if that makes sense even).

Is my understanding correct?

-- 
Regards,

Tharindu

blog: http://mackiemathew.com/

Reply via email to