Hi, AFAIK, it is a hint. Depending on the block size, minimum split size and this hint the exact number of splits is computed. So if you have total_size/hint < block size but greater than min split size, you should see the exact number. This is how I understand it, please let me know if I'm going wrong.
Amogh On 3/22/10 12:33 AM, "Gang Luo" <lgpub...@yahoo.com.cn> wrote: Hi all, in InputFormat.getSplits(JobConf, splitNum), I think the splitNum should be a hint. The number of splits is equal to the numbers of mappers working on that file. But I do get the same number of splits as indicated by splitNum, and the sum of the split length is the length of that file. It seems the splitNum here is not a hint. Is it a bug, or did I do something wrong? Thanks, -Gang