Hi,

I am using HBase as a source of my MapReduce jobs.

I recently found out that TableInputFormat automatically splits the input
table so that each region of the table will be assigned to a single Map job.

But what I want to do is to split the input table so that user-specified
lines of row will be assigned to each Mapper.

For example, if I set a certain parameter to 100, then each Mapper will get
100 lines from the input Table.

Is there a method for this kind of operation?
Or do I have to modify the getSplits() of
org.apache.hadoop.hbase.mapreduce.TableInputFormatBase?

Any answer or opinion will be much appreciated!!

Ed

Reply via email to