[ http://issues.apache.org/jira/browse/HADOOP-451?page=all ]
Doug Cutting updated HADOOP-451:
--------------------------------

    Status: Resolved  (was: Patch Available)
    Resolution: Fixed

I just committed this. Thanks, Owen!

> Add a Split interface
> ---------------------
>
>                 Key: HADOOP-451
>                 URL: http://issues.apache.org/jira/browse/HADOOP-451
>             Project: Hadoop
>          Issue Type: Improvement
>          Components: mapred
>    Affects Versions: 0.9.2
>            Reporter: Doug Cutting
>         Assigned To: Owen O'Malley
>             Fix For: 0.10.0
>
>         Attachments: input-split-2.patch, input-split.patch
>
>
> The InputFormat interface has a method:
>   FileSplit[] getSplits();
> This should change to:
>   Split[] getSplits();
> The Split interface would look like:
>   public interface Split extends Writable {
>     /** Returns a list of hosts that contain this split.
>         This is only used to optimize task placement, so this may be empty. */
>     String[] getLocations(FileSystem fs);
>     /** The relative, estimated cost of operating on this. Typically the size
>         of the data in the split.
>         Used to prioritize tasks in a job (high-cost tasks are run first). */
>     long getCost();
>   }

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators: http://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see: http://www.atlassian.com/software/jira
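
For readers skimming the archive, here is a minimal sketch of how the Split interface quoted above might be implemented and used. It is not taken from the committed patch. Writable and FileSystem are local stand-ins for the Hadoop types so the example compiles on its own, and SplitSketch, SimpleSplit, orderByCost and roundTrip are hypothetical names invented for illustration. The only behavior borrowed from the issue text is that getLocations may return an empty array and that higher-cost splits are meant to run first.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.DataInput;
import java.io.DataInputStream;
import java.io.DataOutput;
import java.io.DataOutputStream;
import java.io.IOException;
import java.io.UncheckedIOException;
import java.util.Arrays;

final class SplitSketch {
    /** Stand-in for Hadoop's Writable so the sketch is self-contained. */
    interface Writable {
        void write(DataOutput out) throws IOException;
        void readFields(DataInput in) throws IOException;
    }

    /** Stand-in for Hadoop's FileSystem; this sketch never calls into it. */
    interface FileSystem { }

    /** The interface as proposed in the issue description. */
    interface Split extends Writable {
        String[] getLocations(FileSystem fs);
        long getCost();
    }

    /** Hypothetical split: a fixed host list plus a byte count used as the cost. */
    static class SimpleSplit implements Split {
        private String[] hosts = new String[0];
        private long length;

        SimpleSplit() { }  // no-arg constructor so readFields can populate the fields

        SimpleSplit(String[] hosts, long length) {
            this.hosts = hosts.clone();
            this.length = length;
        }

        public String[] getLocations(FileSystem fs) { return hosts.clone(); }

        public long getCost() { return length; }

        public void write(DataOutput out) throws IOException {
            out.writeLong(length);
            out.writeInt(hosts.length);
            for (String host : hosts) {
                out.writeUTF(host);
            }
        }

        public void readFields(DataInput in) throws IOException {
            length = in.readLong();
            hosts = new String[in.readInt()];
            for (int i = 0; i < hosts.length; i++) {
                hosts[i] = in.readUTF();
            }
        }
    }

    /** Returns a copy of the input sorted so the highest-cost split comes first. */
    static Split[] orderByCost(Split[] splits) {
        Split[] ordered = splits.clone();
        Arrays.sort(ordered, (a, b) -> Long.compare(b.getCost(), a.getCost()));
        return ordered;
    }

    /** Serializes a split to bytes and reads it back into a fresh instance. */
    static SimpleSplit roundTrip(SimpleSplit split) {
        try {
            ByteArrayOutputStream bytes = new ByteArrayOutputStream();
            split.write(new DataOutputStream(bytes));
            SimpleSplit copy = new SimpleSplit();
            copy.readFields(new DataInputStream(new ByteArrayInputStream(bytes.toByteArray())));
            return copy;
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    public static void main(String[] args) {
        Split[] splits = {
            new SimpleSplit(new String[] {"host-a"}, 10L),
            new SimpleSplit(new String[0], 500L),  // empty location list is allowed
            new SimpleSplit(new String[] {"host-b", "host-c"}, 120L),
        };
        for (Split split : orderByCost(splits)) {
            System.out.println(split.getCost() + " bytes on "
                + split.getLocations(null).length + " known host(s)");
        }
    }
}
```

Because the interface only asks for locations and a cost, a scheduler written against it would not need to know whether a split is backed by a file; that is an inference from the description above rather than a statement about the patch.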