[ https://issues.apache.org/jira/browse/HADOOP-11569?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Vinayakumar B updated HADOOP-11569: ----------------------------------- Attachment: HADOOP-11569-001.patch Attaching the patch. This will do the merge in below manner. # Read first key/value from all input files to keys/values array # Select the least key and corresponding value # Write the selected key and value to output file # Read the next key/value of selected input # Repeat step 2-4 till all keys are read Please review > Provide Merge API for MapFile to merge multiple similar MapFiles to one > MapFile > ------------------------------------------------------------------------------- > > Key: HADOOP-11569 > URL: https://issues.apache.org/jira/browse/HADOOP-11569 > Project: Hadoop Common > Issue Type: Improvement > Reporter: Vinayakumar B > Assignee: Vinayakumar B > Attachments: HADOOP-11569-001.patch > > > If there are multiple similar MapFiles of the same keyClass and value > classes, then these can be merged together to One MapFile to allow search > easier. > Provide an API similar to {{SequenceFile#merge()}}. > Merging will be easy with the fact that MapFiles are already sorted. -- This message was sent by Atlassian JIRA (v6.3.4#6332)