[ https://issues.apache.org/jira/browse/MAPREDUCE-1461?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Rajesh Balamohan updated MAPREDUCE-1461:
----------------------------------------

    Attachment: mr-1461-trunk-with-testcases.patch

Attaching the patch with a negative test case as well.

> Feature to instruct rumen-folder utility to skip jobs worth of specific duration
> --------------------------------------------------------------------------------
>
>                 Key: MAPREDUCE-1461
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1461
>             Project: Hadoop Map/Reduce
>          Issue Type: Improvement
>          Components: tools/rumen
>    Affects Versions: 0.23.0
>            Reporter: Rajesh Balamohan
>         Attachments: MR-1461-trunk.patch, mapreduce-1461--2010-02-05.patch, mapreduce-1461--2010-03-04.patch, mr-1461-trunk-with-testcases.patch
>
>
> JSON outputs of rumen on production logs can be huge, on the order of multiple GB. Rumen's folder utility helps in getting a smaller snapshot of this JSON data.
> It would be helpful to have an option in rumen-folder whereby the user can specify a duration from which rumen-folder should start processing data.
> Related JIRA link: https://issues.apache.org/jira/browse/MAPREDUCE-1295

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira
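
The behaviour requested in the issue description (start processing only after a given duration has elapsed from the beginning of the trace) can be illustrated with a minimal sketch. This is not the attached patch and not Rumen's actual API: the `Job` record, its `submit_time_ms` field, and the `skip_initial_duration` function are hypothetical names chosen for illustration, and the sketch assumes that jobs are read in ascending submit-time order and that the duration is measured from the first job's submit time.

```python
from dataclasses import dataclass
from typing import Iterable, Iterator, Optional


@dataclass
class Job:
    """Hypothetical stand-in for one job record in a rumen JSON trace."""
    job_id: str
    submit_time_ms: int


def skip_initial_duration(jobs: Iterable[Job], skip_ms: int) -> Iterator[Job]:
    """Yield only jobs submitted at least skip_ms after the first job.

    Assumption: jobs arrive in ascending submit-time order, so the first
    job seen marks the start of the trace.
    """
    start_ms: Optional[int] = None
    for job in jobs:
        if start_ms is None:
            start_ms = job.submit_time_ms  # start of the trace
        if job.submit_time_ms - start_ms >= skip_ms:
            yield job


# Three jobs submitted 0 s, 60 s and 124 s after the start of the trace.
trace = [Job("job_1", 1_000), Job("job_2", 61_000), Job("job_3", 125_000)]

# Skip the first two minutes' worth of jobs.
kept = [job.job_id for job in skip_initial_duration(trace, skip_ms=120_000)]
print(kept)  # prints ['job_3']
```

Because the filter is a generator, it processes the trace one record at a time, which matters when the input is several GB and cannot reasonably be held in memory.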