Rolling mechanism for demux output
----------------------------------
Key: HADOOP-4462
URL: https://issues.apache.org/jira/browse/HADOOP-4462
Project: Hadoop Core
Issue Type: Improvement
Components: contrib/chukwa
Reporter: Jerome Boulon
Assignee: Jerome Boulon
In order to reduce the number of files on HDFS, we need a rolling
mechanism for the demux output:
- avoid immediate merging if there is already a file for the same time range;
create a spill file instead
- merge all raw files every hour
- merge all hourly files every day
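
The three steps above can be sketched roughly as follows. This is only an
illustration of the naming and grouping logic, not Chukwa's implementation:
the function names (output_name, roll), the file suffixes (.raw, .spillN.raw,
.hourly, .daily) and the YYYY-MM-DD_HH bucket format are all assumptions made
for the example, and no HDFS calls are involved.

```python
from collections import defaultdict


def output_name(existing, bucket):
    """Pick a name for new demux output in `bucket` (hypothetical 'YYYY-MM-DD_HH').

    If a file for the same time range already exists, return a spill
    name instead of merging into it right away.
    """
    primary = f"{bucket}.raw"
    if primary not in existing:
        return primary
    n = 1
    while f"{bucket}.spill{n}.raw" in existing:
        n += 1
    return f"{bucket}.spill{n}.raw"


def roll(files, key_len, suffix):
    """Plan a merge: group names by their first `key_len` characters.

    Returns {merged_name: [input names]}; the actual merge is left out.
    """
    groups = defaultdict(list)
    for name in sorted(files):
        groups[name[:key_len] + suffix].append(name)
    return dict(groups)


# Three demux runs, two of them landing in the same hour.
existing = set()
for bucket in ["2008-10-20_14", "2008-10-20_14", "2008-10-20_15"]:
    existing.add(output_name(existing, bucket))

hourly = roll(existing, 13, ".hourly")  # 13 chars = 'YYYY-MM-DD_HH'
daily = roll(hourly, 10, ".daily")      # 10 chars = 'YYYY-MM-DD'
```

Here the second run for hour 14 becomes a spill file; the hourly roll then
folds it into a single hourly file, and the daily roll folds both hourly
files into one daily file.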