Reading sequence file consumes 100% cpu with maximum throughput being about
5MB/sec per process
-----------------------------------------------------------------------------------------------
Key: HADOOP-2608
URL: https://issues.apache.org/jira/browse/HADOOP-2608
Project: Hadoop
Issue Type: Improvement
Components: io
Reporter: Runping Qi
I did some tests on the throughput of scanning block-compressed sequence files.
The sustained throughput was bounded at 5MB/sec per process, with the cpu of
each process maxed at 100%.
It seems to me that the cpu consumption is too high and the throughput is too
low for just scanning files.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.