Reading sequence file consumes 100% cpu with maximum throughput being about 
5MB/sec per process
-----------------------------------------------------------------------------------------------

                 Key: HADOOP-2608
                 URL: https://issues.apache.org/jira/browse/HADOOP-2608
             Project: Hadoop
          Issue Type: Improvement
          Components: io
            Reporter: Runping Qi



I ran some tests on the throughput of scanning block-compressed sequence files.
The sustained throughput was capped at about 5 MB/sec per process, with the CPU
usage of each process pegged at 100%.
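
For anyone who wants to reproduce this kind of measurement, here is a minimal
sketch of a single-process scan timer. It is not the test used for the numbers
above, and it does not use Hadoop's SequenceFile reader: as a stand-in it scans
an in-memory gzip stream from java.util.zip, so the figure it prints says
nothing about sequence files. The class name ScanThroughput, the helpers
mbPerSec, scan and gzip, and the 16 MB synthetic payload are all assumptions
made for illustration.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStream;
import java.util.zip.GZIPInputStream;
import java.util.zip.GZIPOutputStream;

// Hypothetical harness: times a full scan of a compressed stream in one process.
public class ScanThroughput {

    /** Converts a byte count and elapsed nanoseconds to MB/sec (1 MB = 2^20 bytes). */
    static double mbPerSec(long bytes, long elapsedNanos) {
        return (bytes / (1024.0 * 1024.0)) / (elapsedNanos / 1e9);
    }

    /** Reads the stream to the end and returns the number of bytes read. */
    static long scan(InputStream in) throws IOException {
        byte[] buf = new byte[64 * 1024];
        long total = 0;
        int n;
        while ((n = in.read(buf)) != -1) {
            total += n;
        }
        return total;
    }

    /** Compresses a byte array with gzip (stand-in for a block-compressed file). */
    static byte[] gzip(byte[] raw) throws IOException {
        ByteArrayOutputStream bos = new ByteArrayOutputStream();
        try (GZIPOutputStream out = new GZIPOutputStream(bos)) {
            out.write(raw);
        }
        return bos.toByteArray();
    }

    public static void main(String[] args) throws IOException {
        // Synthetic payload (assumption): 16 MB of a repeating byte pattern.
        byte[] raw = new byte[16 * 1024 * 1024];
        for (int i = 0; i < raw.length; i++) {
            raw[i] = (byte) (i % 251);
        }
        byte[] compressed = gzip(raw);

        long start = System.nanoTime();
        long scanned;
        try (InputStream in = new GZIPInputStream(new ByteArrayInputStream(compressed))) {
            scanned = scan(in);
        }
        long elapsed = System.nanoTime() - start;

        // Throughput is reported in uncompressed bytes per second.
        System.out.printf("scanned %d bytes at %.1f MB/sec%n",
                scanned, mbPerSec(scanned, elapsed));
    }
}
```

To compare against the figure reported here, the gzip stream would have to be
replaced with a reader over an actual block-compressed sequence file, and CPU
usage would have to be observed separately (for example with top) while the
scan runs.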

It seems to me that the CPU consumption is too high and the throughput is too
low for simply scanning files.


