Reading sequence file consumes 100% cpu with maximum throughput being about 5MB/sec per process -----------------------------------------------------------------------------------------------
Key: HADOOP-2608 URL: https://issues.apache.org/jira/browse/HADOOP-2608 Project: Hadoop Issue Type: Improvement Components: io Reporter: Runping Qi I did some tests on the throughput of scanning block-compressed sequence files. The sustained throughput was bounded at 5MB/sec per process, with the cpu of each process maxed at 100%. It seems to me that the cpu consumption is too high and the throughput is too low for just scanning files. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.