[ 
https://issues.apache.org/jira/browse/HADOOP-2342?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12548353
 ] 

Doug Cutting commented on HADOOP-2342:
--------------------------------------

> which suggests that writing checksums is pretty expensive

But reading checksums does not seem to be too expensive, which is nice to see.  
However HDFS reads are *much* slower than local reads, which is worrisome.  
That seems to be the biggest outlier in your data: checksums add ~25%, while 
non-local reads adds ~90%.

> create a micro-benchmark for measure local-file versus hdfs read
> ----------------------------------------------------------------
>
>                 Key: HADOOP-2342
>                 URL: https://issues.apache.org/jira/browse/HADOOP-2342
>             Project: Hadoop
>          Issue Type: Test
>          Components: dfs
>            Reporter: Owen O'Malley
>            Assignee: Owen O'Malley
>             Fix For: 0.16.0
>
>
> We should have a benchmark that measures reading a 10g file from hdfs and 
> from local disk.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to