[ https://issues.apache.org/jira/browse/HADOOP-2705?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12562235#action_12562235 ]

Raghu Angadi commented on HADOOP-2705:
--------------------------------------

The real issue might be the overloaded use of "io.file.buffer.size". If a 
particular compression codec has a more optimal buffer size, shouldn't it use 
that, or something like max(io.file.buffer.size, more_optimal_than_default)? 
There are many other places this might need to be fixed. I think Konstantin 
filed another jira that proposes a different buffer size for Datanode disk 
I/O (and maybe the network). A single io.file.buffer.size can't be tuned to 
be decent for so many different contexts.
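
A minimal sketch of that max(...) idea against the Configuration API; 
chooseBufferSize and LZO_PREFERRED_BUFFER are hypothetical names for 
illustration, not existing Hadoop code:

    import org.apache.hadoop.conf.Configuration;

    public class BufferSizeSketch {
      // Hypothetical codec-preferred size; a real codec would supply its own.
      private static final int LZO_PREFERRED_BUFFER = 64 * 1024;

      // Pick the larger of the configured value and the codec's preference,
      // per max(io.file.buffer.size, more_optimal_than_default) above.
      static int chooseBufferSize(Configuration conf) {
        int configured = conf.getInt("io.file.buffer.size", 4096); // current default
        return Math.max(configured, LZO_PREFERRED_BUFFER);
      }

      public static void main(String[] args) {
        System.out.println(chooseBufferSize(new Configuration()));
      }
    }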


> io.file.buffer.size should default to a value larger than 4k
> ------------------------------------------------------------
>
>                 Key: HADOOP-2705
>                 URL: https://issues.apache.org/jira/browse/HADOOP-2705
>             Project: Hadoop Core
>          Issue Type: Improvement
>          Components: conf
>            Reporter: Chris Douglas
>            Priority: Minor
>             Fix For: 0.16.0
>
>         Attachments: 2705-0.patch
>
>
> Tests using HADOOP-2406 suggest that increasing this to 32k from 4k improves 
> read times for block-compressed, LZO SequenceFiles by over 40%; 32k is a 
> relatively conservative bump.
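
For reference, the proposed bump is a one-line change; a sketch of applying 
the same value as a per-job override (the attached patch presumably changes 
the shipped default instead):

    import org.apache.hadoop.conf.Configuration;

    public class ProposedDefault {
      public static void main(String[] args) {
        Configuration conf = new Configuration();
        conf.setInt("io.file.buffer.size", 32 * 1024); // 32k, up from 4k
        System.out.println(conf.getInt("io.file.buffer.size", 4096)); // 32768
      }
    }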

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
