[ 
https://issues.apache.org/jira/browse/HDFS-941?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12862854#action_12862854
 ] 

bc Wong commented on HDFS-941:
------------------------------

The variance is large on the tests. But they show that the patch isn't slower 
than trunk. Tests executed on a 5 node cluster:

* TestDFSIO -read -fileSize 512 -bufferSize 4096 -nrFiles 10

||-||trunk||patched||
|Num trials|6|5|
|Throughput (MB/s)|92|93|
|Avg IO (MB/s)|150|134|
|Std dev|122|77|

* TestDFSIO -read -fileSize 512 -bufferSize 4096 -nrFiles 20

||-||trunk||patched||
|Num trials|5|5|
|Throughput (MB/s)|78|83|
|Avg IO (MB/s)|114|121|
|Std dev|75|76|

* Distributed {{bin/hadoop fs -cat /benchmarks/TestDFSIO/io_data/test_io_$i > 
/dev/null}}, for i in [0,9]

||-||trunk||patched||
|Num trials|5|5|
|Avg time (sec)|47.8|48.0|
|Std dev|4.2|3.6|

> Datanode xceiver protocol should allow reuse of a connection
> ------------------------------------------------------------
>
>                 Key: HDFS-941
>                 URL: https://issues.apache.org/jira/browse/HDFS-941
>             Project: Hadoop HDFS
>          Issue Type: Improvement
>          Components: data-node, hdfs client
>    Affects Versions: 0.22.0
>            Reporter: Todd Lipcon
>            Assignee: bc Wong
>         Attachments: HDFS-941-1.patch, HDFS-941-2.patch, HDFS-941-3.patch, 
> HDFS-941-3.patch
>
>
> Right now each connection into the datanode xceiver only processes one 
> operation.
> In the case that an operation leaves the stream in a well-defined state (eg a 
> client reads to the end of a block successfully) the same connection could be 
> reused for a second operation. This should improve random read performance 
> significantly.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to