[ 
https://issues.apache.org/jira/browse/HDFS-11234?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=18048693#comment-18048693
 ] 

ASF GitHub Bot commented on HDFS-11234:
---------------------------------------

github-actions[bot] commented on PR #172:
URL: https://github.com/apache/hadoop/pull/172#issuecomment-3704266566

   We're closing this stale PR because it has been open for 100 days with no 
activity. This isn't a judgement on the merit of the PR in any way. It's just a 
way of keeping the PR queue manageable.
   If you feel like this was a mistake, or you would like to continue working 
on it, please feel free to re-open it and ask for a committer to remove the 
stale tag and review again.
   Thanks all for your contribution.




> distcp performance is suboptimal for high bandwidth/high latency setups
> -----------------------------------------------------------------------
>
>                 Key: HDFS-11234
>                 URL: https://issues.apache.org/jira/browse/HDFS-11234
>             Project: Hadoop HDFS
>          Issue Type: Improvement
>          Components: hdfs
>    Affects Versions: 2.7.1
>            Reporter: Suresh Bahuguna
>            Assignee: Suresh Bahuguna
>            Priority: Major
>
> distcp uses a TCP socket with the buffer size fixed at 128K. On a link with 
> very high bandwidth but also very high latency, throughput is therefore quite 
> poor, because TCP stops sending once a buffer's worth of data is 
> unacknowledged and waits for the ACKs. By not setting the socket buffer size 
> and letting the Linux kernel manage it, we should be able to get optimal 
> performance.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

