[ 
https://issues.apache.org/jira/browse/YARN-7713?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17333483#comment-17333483
 ] 

Eric Badger commented on YARN-7713:
-----------------------------------

Thanks for taking this up, [~ChrisKarampeazis]. I noticed that you weren't a 
contributor in JIRA yet so I've added you as one. You may now assign JIRAs to 
yourself in all of the Hadoop projects (YARN, Common, HDFS, MapReduce).

In general the PR looks good, but I think it would be worthwhile, and not too 
difficult, to sort the list of files to be localized by file size and then 
split the list into chunks based on that. That way we don't end up with one 
thread downloading 4 files of 2 KB each while another thread downloads 4 files 
of 4 GB each.
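
A minimal sketch of one way to do the size-based split, not the code in the 
PR. It assumes a greedy strategy: sort the files largest first, then always 
hand the next file to the chunk with the fewest bytes so far. The names 
SizeBalancedChunks, FileEntry, Chunk and split are hypothetical and do not 
exist in FSDownload.

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.List;
import java.util.PriorityQueue;

/** Hypothetical helper for illustration; not part of FSDownload or the PR. */
class SizeBalancedChunks {

  /** Hypothetical stand-in for a file to localize: a path and its length in bytes. */
  public static final class FileEntry {
    public final String path;
    public final long size;

    public FileEntry(String path, long size) {
      this.path = path;
      this.size = size;
    }
  }

  /** The files handed to one copy thread, plus their combined size. */
  public static final class Chunk {
    public final List<FileEntry> files = new ArrayList<>();
    public long totalBytes = 0;

    void add(FileEntry f) {
      files.add(f);
      totalBytes += f.size;
    }
  }

  /**
   * Sorts files by size (largest first) and assigns each one to the chunk
   * that currently holds the fewest bytes, so chunk totals stay close.
   */
  public static List<Chunk> split(List<FileEntry> files, int numChunks) {
    if (numChunks <= 0) {
      throw new IllegalArgumentException("numChunks must be positive");
    }
    List<Chunk> chunks = new ArrayList<>();
    // Min-heap keyed on bytes assigned so far.
    PriorityQueue<Chunk> lightest =
        new PriorityQueue<>(Comparator.comparingLong((Chunk c) -> c.totalBytes));
    for (int i = 0; i < numChunks; i++) {
      Chunk c = new Chunk();
      chunks.add(c);
      lightest.add(c);
    }
    // Copy before sorting so the caller's list is left untouched.
    List<FileEntry> sorted = new ArrayList<>(files);
    sorted.sort(Comparator.comparingLong((FileEntry f) -> f.size).reversed());
    for (FileEntry f : sorted) {
      // Remove the lightest chunk, grow it, then re-insert it so the heap
      // sees the updated total.
      Chunk c = lightest.poll();
      c.add(f);
      lightest.add(c);
    }
    return chunks;
  }
}
```

With the example above (4 files of 2 KB, 4 files of 4 GB, 2 threads), each 
chunk ends up with two 4 GB files and two 2 KB files, so both threads copy the 
same number of bytes. The greedy assignment is a heuristic and does not 
guarantee an optimal split for arbitrary size distributions.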

> Add parallel copying of directories into FSDownload
> ---------------------------------------------------
>
>                 Key: YARN-7713
>                 URL: https://issues.apache.org/jira/browse/YARN-7713
>             Project: Hadoop YARN
>          Issue Type: Improvement
>            Reporter: Miklos Szegedi
>            Assignee: Christos Karampeazis-Papadakis
>            Priority: Major
>              Labels: newbie, pull-request-available
>          Time Spent: 50m
>  Remaining Estimate: 0h
>
> YARN currently copies directories sequentially when localizing. This could be 
> improved by copying in parallel, since the source blocks are normally on 
> different nodes.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)