[ 
https://issues.apache.org/jira/browse/ACCUMULO-3193?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Eric Newton updated ACCUMULO-3193:
----------------------------------
    Fix Version/s:     (was: 1.5.3)
                       (was: 1.6.2)

> bulkImport file rename is a bottleneck
> --------------------------------------
>
>                 Key: ACCUMULO-3193
>                 URL: https://issues.apache.org/jira/browse/ACCUMULO-3193
>             Project: Accumulo
>          Issue Type: Improvement
>          Components: master
>    Affects Versions: 1.5.0, 1.5.1, 1.5.2, 1.6.0, 1.6.1
>         Environment: very large cluster
>            Reporter: Eric Newton
>            Assignee: Eric Newton
>             Fix For: 1.7.0
>
>          Time Spent: 10m
>  Remaining Estimate: 0h
>
> On a very large cluster, importing a few thousand files takes several 
> minutes.  Most of that time is spent renaming the user's files into the 
> Accumulo bulk-load directory.  Each rename is a separate request to the HDFS 
> NameNode (NN), so the master is competing with every other demand on the NN.  
> The master could adopt the same strategy as the file GC and run the renames 
> in parallel, pushing more operations to the NN at one time.
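
A minimal sketch of the "run the renames in parallel" idea, not the actual
Accumulo change. It assumes a plain fixed-size thread pool and uses
java.nio.file.Files.move as a stand-in for the HDFS rename call; the class
name ParallelRename, the method renameAll, and the thread count are all
hypothetical and chosen only for illustration.

```java
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.concurrent.ExecutionException;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;

// Hypothetical helper: issues many renames concurrently instead of one at a time.
class ParallelRename {

    /**
     * Moves each key of {@code moves} to its mapped destination using a
     * fixed pool of {@code threads} workers. Returns the sources whose
     * rename failed, so the caller can retry or report them.
     */
    static List<Path> renameAll(Map<Path, Path> moves, int threads)
            throws InterruptedException {
        ExecutorService pool = Executors.newFixedThreadPool(threads);
        Map<Path, Future<?>> pending = new LinkedHashMap<>();
        List<Path> failed = new ArrayList<>();
        try {
            // Submit every rename up front so several are in flight at once.
            for (Map.Entry<Path, Path> e : moves.entrySet()) {
                Path src = e.getKey();
                Path dst = e.getValue();
                // Files.move stands in for the file system's rename call.
                pending.put(src, pool.submit(() -> {
                    Files.move(src, dst);
                    return null;
                }));
            }
            // Wait for all renames and collect the ones that threw.
            for (Map.Entry<Path, Future<?>> e : pending.entrySet()) {
                try {
                    e.getValue().get();
                } catch (ExecutionException ex) {
                    failed.add(e.getKey());
                }
            }
        } finally {
            pool.shutdown();
        }
        return failed;
    }
}
```

A caller would build the source-to-destination map for the whole import and
invoke something like ParallelRename.renameAll(moves, 8). The pool size caps
how many renames are outstanding at once, which matters here because the
point is to push more work at the NN without flooding it.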



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
