[ https://issues.apache.org/jira/browse/ACCUMULO-3193?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Eric Newton updated ACCUMULO-3193:
----------------------------------
    Fix Version/s:     (was: 1.5.3)
                       (was: 1.6.2)

> bulkImport file rename is a bottleneck
> --------------------------------------
>
>                 Key: ACCUMULO-3193
>                 URL: https://issues.apache.org/jira/browse/ACCUMULO-3193
>             Project: Accumulo
>          Issue Type: Improvement
>          Components: master
>    Affects Versions: 1.5.0, 1.5.1, 1.5.2, 1.6.0, 1.6.1
>         Environment: very large cluster
>            Reporter: Eric Newton
>            Assignee: Eric Newton
>             Fix For: 1.7.0
>
>          Time Spent: 10m
>  Remaining Estimate: 0h
>
> On a very large cluster, importing a few thousand files takes several
> minutes. Most of that time is spent renaming the user's files into the
> accumulo bulk-load directory. In this case, the master is competing against
> the other demands on the NN. The master could adopt the same strategy as the
> file GC, and run the renames in parallel, to push more operations into the NN
> at one time.

--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
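
For illustration only, the parallel-rename idea in the description could look roughly like the sketch below. This is not the actual patch: the class name ParallelRename, the method renameAll, and the thread-count parameter are hypothetical, and java.nio.file on a local filesystem stands in for the HDFS rename calls the master would issue against the NN.

```java
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.concurrent.Callable;
import java.util.concurrent.ExecutionException;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;

// Hypothetical sketch: issue many renames concurrently instead of one at a time.
public class ParallelRename {

  // Moves every key (source) to its value (destination) using a fixed-size
  // thread pool. Returns the number of completed renames, or throws an
  // IOException as soon as a failed rename is observed.
  public static int renameAll(Map<Path, Path> renames, int threads)
      throws IOException, InterruptedException {
    ExecutorService pool = Executors.newFixedThreadPool(threads);
    try {
      List<Future<Void>> pending = new ArrayList<>();
      for (Map.Entry<Path, Path> entry : renames.entrySet()) {
        // Local-filesystem stand-in for an HDFS rename (assumption).
        Callable<Void> task = () -> {
          Files.move(entry.getKey(), entry.getValue());
          return null;
        };
        pending.add(pool.submit(task));
      }

      int done = 0;
      for (Future<Void> future : pending) {
        try {
          future.get(); // blocks until this particular rename has finished
          done++;
        } catch (ExecutionException ex) {
          throw new IOException("rename failed", ex.getCause());
        }
      }
      return done;
    } finally {
      // Already-submitted tasks still run to completion; no new ones are accepted.
      pool.shutdown();
    }
  }
}
```

The pool size bounds how many renames are outstanding at once, so it would presumably need tuning against the other load on the NN rather than being set as high as possible.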