[ https://issues.apache.org/jira/browse/NUTCH-2165?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15000923#comment-15000923 ]
Michael Joyce commented on NUTCH-2165:
--------------------------------------

Note that the diff looks massive, but the change really just adds an extra loop over the part directories in each segment directory. The tool could probably use a bit of cleanup love, but we can address that in a later patch.

> FileDumper Util hard codes part-# folder name
> ---------------------------------------------
>
>                 Key: NUTCH-2165
>                 URL: https://issues.apache.org/jira/browse/NUTCH-2165
>             Project: Nutch
>          Issue Type: Bug
>          Components: tool
>    Affects Versions: 2.3, 1.10
>            Reporter: Michael Joyce
>            Assignee: Michael Joyce
>             Fix For: 2.4, 1.11
>
>         Attachments: NUTCH-2165_joyce_11Nov2015.patch
>
>
> Hi folks, [~lewismc] and I were just discussing this off list. It seems that
> the part-##### folder name is hard coded to part-00000 in the [FileDumper
> utility|https://github.com/apache/nutch/blob/trunk/src/java/org/apache/nutch/tools/FileDumper.java#L166-L167],
> which could prove problematic when a segment contains more than one part folder.
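
For readers following along, the "extra loop over the part directories" idea can be sketched roughly as below. This is not the attached patch: the class and method names (PartDirLister, listPartDirs) are hypothetical, the sketch uses plain java.nio on a local filesystem rather than whatever file APIs FileDumper itself uses, and the <segment>/content/part-NNNNN/data layout is an assumption based on the part-00000 path mentioned in the issue.

```java
import java.io.IOException;
import java.nio.file.DirectoryStream;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;

// Hypothetical helper, for illustration only (not part of Nutch).
public class PartDirLister {

    // Returns every "part-*" subdirectory of the given directory, sorted by
    // path, instead of assuming that only part-00000 exists.
    static List<Path> listPartDirs(Path segmentContentDir) throws IOException {
        List<Path> parts = new ArrayList<>();
        try (DirectoryStream<Path> stream =
                 Files.newDirectoryStream(segmentContentDir, "part-*")) {
            for (Path candidate : stream) {
                // Skip plain files that happen to match the glob.
                if (Files.isDirectory(candidate)) {
                    parts.add(candidate);
                }
            }
        }
        Collections.sort(parts);
        return parts;
    }

    public static void main(String[] args) throws IOException {
        if (args.length != 1) {
            System.err.println("usage: PartDirLister <segment>/content");
            return;
        }
        // The extra loop: visit each part directory rather than a fixed one.
        for (Path part : listPartDirs(Paths.get(args[0]))) {
            // Assumed layout: each part directory holds a "data" file.
            System.out.println(part.resolve("data"));
        }
    }
}
```

Run against a segment's content directory, this prints one data path per part folder; a real fix would open each of those with the reader FileDumper already uses.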