[ https://issues.apache.org/jira/browse/NUTCH-1998?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Chris A. Mattmann resolved NUTCH-1998.
--------------------------------------
    Resolution: Fixed

Committed thanks [~gostep]!

{noformat}
[chipotle:~/tmp/nutch-trunk] mattmann% svn commit -m "Commit patch for NUTCH-1988 Add support for user-defined file extension to CommonCrawlDataDumper contributed by Giuseppe Totaro."
Sending        CHANGES.txt
Sending        src/java/org/apache/nutch/tools/CommonCrawlDataDumper.java
Transmitting file data ..
Committed revision 1678520.
[chipotle:~/tmp/nutch-trunk] mattmann%
{noformat}

> Add support for user-defined file extension to CommonCrawlDataDumper
> --------------------------------------------------------------------
>
>                 Key: NUTCH-1998
>                 URL: https://issues.apache.org/jira/browse/NUTCH-1998
>             Project: Nutch
>          Issue Type: Improvement
>          Components: tool
>            Reporter: Giuseppe Totaro
>            Assignee: Chris A. Mattmann
>            Priority: Minor
>              Labels: memex, patch
>             Fix For: 1.11
>
>         Attachments: NUTCH-1998.patch
>
>
> {{CommonCrawlDataDumper}} tool is able to generate CBOR-encoded files,
> extracted from Nutch crawled data, using the Common Crawl format. By default,
> {{CommonCrawlDataDumper}} uses the original file extension.
> We are going to add support for a command-line option (e.g., {{-extension}})
> that allows the user to provide a file extension to use in place of the
> original one.

--
This message was sent by Atlassian JIRA
(v6.3.4#6332)