[ https://issues.apache.org/jira/browse/NUTCH-2150?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Sebastian Nagel reopened NUTCH-2150: ------------------------------------ Fails with an exception (see [NUTCH-2155|https://issues.apache.org/jira/browse/NUTCH-2155?focusedCommentId=14984379]): {noformat} % nutch protocolstats usage: ProtocolStatistics <crawl db> <output dir> [numOfReducer] % nutch protocolstats ./crawl/crawldb/ protocol_stats ./crawl/crawldb/ protocol_stats Exception in thread "main" java.io.FileNotFoundException: File file:.../crawl/crawldb/old/data does not exist at org.apache.hadoop.fs.RawLocalFileSystem.deprecatedGetFileStatus(RawLocalFileSystem.java:511) {noformat} > Add ProtocolStatus Utility > -------------------------- > > Key: NUTCH-2150 > URL: https://issues.apache.org/jira/browse/NUTCH-2150 > Project: Nutch > Issue Type: Improvement > Components: util > Affects Versions: 1.10 > Reporter: Michael Joyce > Assignee: Chris A. Mattmann > Fix For: 1.11 > > > It would be nice to have a utility for dumping protocol status code > information for a crawl database. This will be a utility for getting a dump > of the protocol status codes that builds off of NUTCH-2129 -- This message was sent by Atlassian JIRA (v6.3.4#6332)