[ 
https://issues.apache.org/jira/browse/NUTCH-1954?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14351387#comment-14351387
 ] 

Chris A. Mattmann commented on NUTCH-1954:
------------------------------------------

Here's the error I got running it on the NSF ACADIS/Polar data set from my 
class:

{noformat}
[mattmann@nsfpolardata local]$ ./bin/nutch commoncrawldump -outputDir out -segment /home/mattmann/polar-data/apache-nutch-1.9/bin/AcadisCrawl2/segments/
java.io.FileNotFoundException: out/redirect.html?link=http%3a%2f%2fdataportal.ucar.edu%2fmetadata%2fcadis%2fTerrestrial_Ecosystems%2fArctic_Ecosystem_Changes%2fBarrow_Atqasuk_ITEX_Detailed_Microclimate%2f1998-20XX%2520Barrow%2520Atqasuk%2520ITEX%2520Detailed%2520Microclimate%2520metadata.doc (File name too long)
        at java.io.FileOutputStream.open(Native Method)
        at java.io.FileOutputStream.<init>(FileOutputStream.java:221)
        at java.io.FileOutputStream.<init>(FileOutputStream.java:171)
        at org.apache.nutch.tools.CommonCrawlDataDumper.dump(CommonCrawlDataDumper.java:372)
        at org.apache.nutch.tools.CommonCrawlDataDumper.main(CommonCrawlDataDumper.java:235)
{noformat}
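
For reference, the output file name in the trace is built from the URL's last path segment plus its query string, which is longer than the per-name limit on the filesystem (commonly 255 bytes on Linux). A digest-based name avoids that. Below is a minimal sketch of the idea, not the actual NUTCH-1950 patch: the class name, method names, and the 255-byte constant are assumptions made for illustration.

```java
import java.math.BigInteger;
import java.nio.charset.StandardCharsets;
import java.security.MessageDigest;
import java.security.NoSuchAlgorithmException;

// Hypothetical helper, not part of Nutch: shortens over-long file names
// by replacing them with the MD5 hex digest of the original name.
final class DigestFileNames {

    // Assumed per-name limit in bytes; the real limit depends on the filesystem.
    static final int MAX_NAME_BYTES = 255;

    private DigestFileNames() {
    }

    // Returns the MD5 digest of the input as 32 lowercase hex characters.
    public static String md5Hex(String input) {
        try {
            MessageDigest md = MessageDigest.getInstance("MD5");
            byte[] digest = md.digest(input.getBytes(StandardCharsets.UTF_8));
            // Zero-pad so leading zero bytes are not dropped.
            return String.format("%032x", new BigInteger(1, digest));
        } catch (NoSuchAlgorithmException e) {
            // MD5 is required on every Java platform, so this is unexpected.
            throw new IllegalStateException("MD5 not available", e);
        }
    }

    // Keeps names that fit within the limit; otherwise substitutes the digest.
    public static String safeName(String name) {
        int length = name.getBytes(StandardCharsets.UTF_8).length;
        return length <= MAX_NAME_BYTES ? name : md5Hex(name);
    }
}
```

With something like this, the dumper could pass each candidate name through `DigestFileNames.safeName(...)` before opening the `FileOutputStream`. The digest is deterministic, so the same URL always maps to the same file, though the original name is no longer readable from the output directory.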


> FilenameTooLong error appears in CommonCrawlDumper
> --------------------------------------------------
>
>                 Key: NUTCH-1954
>                 URL: https://issues.apache.org/jira/browse/NUTCH-1954
>             Project: Nutch
>          Issue Type: Bug
>          Components: commoncrawl
>            Reporter: Chris A. Mattmann
>            Assignee: Chris A. Mattmann
>             Fix For: 1.10
>
>
> The FilenameTooLong issue from NUTCH-1950 also appears in the 
> CommonCrawlDumper tool. I'm going to apply the same fix here (based on an 
> MD5 message digest).


