[ 
https://issues.apache.org/jira/browse/NUTCH-2383?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sebastian Nagel resolved NUTCH-2383.
------------------------------------
    Resolution: Not A Problem

Thanks [~yossi] for reporting this problem. Closing this as it can hardly be 
solved inside Nutch: it's clear that the default value "local" of 
{{mapreduce.framework.name}} does not allow to access hdfs:// paths. It's 
defined in 
[mapred-default.xml|https://hadoop.apache.org/docs/r2.7.2/hadoop-mapreduce-client/hadoop-mapreduce-client-core/mapred-default.xml]
 and should be set appropriately in mapred-site.xml which is not controlled by 
Nutch. It needs to be configured when setting up the Hadoop cluster. Please 
reopen if you see any option to fix this inside Nutch. Thanks!

> Wrong FS exception in Fetcher
> -----------------------------
>
>                 Key: NUTCH-2383
>                 URL: https://issues.apache.org/jira/browse/NUTCH-2383
>             Project: Nutch
>          Issue Type: Bug
>          Components: fetcher
>    Affects Versions: 1.13
>         Environment: Hadoop 2.8 and Hadoop 2.7.2
>            Reporter: Yossi Tamari
>            Priority: Major
>         Attachments: crawl output.txt
>
>
> Running bin/crawl on either Hadoop 2.7.2 or Hadoop 2.8, the Injector and 
> Generator succeed, but the Fetcher throws: 
> {code}java.lang.IllegalArgumentException: Wrong FS: 
> hdfs://localhost:9000/user/root/crawl/segments/20170430084337/crawl_fetch, 
> expected: file:///{code}.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

Reply via email to