[ 
https://issues.apache.org/jira/browse/NUTCH-1669?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13995059#comment-13995059
 ] 

Julien Nioche commented on NUTCH-1669:
--------------------------------------

Hi Rafael

Looks like this issue went unnoticed. Could you please submit a patch for it? 
See https://wiki.apache.org/nutch/HowToContribute

Thanks

Julien

> FTP crawl does not use FTP's server root folder
> -----------------------------------------------
>
>                 Key: NUTCH-1669
>                 URL: https://issues.apache.org/jira/browse/NUTCH-1669
>             Project: Nutch
>          Issue Type: Bug
>          Components: protocol
>    Affects Versions: 1.7
>         Environment: Linux Ubuntu
>            Reporter: Rafael Thomas Goz Coutinho
>            Priority: Minor
>              Labels: ftp, ftpclient, protocol
>
> Setup an FTP with root folder setup for a user (let's say test) pointing to 
> /home/test/ftphome/
> And create a folder under it called target with a test.txt file:
> /home/test/ftphome/target/test.txt
> Configure a URL to crawl as with depth of 1:
> ftp://FTP_SERVER/target/
> It will fail to crawl because the FTP plugin protocol assumes the path is 
> always absolute. It will look into /target/ and not /home/test/ftphome/target/



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Reply via email to