[ https://issues.apache.org/jira/browse/NUTCH-1669?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13995059#comment-13995059 ]
Julien Nioche commented on NUTCH-1669:
--------------------------------------

Hi Rafael,

Looks like this issue went unnoticed. Could you please submit a patch for it? See https://wiki.apache.org/nutch/HowToContribute

Thanks

Julien

> FTP crawl does not use FTP's server root folder
> -----------------------------------------------
>
>                 Key: NUTCH-1669
>                 URL: https://issues.apache.org/jira/browse/NUTCH-1669
>             Project: Nutch
>          Issue Type: Bug
>          Components: protocol
>    Affects Versions: 1.7
>         Environment: Linux Ubuntu
>            Reporter: Rafael Thomas Goz Coutinho
>            Priority: Minor
>              Labels: ftp, ftpclient, protocol
>
> Set up an FTP server with the root folder for a user (let's say test) pointing to
> /home/test/ftphome/
> Then create a folder under it called target containing a test.txt file:
> /home/test/ftphome/target/test.txt
> Configure a URL to crawl with a depth of 1:
> ftp://FTP_SERVER/target/
> The crawl fails because the FTP protocol plugin assumes the path is
> always absolute. It looks in /target/ and not in /home/test/ftphome/target/

--
This message was sent by Atlassian JIRA
(v6.2#6252)
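
The path mismatch described in the report can be illustrated with a minimal Java sketch. It is not the Nutch FTP plugin's code: the class `FtpPathSketch` and the helper `resolveAgainstLoginDir` are hypothetical names introduced here, and the sketch assumes the directory the FTP session starts in after login is known (here /home/test/ftphome/, taken from the report).

```java
public class FtpPathSketch {

    /**
     * Hypothetical helper (not part of Nutch): interprets the path of an
     * ftp:// URL relative to the directory the session starts in after login,
     * instead of treating it as an absolute server path.
     */
    public static String resolveAgainstLoginDir(String loginDir, String urlPath) {
        // Drop a trailing slash from the login directory, if present.
        String base = loginDir.endsWith("/")
                ? loginDir.substring(0, loginDir.length() - 1)
                : loginDir;
        // Drop the leading slash from the URL path so it becomes relative.
        String relative = urlPath.startsWith("/") ? urlPath.substring(1) : urlPath;
        return base + "/" + relative;
    }

    public static void main(String[] args) {
        String urlPath = "/target/";                 // path part of ftp://FTP_SERVER/target/
        String loginDir = "/home/test/ftphome/";     // assumed FTP root for user "test"

        // Behaviour described in the report: the URL path is used as-is.
        System.out.println("treated as absolute: " + urlPath);

        // Behaviour the reporter expects: the path is resolved under the user's FTP root.
        System.out.println("resolved under root: " + resolveAgainstLoginDir(loginDir, urlPath));
    }
}
```

With the values from the report, the first line shows /target/ and the second shows /home/test/ftphome/target/, which is the directory that actually holds test.txt. Whether a real fix should resolve paths this way or simply avoid changing to an absolute directory is left open here.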