[ https://issues.apache.org/jira/browse/NUTCH-1448?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13446589#comment-13446589 ]
Hudson commented on NUTCH-1448:
-------------------------------

Integrated in Nutch-nutchgora #334 (See [https://builds.apache.org/job/Nutch-nutchgora/334/])
    NUTCH-1448 Redirected urls should be handled more cleanly (more like an outlink url) (Revision 1379438)

    Result = FAILURE
ferdy :
Files :
* /nutch/branches/2.x/CHANGES.txt
* /nutch/branches/2.x/conf/nutch-default.xml
* /nutch/branches/2.x/src/java/org/apache/nutch/crawl/DbUpdateReducer.java
* /nutch/branches/2.x/src/java/org/apache/nutch/fetcher/FetcherReducer.java
* /nutch/branches/2.x/src/java/org/apache/nutch/parse/ParseUtil.java
* /nutch/branches/2.x/src/java/org/apache/nutch/parse/ParserJob.java

> Redirected urls should be handled more cleanly (more like an outlink url)
> -------------------------------------------------------------------------
>
>                 Key: NUTCH-1448
>                 URL: https://issues.apache.org/jira/browse/NUTCH-1448
>             Project: Nutch
>          Issue Type: Improvement
>            Reporter: Ferdy Galema
>             Fix For: 2.1
>
>         Attachments: nutch-1448.txt
>
>
> This is specifically for Nutch 2.x. Handling a redirect URL like an outlink
> is much cleaner because it makes it simpler to trace how new URLs are added
> to the webpage database. Instant fetching of redirects won't work, but this
> is a small price to pay. (Note that instant fetching currently does not work
> at all, because the http.max.redirect property has no effect.) I will attach
> a patch in the coming days.