When I dump my segments, I often find entries that look like the
following:

<HTML><HEAD>
<TITLE>301 Moved Permanently</TITLE>
</HEAD><BODY>
<H1>Moved Permanently</H1> 

I suppose this means that the page wants to redirect. How can I make
Nutch follow the redirect and crawl the target page instead?

It's not just one or two pages that look like this; it happens very frequently.
-- 
View this message in context: 
http://www.nabble.com/Make-nutch-follow-redirections-tp23996457p23996457.html
Sent from the Nutch - User mailing list archive at Nabble.com.
