No, relative URL's are resolved in both parsers plugins. You can try to disable 
it manually. There's no way to remove them from the CrawlDB except some clever 
filtering. They're absolute now.

 
 
-----Original message-----
> From:webdev1977 <webdev1...@gmail.com>
> Sent: Tue 18-Sep-2012 15:24
> To: user@nutch.apache.org
> Subject: Relative urls - outlinks
> 
> Is there anyway to keep nutch from generating outlinks for any RELATIVE urls? 
> I basically don't want to use ANY relative urls that I find.. 
> 
> Then the next question is how do I get them out of my crawldb :-)
> 
> 
> 
> --
> View this message in context: 
> http://lucene.472066.n3.nabble.com/Relative-urls-outlinks-tp4008601.html
> Sent from the Nutch - User mailing list archive at Nabble.com.
> 

Reply via email to