I have succesfully implemented NUTCH as crawler for SOLR index on 
http://szukaj.ug.edu.pl http://szukaj.ug.edu.pl  site. But there is some
problem with HTTP REFERER. Nutch is not sending referer header when crawling
sites. 

Is it possible to order NUTCH to send referer header on request?

Scenario:
1. Nutch open www.domain.pl
2. Nutch founds www.domain.pl/abcd.pdf link.
3. Nutch requested www.domain.pl/abcd.pdf but without
HTTP_REFERER=www.domain.pl



--
View this message in context: 
http://lucene.472066.n3.nabble.com/HTTP-REFERER-is-missing-tp3987959.html
Sent from the Nutch - Agent mailing list archive at Nabble.com.

Reply via email to