When I look at Nutch's log, I see this warning:

2007-04-16 23:06:10,768 WARN  regex.RegexURLNormalizer - can't find rules for scope 'outlink', using default

Can you please tell me how to set up rules for the scope 'outlink'?

I think that is why I can't crawl the site yahoo.com.

Thank you.

_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general
