Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change 
notification.

The "SetupProxyForNutch" page has been changed by LewisJohnMcgibbney:
http://wiki.apache.org/nutch/SetupProxyForNutch?action=diff&rev1=15&rev2=16

  
  Tinyproxy supports filtering of web sites based on URLs or domains. We need 
to specify the location of a text file containing the filter rules, one rule 
per line. This can be done as follows:
  {{{
+ vi /etc/filter
- sudo mkdir /etc/filter
- sudo cat > filters.txt
  }}}
- and add sites urls to be blocked. The list should comprise of single URLs, 
one per line.
+ and add the site URLs to be blocked. The list should consist of single URLs, one 
per line, just like the seed list for performing crawls.
  {{{
  google.com
  apache.org
