Dear Wiki user, You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change notification.
The "SetupProxyForNutch" page has been changed by LewisJohnMcgibbney: http://wiki.apache.org/nutch/SetupProxyForNutch?action=diff&rev1=15&rev2=16 Tinyproxy supports filtering of web sites based on URLs or domains. We need to specify the location of a text file containing the filter rules, one rule per line. This can be done as follows {{{ + vi /etc/filter - sudo mkdir /etc/filter - sudo cat > filters.txt }}} - and add sites urls to be blocked. The list should comprise of single URLs, one per line. + and add site urls to be blocked. The list should comprise of single URLs, one per line, just like the seed list for performing crawls. {{{ google.com apache.org