> Prefix filter to cut off anything without "http://". And then a
> (non-existent) domain-suffix filter, which considers only domain
> suffixes - this is easy to implement based on the suffix filter that
> ships with Nutch.
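For illustration, here is a rough sketch of what the two steps described above might look like when combined. It is a standalone class, not a Nutch plugin: the class name DomainSuffixFilterSketch and its constructor are made up for this example, and only the "return the URL to accept, null to reject" convention is borrowed from how URL filters usually behave. The real suffix filter may work quite differently.

```java
import java.net.URI;
import java.net.URISyntaxException;
import java.util.HashSet;
import java.util.Locale;
import java.util.Set;

// Hypothetical standalone sketch; it does not use or extend any Nutch class.
class DomainSuffixFilterSketch {
    private final Set<String> allowedSuffixes = new HashSet<>();

    DomainSuffixFilterSketch(String... suffixes) {
        for (String s : suffixes) {
            allowedSuffixes.add(s.toLowerCase(Locale.ROOT));
        }
    }

    // Returns the URL unchanged if it is accepted, or null if it is rejected.
    String filter(String url) {
        // Step 1: prefix check, cut off anything without "http://".
        if (url == null || !url.startsWith("http://")) {
            return null;
        }
        // Step 2: look only at the host, never at the path or query.
        String host;
        try {
            host = new URI(url).getHost();
        } catch (URISyntaxException e) {
            return null;
        }
        if (host == null) {
            return null;
        }
        // Compare the host and each parent domain against the allowed set,
        // e.g. "a.b.example.org", "b.example.org", "example.org", "org".
        String candidate = host.toLowerCase(Locale.ROOT);
        while (true) {
            if (allowedSuffixes.contains(candidate)) {
                return url;
            }
            int dot = candidate.indexOf('.');
            if (dot < 0) {
                return null;
            }
            candidate = candidate.substring(dot + 1);
        }
    }
}
```

Matching on whole domain labels with set lookups, rather than on raw string endings, means "evilexample.org" does not slip through when only "example.org" is allowed, and no regular expression is evaluated per URL.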
We should probably change the default filter to something other than regex.

--
 Sami Siren

_______________________________________________
Nutch-general mailing list
https://lists.sourceforge.net/lists/listinfo/nutch-general
