scott green wrote:
HiHere is the main methond in RegexURLFilter, public static void main(String args[]) throws IOException { main(new RegexURLFilter(), args); } the empty RegexURLFilter() cotor will not load rules at all. So the main method should be run failed. It shoule be public static void main(String args[]) throws IOException { RegexURLFilter filter = new RegexURLFilter(); filter.setConf(NutchConfiguration.create()); main(filter, args); }
Yes, looks like you're right.
right? And i also did not find setConf() before filter() in URLFilters. I wonder where the rules are loaded from. Thanks
This is done for all plugins implementing Configurable interface in org.apache.nutch.plugin.Extension at line #162. -- Sami Siren
