I need input from people on the following: (1) does it make sense to have regex-urlfilter.txt as an attribute to plugin RegexURLFilter, instead of a property in nutch-default.xml? The same with prefix-urlfilter.txt?
I personal would say that make sense, in general I think configuration values of plugin should be in the plugin.xml
Since nutch-config is for nutch and not plugins.
(2) By default, url filters are applied in AND logic, anyone needs OR logical? I don't for now. But it may be a bad idea to get too complicated here. If you need filtering with fancy rules, a new plugin might have to be written.
AND logic sound fine, since all plugin would be 'filter' : -)
Thanks for the work! Stefan
------------------------------------------------------- This SF.Net email is sponsored by: IntelliVIEW -- Interactive Reporting Tool for open source databases. Create drag-&-drop reports. Save time by over 75%! Publish reports on the web. Export to DOC, XLS, RTF, etc. Download a FREE copy at http://www.intelliview.com/go/osdn_nl _______________________________________________ Nutch-developers mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/nutch-developers
