I need input from people on the following: (1) does it make sense to have regex-urlfilter.txt as an attribute to plugin RegexURLFilter, instead of a property in nutch-default.xml? The same with prefix-urlfilter.txt?


I personal would say that make sense, in general I think configuration values of plugin should be in the plugin.xml
Since nutch-config is for nutch and not plugins.


(2) By default, url filters are applied in AND logic,
anyone needs OR logical? I don't for now. But it may be a
bad idea to get too complicated here. If you need filtering
with fancy rules, a new plugin might have to be written.



AND logic sound fine, since all plugin would be 'filter' : -)

Thanks for the work!
Stefan



-------------------------------------------------------
This SF.Net email is sponsored by: IntelliVIEW -- Interactive Reporting
Tool for open source databases. Create drag-&-drop reports. Save time
by over 75%! Publish reports on the web. Export to DOC, XLS, RTF, etc.
Download a FREE copy at http://www.intelliview.com/go/osdn_nl
_______________________________________________
Nutch-developers mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-developers

Reply via email to