Doug Cutting wrote:
There are a number of duplicated libs in the plugins, namely:
commons-httpclient-3.0-beta1.jar src/plugin/parse-rss/lib
commons-httpclient-3.0.jar src/plugin/protocol-httpclient/lib
Not sure what was the reason to use the beta1, perhaps no reason except
that it was the latest available at the moment...
log4j-1.2.11.jar src/plugin/clustering-carrot2/lib
log4j-1.2.6.jar 1 src/plugin/parse-rss/lib
log4j-1.2.9.jar src/plugin/parse-pdf/lib
nekohtml-0.9.2.jar src/plugin/clustering-carrot2/lib
nekohtml-0.9.4.jar src/plugin/parse-html/lib
The differences here AFAIK are purely accidental, and I believe we can
just keep the latest releases.
xerces-2_6_2.jar lib
xercesImpl.jar src/plugin/parse-rss/lib
Not sure about these ones, but Xerces APIs are pretty stable, so I'd
risk removing xercesImpl.jar .
Are there any known reasons to keep multiple versions of things, or
should we move these each into their own plugin that can be shared?
The latter is what I advocated for log4j and various xml-related high
level API libs (jdom, dom4j, jaxen).
--
Best regards,
Andrzej Bialecki <><
___. ___ ___ ___ _ _ __________________________________
[__ || __|__/|__||\/| Information Retrieval, Semantic Web
___|||__|| \| || | Embedded Unix, System Integration
http://www.sigram.com Contact: info at sigram dot com
-------------------------------------------------------
This SF.net email is sponsored by: Splunk Inc. Do you grep through log files
for problems? Stop! Download the new AJAX search engine that makes
searching your log files as easy as surfing the web. DOWNLOAD SPLUNK!
http://sel.as-us.falkag.net/sel?cmd=lnk&kid=103432&bid=230486&dat=121642
_______________________________________________
Nutch-developers mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-developers