Hi, On 3/1/07, Grant Ingersoll <[EMAIL PROTECTED]> wrote:
Is the Droids lab at all related to that parsing project in Nutch?
Partly, yes. I've been looking at Droids and so far I think it's main focus has been on the crawling part rather than on the analysis of retrieved content. A generic content analysis toolkit would likely be a great companion for Droids. In fact I was earlier contemplating about starting a related effort in Apache Labs (see http://issues.apache.org/jira/browse/JCR-728), but there seems to be enough demand for such functionality that a more full-fledged project might be better.
There seems to be several efforts that are related here that could probably make for a nice new project under Lucene, IMO. They all seem to have to do with getting and preparing text for processing by some type of consumer of text.
Exactly. It would be great to see some consolidation of efforts. BR, Jukka Zitting --------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]