Hi,

On 3/1/07, Grant Ingersoll <[EMAIL PROTECTED]> wrote:
Is the Droids lab at all related to that parsing project in Nutch?

Partly, yes. I've been looking at Droids and so far I think it's main
focus has been on the crawling part rather than on the analysis of
retrieved content. A generic content analysis toolkit would likely be
a great companion for Droids. In fact I was earlier contemplating
about starting a related effort in Apache Labs (see
http://issues.apache.org/jira/browse/JCR-728), but there seems to be
enough demand for such functionality that a more full-fledged project
might be better.

There seems to be several efforts that are related here that could
probably make for a nice new project under Lucene, IMO.  They all
seem to have to do with getting and preparing text for processing by
some type of consumer of text.

Exactly. It would be great to see some consolidation of efforts.

BR,

Jukka Zitting

---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to