On Sep 20, 2010, at 6:35 AM, Tommaso Teofili wrote:

> Hi all,
> I am working on integrating Apache UIMA as an UpdateRequestProcessor for
> Apache Solr and I am now at the first working snapshot.
> I put the code on GoogleCode [1] and you can take a look at the tutorial
> [2].
>
> I would be glad to donate it to the Apache Solr project,

I think this would be a great addition.

> as I think it could
> be a useful module to trigger automatic content extraction while indexing
> documents.
>
> At the moment the UIMAUpdateRequestProcessor base implementation can
> automatically extract a document's sentences, language, keywords, concepts and
> named entities using Apache UIMA's HMMTagger, OpenCalaisAnnotator and
> AlchemyAPIAnnotator components (but it can be easily expanded).
>
> Any feedback is welcome.
> Have a nice day.
> Tommaso
>
> [1] : http://code.google.com/p/solr-uima/
> [2] : http://code.google.com/p/solr-uima/wiki/5MinutesTutorial

--------------------------
Grant Ingersoll
http://lucenerevolution.org Apache Lucene/Solr Conference, Boston Oct 7-8
