Hi all, I just found this presentation that was held at latest (May 2010) Lucene EuroCon held in Prague, it seems that some work on Solr CAS Consumer has already been done :-) Regards, Tommaso
[1] : http://lucene-eurocon.org/slides/Integration-of-Natural-Language-Processing-tools-with-Solr_Joan-Codina-Filba.pdf 2010/9/23 Tommaso Teofili <[email protected]> > Hi Jörn, > > 2010/9/22 Jörn Kottmann <[email protected]> > > The Solr people told me at last years ApacheCon that >> they are interested in a collaboration with UIMA. >> >> Here is an interesting thread about how the integration >> could be done: >> http://www.mail-archive.com/[email protected]/msg24007.html >> >> I think the author is right that posting to a Solr server from the end >> of a analysis pipeline might be the best option to integrate UIMA, >> it will also work nicely with UIMA AS where analysis pipeline can >> run on a number of different machines. >> >> Here is the link to Tommasos post at the Solr ML: >> http://lucene.472066.n3.nabble.com/Solr-UIMA-integration-td1528253.html >> >> Jörn >> > > I see very interesting points and suggestions and it's good to see interest > around where Solr and UIMA could meet. > > Let me say that in my opinion there are two feasible perspectives of > integration, being the first one from UIMA point of view and the latter from > Solr point of view. > > A UIMA perspective would consider Solr like some sort of storing platform > (let's call it Use Case 1), so, in terms of communication between the two > systems, it's UIMA "asking" Solr to consume one or more CASs (probably > defining a Solr CAS Consumer as suggested in Jörn's link) and I agree it > would fit very well also with UIMA AS. > > From Solr point of view (Use Case 2), a UIMA system can be queried to > process some document while Solr is consuming it for indexing (an update > processors chain); so it's Solr asking UIMA to "enrich" its document; maybe > UIMA AS would fit well here too, I would put this point on the TODO list :-) > > My proposal tries to consider this second scenario (Solr making a request > to UIMA) but I think it would be great to have both of this use cases > implemented and I would be happy to work on UC1 too (UIMA making an indexing > requesto to Solr via a CAS Consumer). > > > 2010/9/23 Vaijanath Rao <[email protected]> > > Though I am not an expert on either of the two components UIMA/SOLR, I >> would >> like to to assist you with the integration of UIMA with SOLR. > > > Thanks Vaijanath, any hint from you, both on ideas or code, will be more > than welcome. > > > I created an issue on Solr Jira regarding the UC2 integration proposal [1] > and I am preparing the patch to attach so if you have any objections or > suggestions, again, it would be nice to hear them. > Regards, > Tommaso > > > [1] : https://issues.apache.org/jira/browse/SOLR-2129 > > p.s.: > as I side note, since maybe it's not a main concern at this time, I think > that UC2 integration fit better as a Solr module whereas UC1 (a > SolrCASConsumer) would fit better as a project inside the UIMA Sandbox > > > >
