There are several component libraries for UIMA on the net: http://incubator.apache.org/uima/external-resources.html

2010/2/18 José Moreira <matrixowns...@gmail.com>:
> have you used UIMA? i did a quick read on the docs and it seems to do what
> i'm looking for.
>
> 2010/2/11 Otis Gospodnetic <otis_gospodne...@yahoo.com>
>
>> Note that UIMA doesn't do NER itself (as far as I know), but instead
>> relies on GATE or OpenNLP or OpenCalais, AFAIK :)
>>
>> Those interested in UIMA and living close to New York should go to
>> http://www.meetup.com/NYC-Search-and-Discovery/calendar/12384559/
>>
>>
>> Otis
>> ----
>> Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch
>> Hadoop ecosystem search :: http://search-hadoop.com/
>>
>>
>>
>> ----- Original Message ----
>> > From: Jan Høydahl / Cominvent <jan....@cominvent.com>
>> > To: solr-user@lucene.apache.org
>> > Sent: Tue, February 9, 2010 9:57:26 AM
>> > Subject: Re: Faceting
>> >
>> > NOTE: Please start a new email thread for a new topic (See
>> > http://en.wikipedia.org/wiki/User:DonDiego/Thread_hijacking)
>> >
>> > Your strategy could work. You might want to look into dedicated entity
>> > extraction frameworks like
>> > http://opennlp.sourceforge.net/
>> > http://nlp.stanford.edu/software/CRF-NER.shtml
>> > http://incubator.apache.org/uima/index.html
>> >
>> > Or if that is too much work, look at
>> > http://issues.apache.org/jira/browse/SOLR-1725 for a way to plug in your
>> > entity extraction code into Solr itself using a scripting language.
>> >
>> > --
>> > Jan Høydahl - search architect
>> > Cominvent AS - www.cominvent.com
>> >
>> > On 5. feb. 2010, at 20.10, José Moreira wrote:
>> >
>> > > Hello,
>> > >
>> > > I'm planning to index a 'content' field for search and from that
>> > > fields text content i would like to facet (probably) according to if
>> > > the content has e-mails, urls and within urls, url's to pictures,
>> > > videos and others.
>> > >
>> > > As i'm a relatively new user to Solr, my plan was to regexp the
>> > > content in my application and add tags to a Solr field according to
>> > > the content, so for example the content "m...@email.com
>> > > http://www.site.com" would have the tags "email, link".
>> > >
>> > > If i follow this path can i then facet on "email" and/or "link" ? For
>> > > example combining facet field with facet value params?
>> > >
>> > > Best
>> > >
>> > > --
>> > > http://pt.linkedin.com/in/josemoreira
>> > > josemore...@irc.freenode.net
>> > > http://djangopeople.net/josemoreira/
>>
>>
>
>
>
> --
> josemore...@irc.freenode.net
> http://pt.linkedin.com/in/josemoreira
> http://djangopeople.net/josemoreira/
>

--
Lance Norskog
goks...@gmail.com
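
For reference, the regexp tagging plan from the original question could look roughly like the sketch below. It is only an illustration under assumptions, not anything posted in the thread: the field name `tags`, the helper names `extract_tags` and `facet_params`, the regular expressions, the file-extension lists, and the sample address are all made up for the example. It also assumes `tags` is declared as a multiValued string field in the schema. The query parameters used (`facet`, `facet.field`, `fq`) are Solr's standard faceting and filter parameters.

```python
import re
from urllib.parse import urlencode

# Deliberately simple patterns; real-world e-mail and URL matching is messier.
EMAIL_RE = re.compile(r"[\w.+-]+@[\w-]+(?:\.[\w-]+)+")
URL_RE = re.compile(r"https?://[^\s<>\"']+", re.IGNORECASE)

# Hypothetical extension lists used to classify what a URL points to.
PICTURE_EXTS = (".jpg", ".jpeg", ".png", ".gif")
VIDEO_EXTS = (".mp4", ".avi", ".mov", ".flv")


def extract_tags(content):
    """Return the sorted list of tags to store in the (assumed) 'tags' field."""
    tags = set()
    if EMAIL_RE.search(content):
        tags.add("email")
    for url in URL_RE.findall(content):
        tags.add("link")
        # Ignore any query string before checking the file extension.
        path = url.split("?", 1)[0].lower()
        if path.endswith(PICTURE_EXTS):
            tags.add("picture")
        elif path.endswith(VIDEO_EXTS):
            tags.add("video")
    return sorted(tags)


def facet_params(tag=None):
    """Build a query string that facets on 'tags', optionally filtered to one tag."""
    params = [("q", "*:*"), ("facet", "true"), ("facet.field", "tags")]
    if tag is not None:
        # fq restricts the result set to documents carrying this tag.
        params.append(("fq", "tags:" + tag))
    return urlencode(params)


if __name__ == "__main__":
    # 'someone@example.com' is a placeholder address, not the one from the thread.
    print(extract_tags("someone@example.com http://www.site.com"))
    print(facet_params("email"))
```

With the tags indexed this way, faceting on `tags` should return a count per tag value, and adding a filter query on one value (for example `tags:email`) narrows the results to documents with that tag, which seems to be what "combining facet field with facet value" was getting at.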