have you used UIMA? i did a quick read on the docs and it seems to do what
i'm looking for.

2010/2/11 Otis Gospodnetic <otis_gospodne...@yahoo.com>

> Note that UIMA doesn't doe NER itself (as far as I know), but instead
> relies on GATE or OpenNLP or OpenCalais, AFAIK :)
>
> Those interested in UIMA and living close to New York should go to
> http://www.meetup.com/NYC-Search-and-Discovery/calendar/12384559/
>
>
> Otis
> ----
> Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch
> Hadoop ecosystem search :: http://search-hadoop.com/
>
>
>
> ----- Original Message ----
> > From: Jan Høydahl / Cominvent <jan....@cominvent.com>
> > To: solr-user@lucene.apache.org
> > Sent: Tue, February 9, 2010 9:57:26 AM
> > Subject: Re: Faceting
> >
> > NOTE: Please start a new email thread for a new topic (See
> > http://en.wikipedia.org/wiki/User:DonDiego/Thread_hijacking)
> >
> > Your strategy could work. You might want to look into dedicated entity
> > extraction frameworks like
> > http://opennlp.sourceforge.net/
> > http://nlp.stanford.edu/software/CRF-NER.shtml
> > http://incubator.apache.org/uima/index.html
> >
> > Or if that is too much work, look at
> > http://issues.apache.org/jira/browse/SOLR-1725 for a way to plug in your
> entity
> > extraction code into Solr itself using a scripting language.
> >
> > --
> > Jan Høydahl  - search architect
> > Cominvent AS - www.cominvent.com
> >
> > On 5. feb. 2010, at 20.10, José Moreira wrote:
> >
> > > Hello,
> > >
> > > I'm planning to index a 'content' field for search and from that
> > > fields text content i would like to facet (probably) according to if
> > > the content has e-mails, urls and within urls, url's to pictures,
> > > videos and others.
> > >
> > > As i'm a relatively new user to Solr, my plan was to regexp the
> > > content in my application and add tags to a Solr field according to
> > > the content, so for example the content "m...@email.com
> > > http://www.site.com"; would have the tags "email, link".
> > >
> > > If i follow this path can i then facet on "email" and/or "link" ? For
> > > example combining facet field with facet value params?
> > >
> > > Best
> > >
> > > --
> > > http://pt.linkedin.com/in/josemoreira
> > > josemore...@irc.freenode.net
> > > http://djangopeople.net/josemoreira/
>
>


-- 
josemore...@irc.freenode.net
http://pt.linkedin.com/in/josemoreira
http://djangopeople.net/josemoreira/

Reply via email to