Is there a parser that can take a string and tell you what part is an address, and what is not?
Split the field into 2 fields?

Search: Dr. Bell in Denver, CO
Search: Dr. Smith near 10722 Main St, Denver, CO
Search: Denver, CO for Cardiologist

Thoughts?

2011/5/5 François Schiettecatte <fschietteca...@gmail.com>:
> Rajani
>
> You might also want to look at Balie ( http://balie.sourceforge.net/ ), from
> the web site:
>
> Features:
>
> • language identification
> • tokenization
> • sentence boundary detection
> • named-entity recognition
>
> Can't vouch for it though.
>
> On May 5, 2011, at 4:58 AM, Jan Høydahl wrote:
>
>> Hi,
>>
>> Solr does not have lemmatization out of the box.
>>
>> You'll have to find 3rd party analyzers, and the most known such is from
>> BasisTech. Please contact them to learn more.
>>
>> I'm not aware of any open source lemmatizers for Solr.
>>
>> --
>> Jan Høydahl, search solution architect
>> Cominvent AS - www.cominvent.com
>>
>> On 5. mai 2011, at 10.34, rajini maski wrote:
>>
>>> Does the solr enable lemmatization concept?
>>>
>>> I found a documentation that gives an information as solr enables
>>> lemmatization concept. Here is the link :
>>> http://www.basistech.com/knowledge-center/search/2010-09-language-identification-language-support-and-entity-extraction.pdf
>>>
>>> Can anyone help me finding the jar specified in that document so that i can
>>> add it as plugin.
>>> jar :rlp.solr.RLPTokenizerFactory
>>>
>>> Thanks and Regards,
>>> Rajani Maski