Do stemming algorithms take into consideration abbreviations too? Some examples:
mg = milligrams US = United States CD = compact disc vcr = video casette recorder And, the next logical question, if stemming does not take care of abbreviations, are there any solutions that include abbreviations inside or outside of Lucene? Thanks, Kevin -----Original Message----- From: Chris Lamprecht [mailto:[EMAIL PROTECTED] Sent: Friday, January 21, 2005 5:51 PM To: Lucene Users List Subject: Re: Stemming Also if you can't wait, see page 2 of http://www.onjava.com/pub/a/onjava/2003/01/15/lucene.html or the LIA e-book ;) On Fri, 21 Jan 2005 09:27:42 -0500, Kevin L. Cobb <[EMAIL PROTECTED]> wrote: > OK, OK ... I'll buy the book. I guess its about time since I am deeply > and forever in love with Lucene. Might as well take the final plunge. > > > -----Original Message----- > From: Otis Gospodnetic [mailto:[EMAIL PROTECTED] > Sent: Friday, January 21, 2005 9:12 AM > To: Lucene Users List > Subject: Re: Stemming > > Hi Kevin, > > Stemming is an optional operation and is done in the analysis step. > Lucene comes with a Porter stemmer and a Filter that you can use in an > Analyzer: > > ./src/java/org/apache/lucene/analysis/PorterStemFilter.java > ./src/java/org/apache/lucene/analysis/PorterStemmer.java > > You can find more about it here: > http://www.lucenebook.com/search?query=stemming > You can also see mentions of SnowballAnalyzer in those search results, > and you can find an adapter for SnowballAnalyzers in Lucene Sandbox. > > Otis > > --- "Kevin L. Cobb" <[EMAIL PROTECTED]> wrote: > > > I want to understand how Lucene uses stemming but can't find any > > documentation on the Lucene site. I'll continue to google but hope > > that > > this list can help narrow my search. I have several questions on the > > subject currently but hesitate to list them here since finding a good > > document on the subject may answer most of them. > > > > > > > > Thanks in advance for any pointers, > > > > > > > > Kevin > > > > > > > > > > > > > > --------------------------------------------------------------------- > To unsubscribe, e-mail: [EMAIL PROTECTED] > For additional commands, e-mail: [EMAIL PROTECTED] > > --------------------------------------------------------------------- > To unsubscribe, e-mail: [EMAIL PROTECTED] > For additional commands, e-mail: [EMAIL PROTECTED] > > --------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED] --------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]