On Tue, Apr 26, 2011 at 12:24 AM, Otis Gospodnetic
otis_gospodne...@yahoo.com wrote:
But somehow this feels bad (well, so does sticking word variations in what's
supposed to be a synonyms file), partly because it means that the person
adding
new synonyms would need to know what they stem to
Suppose your analysis stack includes lower-casing, but your synonyms are
only supposed to apply to upper-case tokens. For example, PET might
be a synonym of positron emission tomography, but pet wouldn't be.
-Mike
On 04/26/2011 09:51 AM, Robert Muir wrote:
On Tue, Apr 26, 2011 at 12:24 AM,
Mike, thanks a lot for your example: the idea here would be you would
put the lowercasefilter after the synonymfilter, and then you get this
exact flexibility?
e.g.
WhitespaceTokenizer
SynonymFilter - no lowercasing of tokens are done as it analyzes
your synonyms with just the tokenizer
Yes, I see. Makes sense. It is a bit hard to see a bad case for your
proposal in that light. Here is one other example; I'm not sure whether
it presents difficulties or not, and may be a bit contrived, but hey,
food for thought at least:
Say you have set up synonyms between names and
Hi Otis Robert,
- Original Message
How do people handle cases where synonyms are used and there are multiple
version of the original word that really need to point to the same set of
synonyms?
For example:
Consider singular and plural of the word responsibility. One
This has come up with stemming: you can stem your synonym list with
the FieldAnalyzer Solr http call, then save the final chewed-up terms
as a new synonym file. You then use that one in the analyzer stack
below the stemmer filter.
On Mon, Apr 25, 2011 at 9:15 PM, Otis Gospodnetic
://sematext.com/ :: Solr - Lucene - Nutch
Lucene ecosystem search :: http://search-lucene.com/
- Original Message
From: Lance Norskog goks...@gmail.com
To: solr-user@lucene.apache.org
Sent: Tue, April 26, 2011 12:20:05 AM
Subject: Re: Automatic synonyms for multiple variations of a word