-
From: Kasun Perera [mailto:kas...@opensource.lk]
Sent: Saturday, April 28, 2012 6:03 AM
To: java-user@lucene.apache.org
Subject: Indexing with Semantics
I'm using Lucene's Term Freq vector to calculate cosine similarity between
documents, Say my docments has these 3 terms, owe owed owing. Lucene
I'm using Lucene's Term Freq vector to calculate cosine similarity between
documents, Say my docments has these 3 terms, owe owed owing. Lucene
takes this as 3 separate terms, but 3 of them means same owe. Is there
any functionality in Lucene that can be used to index by semantics? so that
it
stemmer
semantic is a large word, care to use it.
On Sat, Apr 28, 2012 at 11:02 AM, Kasun Perera kas...@opensource.lk wrote:
I'm using Lucene's Term Freq vector to calculate cosine similarity between
documents, Say my docments has these 3 terms, owe owed owing. Lucene
takes this as 3 separate