Re: Any way to ignore repeated terms in TF calculation?

Chris Hostetter Fri, 02 Jan 2009 21:08:47 -0800

: you can solve your problem at search time by passing a custom Similarity class


In particular, consider subclassing SeweetSpotSimilrity ... instead of a 
truely "flat" tf function, it makes it easy for you to define a 
"sweetspot" so 2 instances of a word can score a lot higher then 1 
instance, and 3 can score slightly higher then 2, but 3-1000 instances can 
all score equivilently.


-Hoss

---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org
For additional commands, e-mail: java-user-h...@lucene.apache.org

Re: Any way to ignore repeated terms in TF calculation?

Reply via email to