Hey Erik,

What did you do to calculate similarity? I haven't had time to try it, but I have been 
thinking about ways to add the ability to get the similarity score (as calculated 
during a search) given a term vector (or just a document id). Any ideas on how to 
approach this would be appreciated. Scoring in Lucene has always been a bit confusing 
to me, even after reading the code several times, especially once you get into boolean 
queries, etc.

Thanks,
Grant

>>> [EMAIL PROTECTED] 06/01/04 06:01AM >>>
On May 31, 2004, at 2:17 PM, Stefan Groschupf wrote:
> Lucene can't help you.

What about using term vectors though?  I've been able to do rudimentary 
document similarity calculations using the new support in Lucene 1.4.  
Search the 'net for more info on term vectors and the formulas needed 
(elementary vector angle calculation, actually).
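
For anyone following along, here is a minimal sketch of that vector angle 
(cosine) calculation in plain Java. It does not use the Lucene API at all: 
the term/frequency maps come from a hypothetical whitespace tokenizer 
(termFrequencies) that stands in for whatever terms and frequencies you 
read out of a stored term vector. The class and method names are made up 
for illustration, and the raw term counts are unweighted (no idf), so treat 
it as a starting point rather than what Lucene's scoring does.

```java
import java.util.HashMap;
import java.util.Map;

class TermVectorSimilarity {

    // Hypothetical stand-in for a term vector: lowercases the text, splits on
    // whitespace, and counts how often each token occurs.
    public static Map<String, Integer> termFrequencies(String text) {
        Map<String, Integer> tf = new HashMap<>();
        for (String token : text.toLowerCase().split("\\s+")) {
            if (!token.isEmpty()) {
                tf.merge(token, 1, Integer::sum);
            }
        }
        return tf;
    }

    // Euclidean length of a term-frequency vector.
    public static double norm(Map<String, Integer> v) {
        double sumOfSquares = 0.0;
        for (int freq : v.values()) {
            sumOfSquares += (double) freq * freq;
        }
        return Math.sqrt(sumOfSquares);
    }

    // Cosine of the angle between two term-frequency vectors:
    // dot(a, b) / (|a| * |b|). Returns 0.0 if either vector is empty.
    public static double cosine(Map<String, Integer> a, Map<String, Integer> b) {
        double dot = 0.0;
        for (Map.Entry<String, Integer> e : a.entrySet()) {
            Integer other = b.get(e.getKey());
            if (other != null) {
                dot += (double) e.getValue() * other;
            }
        }
        double normA = norm(a);
        double normB = norm(b);
        if (normA == 0.0 || normB == 0.0) {
            return 0.0;
        }
        return dot / (normA * normB);
    }

    public static void main(String[] args) {
        Map<String, Integer> doc1 = termFrequencies("lucene term vectors");
        Map<String, Integer> doc2 = termFrequencies("lucene document similarity");
        System.out.println(cosine(doc1, doc2));
    }
}
```

Scores range from 0 (no terms in common) to roughly 1 (same term 
distribution), so you can rank documents by how close they are to a 
reference text.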

        Erik

> On 31.05.2004 at 20:10, uddam chukmol wrote:
>
>> Hi,
>>
>> I'm a newbie to Lucene and have heard that it helps with information 
>> retrieval. However, my problem is not really about information 
>> retrieval but about comparing two texts. I think Lucene may help 
>> solve it.
>>
>> I would like to have a clue on how to compare two given texts and 
>> say how similar they are.
>>
>> Has anyone had this kind of experience? I will be very grateful to 
>> hear your ideas and your recommendations.
>>
>> Thanks in advance!
>>
>> Uddam CHUKMOL
> ---------------------------------------------------------------
> open technology:   http://www.media-style.com 
> open source:           http://www.weta-group.net 
> open discussion:    http://www.text-mining.org 
>
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: [EMAIL PROTECTED] 
> For additional commands, e-mail: [EMAIL PROTECTED] 


