Re: Help interpreting explanation

2006-03-05 Thread Chris Hostetter
: cosine similarity and need some help. Can anyone tell me in which file
: are the methods of the DefaultSimilarity methods called?

Most of the Similarity methods are called by the various Scorers. A good IDE will tell you where they are called (or you could just grep the source, that's what I

Distributed Lucene..

2006-03-05 Thread Prasenjit Mukherjee
I already have an implementation of a distributed crawler farm, where crawler instances are running on different boxes. I want to come up with a distributed indexing scheme using Lucene that takes advantage of the distributed nature of my crawlers. Here is what I am thinking.

Re: Help interpreting explanation

2006-03-05 Thread Eugene
Thanks for posting the "more like this" code. I just began coding my cosine similarity and need some help. Can anyone tell me in which file the DefaultSimilarity methods are called? For example, looking at the tf method I see that it takes in a float for freq instead of int.

How to integrate snowball in lucene

2006-03-05 Thread Haritha_Parvatham
Hi, can anyone guide me on integrating Snowball into Lucene? I have downloaded the Snowball sources, but some files are written in C. I have compiled them. Please tell me how to add Snowball to Lucene for multilingual support.

Re: Solr, the Lucene based Search Server

2006-03-05 Thread Chris Hostetter
: Solr is a new open-source search server that's based on Lucene, and
: has XML/HTTP interfaces for updating and querying, declarative
: specification of analyzers and field types via a schema, extensive
: caching, replication, and a web admin interface.

FYI, last year when I announced that CNET w

carrot2 vs. vivisimo

2006-03-05 Thread zzzzz shalev
Hey all, my team has been working for the last couple of days on integrating Carrot2 into our project as a sort of SRC (search result clustering) solution. I was rather impressed with the results, until I checked out Vivisimo's demo and saw a bit of a difference quality-wise, ha

Re: Help interpreting explanation

2006-03-05 Thread Eric Jain
Eugene wrote: Any good links on extending the similarity class? A lot of posts discuss David Spencer's "More Like This" but I can't find this anywhere.

The "More Like This" code can be found here: http://svn.apache.org/viewcvs.cgi/lucene/java/trunk/contrib/similarity/

Re: Help interpreting explanation

2006-03-05 Thread Eugene
I was wondering if anyone has any idea how I can start to implement my own similarity. I want to use the cosine similarity measure instead. I was looking through past forum posts and saw that quite a few people have also discussed this, but no real method of doing it was mentioned. Any good
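
For readers unfamiliar with the measure being asked about: here is a minimal sketch of cosine similarity over raw term-frequency vectors, in plain Java. It does not use Lucene's Similarity API and is not how Lucene scores documents; the class name CosineSketch, the method names, and the simple split-on-non-word-characters tokenizer are all assumptions made for illustration.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical helper, not part of Lucene: cosine similarity between
// two texts represented as raw term-frequency vectors.
class CosineSketch {

    // Count how often each lowercased token occurs in the text.
    // Tokenization is a naive split on non-word characters (an assumption).
    static Map<String, Integer> termFrequencies(String text) {
        Map<String, Integer> tf = new HashMap<>();
        for (String token : text.toLowerCase().split("\\W+")) {
            if (!token.isEmpty()) {
                tf.merge(token, 1, Integer::sum);
            }
        }
        return tf;
    }

    // cos(a, b) = (a . b) / (|a| * |b|); returns 0.0 if either vector is empty.
    static double cosine(Map<String, Integer> a, Map<String, Integer> b) {
        double dot = 0.0;
        double normA = 0.0;
        double normB = 0.0;
        for (Map.Entry<String, Integer> e : a.entrySet()) {
            double va = e.getValue();
            normA += va * va;
            Integer vb = b.get(e.getKey());
            if (vb != null) {
                dot += va * vb; // only shared terms contribute to the dot product
            }
        }
        for (int v : b.values()) {
            normB += (double) v * v;
        }
        if (normA == 0.0 || normB == 0.0) {
            return 0.0;
        }
        return dot / (Math.sqrt(normA) * Math.sqrt(normB));
    }
}
```

Calling CosineSketch.cosine(CosineSketch.termFrequencies(docA), CosineSketch.termFrequencies(docB)) gives a value between 0 (no shared terms) and 1 (identical term distributions). Plugging a measure like this into Lucene's scoring would still require going through the Similarity class and the Scorers discussed elsewhere in this thread.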

QueryParser dropping constraints?

2006-03-05 Thread Eric Jain
I've noticed that while both the default QueryParser and the PrecedenceQueryParser refuse to parse

    foo bar) baz

they both seem to interpret

    foo bar( baz

as

    foo bar

Bug or feature? In any case, it would be great if there was a "strict" mode, and a more lenient mode whe

MultiPhraseQuery

2006-03-05 Thread Eric Jain
I need to write a function that copies a MultiPhraseQuery and changes the field the query applies to. Unfortunately the API allows access to neither the contained terms nor the field! The other query classes I have so far dealt with all seem to allow access to the contained query terms...