scoring algorithm

2003-09-23 Thread Chris Hennen
Hi, what is the purpose of "tf_q * idf_t / norm_q" in Lucene's scoring algorithm: score_d = sum_t( tf_q * idf_t / norm_q * tf_d * idf_t / norm_d_t )? I don't understand why the score has to be higher when the frequency of a term in the query is higher. What is normalized by "norm_q"? Thanks, Ch
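
For readers following this thread, the formula quoted in the question can be evaluated directly. The following is a minimal sketch in plain Java, not Lucene's API: the class ScoreSketch, the method score, and the array-based inputs are hypothetical names chosen for illustration, and all numbers are made up. It only shows arithmetically that, with norm_q and everything else held fixed, a larger tf_q for a term increases that term's contribution to score_d.

```java
// Hypothetical illustration of the formula quoted in the question:
//   score_d = sum_t( tf_q * idf_t / norm_q * tf_d * idf_t / norm_d_t )
// This is NOT Lucene code; every name and number here is an assumption.
class ScoreSketch {

    // Index i in each array refers to the same term t.
    static double score(double[] tfQ, double[] idf, double normQ,
                        double[] tfD, double[] normD) {
        double sum = 0.0;
        for (int i = 0; i < tfQ.length; i++) {
            double queryWeight = tfQ[i] * idf[i] / normQ;    // tf_q * idf_t / norm_q
            double docWeight = tfD[i] * idf[i] / normD[i];   // tf_d * idf_t / norm_d_t
            sum += queryWeight * docWeight;
        }
        return sum;
    }

    public static void main(String[] args) {
        double[] idf = {2.0, 1.0};
        double[] tfD = {3.0, 1.0};
        double[] normD = {1.0, 1.0};

        // Second term occurs twice in the query.
        System.out.println(score(new double[] {1.0, 2.0}, idf, 2.0, tfD, normD)); // prints 7.0

        // Same query, but the second term now occurs four times (norm_q kept fixed).
        System.out.println(score(new double[] {1.0, 4.0}, idf, 2.0, tfD, normD)); // prints 8.0
    }
}
```

Note that the sketch keeps norm_q constant between the two calls. Whether norm_q itself changes with the query's term frequencies is exactly what the question asks, and the sketch does not answer it.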

taxonomy with lucene

2003-09-23 Thread Gwo Haur, Fun
Hi, has anyone tried building taxonomies in Lucene? Any idea what approach is likely to work? Thanks. - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]

Re: Lucene demo ideas?

2003-09-23 Thread Steven J. Owens
On Wed, Sep 17, 2003 at 08:00:42AM -0400, Erik Hatcher wrote: > I'm about to start some refactorings on the web application demo that > ships with Lucene to show off its features and be usable more easily > and cleanly out of the box - i.e. just drop into Tomcat's webapps > directory and go. >

Re: taxonomy with lucene

2003-09-23 Thread Eric Jain
> Has anyone tried building taxonomies in Lucene? Any idea what is the > likely approach to be taken? I'm storing data with a hierarchical classification in a Lucene index, if that is what you mean. The approach is very simple. Every document has a field for a unique identifier, a field for the i

Proposition :adding minMergeDoc to IndexWriter

2003-09-23 Thread Julien Nioche
Hui, Concerning another point on your request list: I proposed a patch this weekend on the lucene-dev list and I totally forgot that this feature was requested on the user list. This new feature should help you set the number of Documents to be merged in memory independently of the mergeFactor.

Limitation in size of Query class.

2003-09-23 Thread Cecilio Cano Calonge
Hi, all. Is there any size limitation in the Query class and its subclasses? Thanks in advance. -- Cecilio Cano Calonge

RE: Confusion over wildcard search logic

2003-09-23 Thread Dan Quaroni
BTW, this is with Lucene 1.2. Thanks!

Re: Confusion over wildcard search logic

2003-09-23 Thread Erik Hatcher
Ah, this is a fun one: lots of fiddly issues with how queries work and how QueryParser works. I'll take a stab at some of these inline below. On Monday, September 22, 2003, at 08:26 PM, Dan Quaroni wrote: I have a simple command line interface for testing. Interesting interface. Looks l

Re: Proposition :adding minMergeDoc to IndexWriter

2003-09-23 Thread hui
It is great, Julien. Thanks. Next time I am going to post the requests to the developer groups. Regards, Hui - Original Message - From: "Julien Nioche" <[EMAIL PROTECTED]> To: "Lucene Users List" <[EMAIL PROTECTED]> Sent: Tuesday, September 23, 2003 5:38 AM Subject: Proposition :adding

Re: Confusion over wildcard search logic

2003-09-23 Thread Terry Steichen
Erik's analysis is comprehensive and useful. I think this example reflects a common (and understandable) oversight - that wildcards do *not* work with a phrase. Got caught on that many times myself. Also there may be confusion about the format -> field:(term1 term2), in that the examples provide

RE: Confusion over wildcard search logic

2003-09-23 Thread Dan Quaroni
Yeah, thanks a lot for your help! I'm using the release version of Lucene version 1.2. > not all your queries are truly "WildcardQuery"'s though. look at the > class it constructed to get a better idea of what is happening. Yeah, I printed the queries out to see what was going on and noticed

RE: Confusion over wildcard search logic

2003-09-23 Thread Dan Quaroni
Your email prompted me to re-read the query parser documentation. There are only two examples using parentheses, which seem to be the answer to my questions. They are: (jakarta OR apache) AND website; and: title:(+return +"pink panther"). These leave a lot unanswered, though. I mean, for exa

RE: Confusion over wildcard search logic

2003-09-23 Thread Otis Gospodnetic
Hello, > I guess the answer is that I need to just play with it and find out, > but as > others have mentioned, the documentation is lacking in some respects > and I'd > say this is one of them... Maybe I'll submit some answers when I > figure them out. :) Thank you, always appreciated. Otis

Re: Confusion over wildcard search logic

2003-09-23 Thread Erik Hatcher
Better yet, submit some JUnit test cases that show how this stuff works, if the ones in Lucene's codebase aren't comprehensive enough. This is an excellent way to "play" with an API, get a good understanding of it, and document it at the same time. Erik On Tuesday, September 23, 2003, a

Multiple Index search using SearchBean.

2003-09-23 Thread Radhakrishnan, Velayudham
Hi folks, I have been using Lucene for a while. Our application needs to sort the result set by last modified date. I was really happy to see SearchBean and HitsIterator. My question is: can I use SearchBean to search multiple indices? I skimmed through the source code but could not

Re: Confusion over wildcard search logic

2003-09-23 Thread Erik Hatcher
On Tuesday, September 23, 2003, at 10:09 AM, Dan Quaroni wrote: Yeah, thanks a lot for your help! I'm using the release version of Lucene version 1.2. Perhaps give the latest codebase a try too, just to see if any fixes (particularly in that WildcardQuery.toString) are there. you're getting hi

RE: Confusion over wildcard search logic

2003-09-23 Thread Dan Quaroni
> Perhaps give the latest codebase a try too, just to see if any fixes > (particularly in that WildcardQuery.toString) are there. It's our intention to put this into a production environment soon, so we were waiting on 1.3 to go final before attempting to use it. > i wouldn't worry about > memo

Is the lucene index serializable?

2003-09-23 Thread Albert Vila Puig
Can I send a small Lucene index by SOAP/TCP/HTTP/RMI? Is there a way to serialize a Lucene index? I want to send it from the Indexer server to the Search Server, and then do a merge operation in the Search Server with the previous index file. Thanks.

Re: Is the lucene index serializable?

2003-09-23 Thread petite_abeille
Can I send a small lucene index by SOAP/TCP/HTTP/RMI? Is there a way to serialize a Lucene Index? I want to send it from the Indexer server to the Search Server, and then do a merge operation in the Search Server with the previous index file. Well, what about a very old-fashioned way instead? Som

Design question

2003-09-23 Thread Jack Lauman
I, like a lot of other people, am new to Lucene. Practical examples are pretty scarce. I have the following site: http://www.tasteofwhatcom.com It's built on JBoss 3.0.7/Tomcat 4.1.24, Apache 2.0.47/mod_jk 1.2.4, MySQL 3.23.57 and RedHat 9.0. I want to add search capabilities to the site to all

Re: Design question

2003-09-23 Thread petite_abeille
I, like a lot of other people, am new to Lucene. Practical examples are pretty scarce. If you don't mind learning by example, take a look at the "Powered by Lucene" page. A fair number of those projects are open source. http://jakarta.apache.org/lucene/docs/powered.html PA.

Re: scoring algorithm

2003-09-23 Thread Ype Kingma
On Tuesday 23 September 2003 00:12, Chris Hennen wrote: > Hi, > > what is the purpose of "tf_q * idf_t / norm_q" in Lucene's scoring > algorithm: > score_d = sum_t( tf_q * idf_t / norm_q * tf_d * idf_t / norm_d_t) > > I don't understand why the score has to be higher when the frequency of a > term