Hi,
what is the purpose of "tf_q * idf_t / norm_q" in Lucene's scoring
algorithm:
score_d = sum_t( tf_q * idf_t / norm_q * tf_d * idf_t / norm_d_t)
I dont understand, why the score has to be higher, when the frequency of a
term in the query is higher. What is normalized by "norm_q"?
Thanks,
Ch
Hi,
Has anyone tried building taxonomies in Lucene? Any idea what is the
likely approach to be taken?
thanks
-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
On Wed, Sep 17, 2003 at 08:00:42AM -0400, Erik Hatcher wrote:
> I'm about to start some refactorings on the web application demo that
> ships with Lucene to show off its features and be usable more easily
> and cleanly out of the box - i.e. just drop into Tomcat's webapps
> directory and go.
>
> Has anyone tried building taxonomies in Lucene? Any idea what is the
> likely approach to be taken?
I'm storing data with a hierarchical classification in a Lucene index,
if that is what you mean.
The approach is very simple. Every document has a field for a unique
identifier, a field for the i
Hui,
Concerning an other point of your request list I proposed a patch this week
end on the lucene-dev list and i totally forgot that this feature was
requested on the user list.
This new feature should help you to set a number of Documents to be merged
in memory independently of the mergeFactor.
-BEGIN PGP SIGNED MESSAGE-
Hash: SHA1
Hi, all.
In the Query class, and his subclasses.
Are there any limitation in size ?
Thanks in avance.
- --
Cecilio Cano Calonge ยท Czy
GNUpg Key = 5011 67C7 7C0B A513 C18F D93B 071B BA7C 9DF6 9399
-BEGIN PGP SIGNATURE-
Version: GnuPG v1
BTW, this is with lucene 1.2
Thanks!
-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
Ah, this is a fun one lots of fiddly issues with how queries work
and how QueryParser works. I'll take a stab at some of these inline
below
On Monday, September 22, 2003, at 08:26 PM, Dan Quaroni wrote:
I have a simple command line interface for testing.
Interesting interface. Looks l
It is a great. Julien. Thanks.
Next time I am going to post the requests to the developer groups.
Regards,
Hui
- Original Message -
From: "Julien Nioche" <[EMAIL PROTECTED]>
To: "Lucene Users List" <[EMAIL PROTECTED]>
Sent: Tuesday, September 23, 2003 5:38 AM
Subject: Proposition :adding
Erik's analysis is comprehensive and useful. I think this example reflects
a common (and understandable) oversight - that wildcards do *not* work with
a phrase. Got caught on that many times myself. Also there may be
confusion about the format -> field:(term1 term2), in that the examples
provide
Yeah, thanks a lot for your help! I'm using the release version of Lucene
version 1.2.
> not all your queries are truly "WildcardQuery"'s though. look at the
> class it constructed to get a better idea of what is happening.
Yeah, I printed the queries out to see what was going on and noticed
Your email prompted me to re-read the query parser documentation. There are
only two examples using parentheses, which seem to be the answer to my
questions. They are:
(jakarta OR apache) AND website
And
title:(+return +"pink panther")
These leave a lot unanswered, though. I mean, for exa
Hello,
> I guess the answer is that I need to just play with it and find out,
> but as
> others have mentioned, the documentation is lacking in some respects
> and I'd
> say this is one of them... Maybe I'll submit some answers when I
> figure them out. :)
Thank you, always appreciated.
Otis
_
Better yet, submit some JUnit test cases that show how this stuff
works, if the ones in Lucene's codebase aren't comprehensive enough.
This is an excellent way to "play" with an API and get a good
understanding of it and documenting it at the same time.
Erik
On Tuesday, September 23, 2003, a
Hi folks,
I have been using Lucene for a while. Our application needs to sort
the result set by last modified date. I was really happy to see SearchBean
and
HitsIterator.
My question is that can I use SearchBean for search using Multiple indices.
I skimmed through the souce code but could not
On Tuesday, September 23, 2003, at 10:09 AM, Dan Quaroni wrote:
Yeah, thanks a lot for your help! I'm using the release version of
Lucene
version 1.2.
Perhaps give the latest codebase a try too, just to see if any fixes
(particularly in that WildcardQuery.toString) are there.
you're getting hi
> Perhaps give the latest codebase a try too, just to see if any fixes
> (particularly in that WildcardQuery.toString) are there.
It's our intention to put this into a production environment soon, so we
were waiting on 1.3 to go final before attempting to use it.
> i wouldn't worry about
> memo
Can I send a small lucene index by SOAP/TCP/HTTP/RMI? Is there a way to
serialize a Lucene Index?
I wan to send it from the Indexer server to the Search Server, and then
do a merge operation in the Search Server with the previous index file.
Thanks.
-
Can I send a small lucene index by SOAP/TCP/HTTP/RMI? Is there a way
to serialize a Lucene Index?
I wan to send it from the Indexer server to the Search Server, and
then do a merge operation in the Search Server with the previous index
file.
Well, what about a very old fashioned way instead? Som
I, like a lot of other people are new to Lucene. Practical examples
are pretty scarce.
I have the following site:
http://www.tasteofwhatcom.com
It's built on JBoss 3.0.7/Tomcat 4.1.24, Apache 2.0.47/mod_jk 1.2.4,
MySQL 3.23.57 and RedHat 9.0.
I want to add search capabilites to the site to all
I, like a lot of other people are new to Lucene. Practical examples
are pretty scarce.
If you don't mind learning by example, take a look at the "Powered by
Lucene" page. A fair number of those projects are open source.
http://jakarta.apache.org/lucene/docs/powered.html
PA.
---
On Tuesday 23 September 2003 00:12, Chris Hennen wrote:
> Hi,
>
> what is the purpose of "tf_q * idf_t / norm_q" in Lucene's scoring
> algorithm:
> score_d = sum_t( tf_q * idf_t / norm_q * tf_d * idf_t / norm_d_t)
>
> I dont understand, why the score has to be higher, when the frequency of a
> term
22 matches
Mail list logo