Hi all,
I want to ask how many ways there are to filter out results, i.e.
reduce the documents returned. I'm using the Lucene demo as an index, so this may
restrict my options a bit.
So far I have done a search that matches all words
exclude some words
agreed (+0)
> -Original Message-
> From: Terry Steichen [mailto:[EMAIL PROTECTED]]
> Sent: Thursday, January 23, 2003 11:07 AM
> To: Lucene Users List
> Subject: Re: Interpreting the score asociated with the Term?
>
>
> Otis,
>
> I think the effort you made in your previous message (t
Otis,
I think the effort you made in your previous message (to describe the basic
relevance measures in simple, non-algorithmic terms) is very important. If
you think that list is reasonably comprehensive (that is, it captures most
of what relevance means), I'd urge you to insert this into the
do
Yes, I believe so.
--- Terry Steichen <[EMAIL PROTECTED]> wrote:
> Otis,
>
> Didn't somebody (Doug?) also mention that a keyword in a shorter
> document is
> deemed more significant than in a longer one (because, I guess, it
> represents a larger percentage of the document)?
>
> Regards,
>
> Te
Again my apologies to Terry who was patient and confident despite my
misunderstanding.
I saw a change earlier today (on lucene-dev) with the QueryParser
syntax updated with more details on the range query format. I think
the ambiguity of saying a "date field" is still there though, since the
Otis,
Didn't somebody (Doug?) also mention that a keyword in a shorter document is
deemed more significant than in a longer one (because, I guess, it
represents a larger percentage of the document)?
Regards,
Terry
- Original Message -
From: "Otis Gospodnetic" <[EMAIL PROTECTED]>
To: "Luc
Here is a simplified explanation of some basic stuff.
1. the more frequent the term (in a collection) the lower its weight
(significance). Makes sense - very popular words don't distinguish one
document from the other much, because they are present in so many docs.
2. the more frequent a word in
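
Point 1 above is what is usually called inverse document frequency. Here is a minimal sketch in Python, assuming the formula 1 + ln(numDocs / (docFreq + 1)). The function name and the sample counts are made up for illustration, and the exact formula in any given Lucene version may differ.

```python
import math


def inverse_document_frequency(doc_freq, num_docs):
    """Weight of a term that appears in doc_freq out of num_docs documents."""
    # Assumed, illustrative formula: the rarer the term, the larger the weight.
    return 1.0 + math.log(num_docs / (doc_freq + 1))


# Hypothetical collection of 1000 documents.
common = inverse_document_frequency(doc_freq=900, num_docs=1000)
rare = inverse_document_frequency(doc_freq=5, num_docs=1000)

# A word found in almost every document should weigh less than a rare one.
print(common < rare)
```

The +1 in the denominator only guards against division by zero for a term that appears in no document.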
Hi Lukas,
1. My problem was not the parser. I am able to extract the required text
fragments from the HTML document and index them. But when Lucene returns a Hit,
I am not sure how I can correlate it back to the different portions of the HTML
document. Assuming that I use JTidy and I have a DOM, how will I know
Erik,
That's good. Now I don't have to keep proving what is, is. Glad it finally
made sense.
Regards,
Terry
- Original Message -
From: "Erik Hatcher" <[EMAIL PROTECTED]>
To: "Lucene Users List" <[EMAIL PROTECTED]>
Sent: Wednesday, January 22, 2003 11:43 PM
Subject: Re: Range queries
Hi All,
I am using Lucene as a Search Engine for my work. I am new to this, so forgive me if I
am asking a cliched question!
I need to understand how the SCORE for the search TERMs is calculated in Lucene, so
that indexing can be appropriately designed to return the most relevant results,