suggestion on Filtering Results

2003-01-23 Thread alex
Hi all i want to ask what are the number of way you can filter out results i.e reduce documents return. I m using the Lucene demo as an index so this may retrict my options a bit i have so far done searchthat matches all word exclude some words

RE: Interpreting the score asociated with the Term? |

2003-01-23 Thread Sale, Doug
agreed (+0) > -Original Message- > From: Terry Steichen [mailto:[EMAIL PROTECTED]] > Sent: Thursday, January 23, 2003 11:07 AM > To: Lucene Users List > Subject: Re: Interpreting the score asociated with the Term? | > > > Otis, > > I think the effort you made in your previous message (t

Re: Interpreting the score asociated with the Term? |

2003-01-23 Thread Terry Steichen
Otis, I think the effort you made in your previous message (to describe the basic relevance measures in simple, non-algorithmic terms) is very important. If you think that list is reasonably comprehensive (that is, it captures most of what relevance means), I'd urge you to insert this into the do

Re: Interpreting the score asociated with the Term? |

2003-01-23 Thread Otis Gospodnetic
Yes, I believe so. --- Terry Steichen <[EMAIL PROTECTED]> wrote: > Otis, > > Didn't somebody (Doug?) also mention that a keyword in a shorter > document is > deemed more significant than in a longer one (because, I guess, it > represents a larger percentage of the document)? > > Regards, > > Te

Date fields (was: Re: Range queries)

2003-01-23 Thread Erik Hatcher
Again my apologies to Terry who was patient and confident despite my misunderstanding. I saw a change earlier today (on lucene-dev) with the QueryParser syntax updated with more details on the range query format. I think the ambiguity of saying a "date field" is still there though, since the

Re: Interpreting the score asociated with the Term? |

2003-01-23 Thread Terry Steichen
Otis, Didn't somebody (Doug?) also mention that a keyword in a shorter document is deemed more significant than in a longer one (because, I guess, it represents a larger percentage of the document)? Regards, Terry - Original Message - From: "Otis Gospodnetic" <[EMAIL PROTECTED]> To: "Luc

Re: Interpreting the score asociated with the Term? |

2003-01-23 Thread Otis Gospodnetic
Here is a simplified explanation of some basic stuff. 1. the more frequent the term (in a collection) the lower its weight (significance). Makes sense - very popular words don't distinguish one document from the other much, because they are present in so many docs. 2. the more frequent a word in

Re: Correlating matched terms with Document

2003-01-23 Thread Mailing Lists Account
Hi Lukas, 1. My problem was not parser. I am able to extract the required text fragments from the html document and index it. But when lucene returns a Hit, I am not sure how I can correlate it back to different portions of html document. Assuming that I use JTidy and I have a DOM, how will I know

Re: Range queries

2003-01-23 Thread Terry Steichen
Erik, That's good. Now I don't have to keep proving what is, is. Glad it finally made sense. Regards, Terry - Original Message - From: "Erik Hatcher" <[EMAIL PROTECTED]> To: "Lucene Users List" <[EMAIL PROTECTED]> Sent: Wednesday, January 22, 2003 11:43 PM Subject: Re: Range queries

Interpreting the score asociated with the Term? |

2003-01-23 Thread Rishabh Bajpai
Hi All, I am using Lucene as a Search Engine for my work. I am new to this, so forgive me if I am asking a cliched question! I need to understand how the SCORE for the search TERMs is calculated for Lucene, so that indexing can be appropriately be designed to return the most relevant results,