How can we know if 2 lucene indexes are same?

2008-08-29 Thread Noble Paul നോബിള്‍ नोब्ळ्
Hi, I wish to know whether the contents of two indexes hold the same data. Will all the files be exactly the same if I add the same set of documents to both? --Noble - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMA

Re: How can we know if 2 lucene indexes are same?

2008-08-29 Thread tom
AUTOMATIC REPLY Tom Roberts is out of the office till 2nd September 2008. LUX reopens on 1st September 2008 - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]

Re: How can we know if 2 lucene indexes are same?

2008-08-29 Thread Karl Wettin
On 29 Aug 2008 at 11:35, Noble Paul നോബിള്‍ नोब्ळ् wrote: Hi, I wish to know whether the contents of two indexes hold the same data. Will all the files be exactly the same if I add the same set of documents to both? If you insert the documents in the same order with the same settings and both indices are
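The file-level half of the question can be checked without Lucene at all. The following is a minimal sketch, not anything posted in the thread: the class name IndexDirCompare and the method sameFiles are made up for illustration, and it assumes both indexes live in ordinary file system directories. It reports true only when both directories contain the same file names with identical bytes. A false result does not by itself prove that the indexed data differs, since file layout may depend on insertion order and settings, as the reply above suggests.

```java
import java.io.IOException;
import java.io.UncheckedIOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.Arrays;
import java.util.List;
import java.util.stream.Collectors;
import java.util.stream.Stream;

/** Hypothetical helper: byte-for-byte comparison of two index directories. */
public final class IndexDirCompare {

    /** Returns true if both directories hold the same file names with identical bytes. */
    public static boolean sameFiles(Path dirA, Path dirB) {
        try {
            List<String> namesA = listNames(dirA);
            List<String> namesB = listNames(dirB);
            if (!namesA.equals(namesB)) {
                return false; // different set of files
            }
            for (String name : namesA) {
                // Reads whole files into memory; fine for a sketch, not for huge indexes.
                byte[] a = Files.readAllBytes(dirA.resolve(name));
                byte[] b = Files.readAllBytes(dirB.resolve(name));
                if (!Arrays.equals(a, b)) {
                    return false;
                }
            }
            return true;
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    /** Sorted names of the regular files directly inside dir. */
    private static List<String> listNames(Path dir) throws IOException {
        try (Stream<Path> entries = Files.list(dir)) {
            return entries.filter(Files::isRegularFile)
                          .map(p -> p.getFileName().toString())
                          .sorted()
                          .collect(Collectors.toList());
        }
    }
}
```

To compare the logical contents rather than the files, one would instead walk both indexes and compare stored fields and terms, which this sketch does not attempt.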

Re: Clarity: Is there a Query boosting 50-50 over 1000-1 ?

2008-08-29 Thread Sébastien Rainville
I'm curious... what do you mean by "It's not perfect (there is no such thing) but it works pretty well in most cases, and works great if you spend a little time figuring out the right length normalization factors."? Can you please elaborate a little more on what are the length normalization factors e

getting a random doc from index

2008-08-29 Thread Cam Bazz
Hello, how can I select a random document from the document collection inside a Lucene index? Best regards, -C.B.

Re: getting a random doc from index

2008-08-29 Thread Karl Wettin
On 29 Aug 2008 at 15:21, Cam Bazz wrote: Hello, how can I select a random document from the document collection inside a Lucene index? Something like public Document randomDoc() { int docNo = (int)(new java.util.Random().nextDouble() * indexReader.maxDoc()); return inde
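One detail worth guarding against with this approach: document numbers below maxDoc() can belong to deleted documents. Below is a minimal sketch of the same idea with a retry loop. RandomDocPicker and randomLiveDoc are hypothetical names, and the deletion check is passed in as a plain predicate so the example runs without Lucene. With a Lucene 2.x IndexReader one would presumably pass reader.maxDoc() and a predicate that calls reader.isDeleted(docNo), then load the result with reader.document(docNo).

```java
import java.util.OptionalInt;
import java.util.Random;
import java.util.function.IntPredicate;

/** Hypothetical helper: pick a random document number that is not deleted. */
public final class RandomDocPicker {

    /**
     * Draws uniformly from [0, maxDoc) until a non-deleted number is found
     * or maxTries draws have been used up.
     */
    public static OptionalInt randomLiveDoc(int maxDoc, IntPredicate isDeleted,
                                            Random rnd, int maxTries) {
        if (maxDoc <= 0) {
            return OptionalInt.empty(); // empty index
        }
        for (int i = 0; i < maxTries; i++) {
            int docNo = rnd.nextInt(maxDoc); // uniform in [0, maxDoc)
            if (!isDeleted.test(docNo)) {
                return OptionalInt.of(docNo);
            }
        }
        return OptionalInt.empty(); // gave up, e.g. nearly everything is deleted
    }
}
```

Reusing one Random instance across calls, rather than creating a new one per call, avoids repeated seeding.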

reusing Document with multiple fields in lucene 2.3

2008-08-29 Thread Gunjan Juyal
Hello all! I have a document structure such that each document can have a variable number of fields with a given name. For example: - Document-1 has two fields with the name "fld1" and three fields with the name "fld2"; - Document-2 has five fields with the name "fld1" and one field with

Re: Clarity: Is there a Query boosting 50-50 over 1000-1 ?

2008-08-29 Thread Grant Ingersoll
On Aug 29, 2008, at 7:53 AM, Sébastien Rainville wrote: I'm curious... what do you mean by "It's not perfect (there is no such thing) but it works pretty well in most cases, and works great if you spend a little time figuring out the right length normalization factors."? Can you please elabor

RE: Clarity: Is there a Query boosting 50-50 over 1000-1 ?

2008-08-29 Thread Shi Hui Liu
Sorry for misleading you. The query output looks like: +((TITLE:A | BODY:A) (TITLE:B | BODY:B)). Let me explain it. Say S(A) means the score of query A; TF(A) means the term frequency of A in the current document. Currently I'm using BooleanQuery to combine them into one query. If TF(A)=1000 and TF(B)=1 in document D1, TF
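To make the arithmetic concrete, here is a rough sketch of why a plain sum of per-term scores tends to favor the lopsided document. It assumes Lucene's DefaultSimilarity term-frequency factor, sqrt(freq), and deliberately ignores idf, norms, coord, boosts and the per-field disjunction in the query above. The class TfScoreSketch is hypothetical, and the 50/50 figures for the second document are taken from the thread subject, not from the truncated message.

```java
/** Hypothetical sketch: sum of sqrt-damped term frequencies for two terms. */
public final class TfScoreSketch {

    /** Term-frequency factor as in Lucene's DefaultSimilarity: sqrt(freq). */
    public static double tf(int freq) {
        return Math.sqrt(freq);
    }

    /** Simplified BooleanQuery-style score: the per-term factors are added up. */
    public static double sumScore(int tfA, int tfB) {
        return tf(tfA) + tf(tfB);
    }

    public static void main(String[] args) {
        // D1: TF(A)=1000, TF(B)=1; D2: TF(A)=50, TF(B)=50 (assumed from the subject line)
        System.out.println("D1 (1000, 1): " + sumScore(1000, 1));
        System.out.println("D2 (50, 50):  " + sumScore(50, 50));
    }
}
```

Under these simplifications D1 comes out at roughly 32.6 and D2 at roughly 14.1, so the 1000-1 document ranks first even though the 50-50 document covers both terms evenly.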

RE: boost freshness instead of sorting

2008-08-29 Thread Yannis Pavlidis
Steven, Andrzej, Thanks for your suggestions. Turning off the norms did it, and it has been working as I initially expected. Regards, Yannis. -Original Message- From: Andrzej Bialecki [mailto:[EMAIL PROTECTED] Sent: Thu 8/28/2008 11:40 AM To: java-user@lucene.apache.org Subject:

Pre-filtering for expensive query

2008-08-29 Thread Matt Ronge
Hi all, I am working on implementing a new Query, Weight and Scorer that is expensive to run. I'd like to limit the number of documents I run this query on by first building a candidate set of documents with a boolean query. Once I have that candidate set, I was hoping I could build a fil
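The two-phase idea can be sketched independently of Lucene's Query, Weight and Scorer classes. In this minimal sketch, PrefilterSketch and scoreCandidates are hypothetical names, the candidate set is a java.util.BitSet of document numbers (which, as far as I know, is also what a Lucene 2.x Filter returns from bits(IndexReader)), and the expensive scoring step is passed in as a function. The point is only that the costly function runs once per candidate instead of once per document in the index.

```java
import java.util.BitSet;
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.function.IntToDoubleFunction;

/** Hypothetical sketch: run an expensive scorer only on pre-filtered candidates. */
public final class PrefilterSketch {

    /**
     * Applies expensiveScore to every document number set in candidates
     * and returns the scores keyed by document number, in increasing order.
     */
    public static Map<Integer, Double> scoreCandidates(BitSet candidates,
                                                       IntToDoubleFunction expensiveScore) {
        Map<Integer, Double> scores = new LinkedHashMap<>();
        // nextSetBit skips straight to the next candidate; it returns -1 when none is left
        for (int doc = candidates.nextSetBit(0); doc >= 0; doc = candidates.nextSetBit(doc + 1)) {
            scores.put(doc, expensiveScore.applyAsDouble(doc));
        }
        return scores;
    }
}
```

How the candidate bits are produced from the boolean query, and how the result is wired back into a Scorer, depends on the Lucene version and is left out here.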

MoreLikeThis return no results

2008-08-29 Thread davood
Hi, I'm trying to get MoreLikeThis working, but it returns no results. I have Lucene working for normal queries and indexing, but MoreLikeThis just returns nothing. This is what I'm trying: IndexReader reader = IndexReader.open(INDEX_PATH); IndexSearcher searcher = new IndexSearcher(INDEX_PAT
