RE: A question about scoring function in Lucene

2004-12-15 Thread Nhan Nguyen Dang
"Space optimizations for > Total Ranking" paper > > (Linked > > here > http://lucene.sourceforge.net/publications.html) > > -Read section 6 to 6.4 of > > > http://www.csee.umbc.edu/cadip/readings/IR.report.120600.book.pdf > > -Read section 1 of &

A question about scoring function in Lucene

2004-12-14 Thread Nhan Nguyen Dang
Hi all, Lucene score document based on the correlation between the query q and document t: (this is raw function, I don't pay attention to the boost_t, coord_q_d factor) score_d = sum_t( tf_q * idf_t / norm_q * tf_d * idf_t / norm_d_t) (*) Could anybody explain it in detail ? Or are there any p

Re: Using Lucene to store document

2004-11-14 Thread Nhan Nguyen Dang
ht out of CVS. How to work with CVS is another story, probably described somewhere on jakarta.apache.org site. Otis --- Nhan Nguyen Dang wrote: > Hi Otis, > Please let me know what HEAD version of Lucene is? > Actually, I'm consider the advantages of storing document using > Lucen

Re: Using Lucene to store document

2004-11-10 Thread Nhan Nguyen Dang
Hi Otis, Please let me know what HEAD version of Lucene is? Actually, I'm consider the advantages of storing document using Lucene Stored field - For my Search engine. I've tested with thousands of documents and see that retrieve document (in this case XML file) with Lucene is a little bit faste

Using Lucene to store document

2004-11-09 Thread Nhan Nguyen Dang
Hi all, I'm using Lucene to index XML document/ file (may be millions of documents in future, each about 5-10KB) Beside the index for searching, I want to use Lucene to store whole document content with UnIndexed fields -content field(instead of store each document in a XML file). All the docume