RE: the efficiency of creating indexes

2009-02-18 Thread Fang_Li
Did you try it? The cost of index merging grows as the index gets bigger. Try limiting the maximum number of documents per segment by calling setMaxMergeDocs on IndexWriter. -Original Message- From: 治江 王 [mailto:wangzhijiang...@yahoo.com.cn] Sent: Monday, February 16, 2009 1:49 PM To:

Re: the efficiency of creating indexes

2009-02-18 Thread Michael McCandless
If not for merging, I believe indexing is simply linear. Merging adds only a cost factor that is logarithmic in the total index size. Using as large an IndexWriter RAM buffer as you can will minimize the amount of merging. (Increasing mergeFactor, or decreasing maxMergeMB/maxMergeDocs, also helps, but these will impact
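
To make the linear-plus-logarithmic argument concrete, here is a rough toy model of a logarithmic merge scheme. It is only a sketch under stated assumptions, not Lucene's actual merge policy: the function name simulate_merge_cost and its parameters are made up for illustration, buffer_docs stands in for the RAM buffer (counted in documents rather than MB), and merge_factor plays the role of mergeFactor. In Lucene of that era the corresponding knobs would be IndexWriter.setRAMBufferSizeMB, setMergeFactor and setMaxMergeDocs.

```python
def simulate_merge_cost(total_docs, buffer_docs, merge_factor):
    """Toy model: return (docs rewritten by merges, final segment count).

    Assumptions (not Lucene's real policy): every flush writes one segment
    of up to buffer_docs documents, and as soon as merge_factor segments
    exist at one level they are merged into a single segment one level up.
    """
    if buffer_docs < 1 or merge_factor < 2:
        raise ValueError("buffer_docs must be >= 1 and merge_factor >= 2")

    levels = [[]]        # levels[i] holds the sizes of the segments at level i
    merged_docs = 0      # documents copied by merges (the extra, non-linear work)
    flushed = 0

    while flushed < total_docs:
        # Flush the in-memory buffer as a new level-0 segment.
        size = min(buffer_docs, total_docs - flushed)
        flushed += size
        levels[0].append(size)

        # Cascade merges upward while a level is full.
        level = 0
        while len(levels[level]) == merge_factor:
            new_size = sum(levels[level])
            merged_docs += new_size
            levels[level] = []
            if level + 1 == len(levels):
                levels.append([])
            levels[level + 1].append(new_size)
            level += 1

    return merged_docs, sum(len(segments) for segments in levels)


if __name__ == "__main__":
    for buffer_docs in (10, 100, 1000):
        cost, segments = simulate_merge_cost(1000, buffer_docs, 10)
        print(buffer_docs, cost, segments)
```

In this model each document is rewritten about once per merge level, and the number of levels grows roughly like log base merge_factor of (total_docs / buffer_docs). A larger buffer or a larger merge factor therefore means fewer levels and less merge work, which matches the advice above, though real numbers depend on the actual merge policy and document sizes.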