Re: Controlling the maximum size of a segment during indexing

2010-04-09 Thread Lance Norskog
I should mention - I tried it with: config.setRAMBufferSizeMB(1.0); and should have posted that version. It still comes up with one 5mb CFS segment file. On Fri, Apr 9, 2010 at 2:55 PM, Lance Norskog wrote: > If the IndexWriterConfig.ram buffer size and the mergeMB size on the > policy

Re: Controlling the maximum size of a segment during indexing

2010-04-09 Thread Lance Norskog
equation where max-segment-size = function(four or five variables). I'm hunting for that function. This function should exist for file size because the variables appear to be in bytes. (For doc size it is difficult because of the buffer size part.) Thanks for your time, Lance Norskog On Fri,

Re: TestCodecs running time

2010-04-09 Thread Lance Norskog
seems to be that its threads make (each) >> 4000 iterations ... is that really required to ensure correctness? >> >> Shai >> > > - > To unsubscribe, e-mail: java-dev-unsubscr...@luce

Controlling the maximum size of a segment during indexing

2010-04-08 Thread Lance Norskog
Here is a Java unit test that uses the LogByteSizeMergePolicy to control the maximum size of segment files during indexing. That is, it tries. It does not succeed. Will someone who truly understands the merge policy code please examine it. There is probably one tiny parameter missing. It adds 20 d

[jira] Commented: (LUCENE-1761) low level Field metadata is never removed from index

2010-02-16 Thread Lance Norskog (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1761?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12834567#action_12834567 ] Lance Norskog commented on LUCENE-1761: --- Does this cause any performance

[jira] Commented: (LUCENE-517) norm compression breaks ranking for small fields

2009-11-20 Thread Lance Norskog (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-517?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12780901#action_12780901 ] Lance Norskog commented on LUCENE-517: -- [LUCENE-1360|http://issues.apache.org/

[jira] Issue Comment Edited: (LUCENE-1360) A Similarity class which has unique length norms for numTerms <= 10

2009-11-20 Thread Lance Norskog (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1360?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12780897#action_12780897 ] Lance Norskog edited comment on LUCENE-1360 at 11/21/09 3:0

[jira] Issue Comment Edited: (LUCENE-1360) A Similarity class which has unique length norms for numTerms <= 10

2009-11-20 Thread Lance Norskog (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1360?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12780897#action_12780897 ] Lance Norskog edited comment on LUCENE-1360 at 11/21/09 2:5

[jira] Updated: (LUCENE-1360) A Similarity class which has unique length norms for numTerms <= 10

2009-11-20 Thread Lance Norskog (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1360?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lance Norskog updated LUCENE-1360: -- Attachment: LUCENE-1380 visualization.pdf This is a graph of the standard norms against the

[jira] Commented: (LUCENE-1360) A Similarity class which has unique length norms for numTerms <= 10

2009-09-23 Thread Lance Norskog (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1360?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12758966#action_12758966 ] Lance Norskog commented on LUCENE-1360: --- Is this code still interesting? Tha

[jira] Commented: (LUCENE-1343) A replacement for ISOLatin1AccentFilter that does a more thorough job of removing diacritical marks or non-spacing modifiers.

2008-08-13 Thread Lance Norskog (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1343?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12622345#action_12622345 ] Lance Norskog commented on LUCENE-1343: --- Some languages like Cyrillic ha

[jira] Commented: (LUCENE-997) Add search timeout support to Lucene

2007-10-26 Thread Lance Norskog (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-997?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12538144 ] Lance Norskog commented on LUCENE-997: -- I stumbled above; I do not yet know Jira :) The Solr code is SOLR-392

[jira] Commented: (LUCENE-997) Add search timeout support to Lucene

2007-10-26 Thread Lance Norskog (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-997?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12538142 ] Lance Norskog commented on LUCENE-997: -- I just requested a more fancy feature in the Solr Jira. My apologies