Re: [jira] Commented: (LUCENE-834) Payload Queries

2007-04-18 Thread Grant Ingersoll
OK, I need to take a step back, Michael, b/c I thought I understood your original comment, but I went to make the change and am no longer sure. By "first term position" are you referring to multiple terms per position or do you mean the same term in different positions? When I implemen

Re: [jira] Resolved: (LUCENE-730) Restore top level disjunction performance

2007-04-18 Thread Paul Elschot
On Wednesday 18 April 2007 00:05, Otis Gospodnetic (JIRA) wrote: > > [ https://issues.apache.org/jira/browse/LUCENE-730?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] > > Otis Gospodnetic resolved LUCENE-730. > - > >Resolut

Re: optimize() method call

2007-04-18 Thread Grant Ingersoll
Has anyone done in benchmarking to approximate how long it takes to optimize different size indexes? Is the merging linear, sub-linear, etc.? On Apr 8, 2007, at 1:01 AM, Otis Gospodnetic wrote: I'd advise against calling optimize() at all in an environment whose indices are constantly upd

[jira] Updated: (LUCENE-848) Add supported for Wikipedia English as a corpus in the benchmarker stuff

2007-04-18 Thread Steven Parkes (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-848?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Steven Parkes updated LUCENE-848: - Attachment: LUCENE-848.txt Upgrade to Xerces 2. Xerces 1 passes the sanity check, but fails for w

[jira] Updated: (LUCENE-848) Add supported for Wikipedia English as a corpus in the benchmarker stuff

2007-04-18 Thread Steven Parkes (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-848?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Steven Parkes updated LUCENE-848: - Attachment: xerces.jar > Add supported for Wikipedia English as a corpus in the benchmarker stuff

[jira] Updated: (LUCENE-848) Add supported for Wikipedia English as a corpus in the benchmarker stuff

2007-04-18 Thread Steven Parkes (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-848?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Steven Parkes updated LUCENE-848: - Attachment: xml-apis.jar Now I see the button for attach multiple files. Oh, well. Anyway, both

Re: optimize() method call

2007-04-18 Thread Sean Timm
In the brief test I did indexing 500K documents and optimizing every 10K documents, I found that indexing is constant time (flat) and optimize() time increases linearly. -Sean Grant Ingersoll wrote on 4/18/2007, 4:29 PM: > Has anyone done in benchmarking to approximate how long it takes to >

RE: optimize() method call

2007-04-18 Thread Steven Parkes
I think can be greater than linear. It would be linear if optimize only copied each segment into the result. However, it will only merge maxMerge segments at a time, so in some cases, some segment data is going to be copied more than once. So something like O(n log n)? -Original Message- F

[jira] Commented: (LUCENE-736) Sloppy Phrase Scoring Misbehavior

2007-04-18 Thread Doron Cohen (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-736?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12489930 ] Doron Cohen commented on LUCENE-736: Need to see if the parts of the test (in QueryUtils) that were disabled by