[jira] Commented: (LUCENE-1260) Norm codec strategy in Similarity

2008-04-09 Thread Karl Wettin (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1260?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12587446#action_12587446 ] Karl Wettin commented on LUCENE-1260: - I notice there is a tyop in the patch. And ther

[jira] Commented: (LUCENE-1260) Norm codec strategy in Similarity

2008-04-09 Thread Karl Wettin (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1260?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12587445#action_12587445 ] Karl Wettin commented on LUCENE-1260: - {quote} 1) "norms" is a vague term. currently "

[jira] Updated: (LUCENE-1262) IndexOutOfBoundsException from FieldsReader after problem reading the index

2008-04-09 Thread Trejkaz (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1262?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Trejkaz updated LUCENE-1262: Attachment: Test.java Attaching a test program to reproduce the problem under 2.3.1. It occurs approximat

[jira] Updated: (LUCENE-1262) IndexOutOfBoundsException from FieldsReader after problem reading the index

2008-04-09 Thread Trejkaz (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1262?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Trejkaz updated LUCENE-1262: Affects Version/s: (was: 2.1) 2.3.1 Summary: IndexOutOfBoundsExcep

[jira] Commented: (LUCENE-1260) Norm codec strategy in Similarity

2008-04-09 Thread Hoss Man (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1260?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12587435#action_12587435 ] Hoss Man commented on LUCENE-1260: -- bq. My use case is really about document boost and no

[jira] Updated: (LUCENE-1262) NullPointerException from FieldsReader after problem reading the index

2008-04-09 Thread Trejkaz (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1262?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Trejkaz updated LUCENE-1262: Affects Version/s: (was: 2.2) 2.1 Okay I'll eat my words now, it is indeed 2.1

[jira] Updated: (LUCENE-1262) NullPointerException from FieldsReader after problem reading the index

2008-04-09 Thread Trejkaz (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1262?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Trejkaz updated LUCENE-1262: Affects Version/s: (was: 2.3.1) 2.2 Whoops. I don't think it's 2.1 but it must

Re: Flexible indexing design

2008-04-09 Thread Marvin Humphrey
On Apr 9, 2008, at 6:35 AM, Michael Busch wrote: We also need to come up with a good solution for the dictionary, because a term with frq/prx postings needs to store two (or three for skiplist) file pointers in the dictionary, whereas e.g. a "binary" posting list only needs one pointer.

[jira] Commented: (LUCENE-1260) Norm codec strategy in Similarity

2008-04-09 Thread Karl Wettin (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1260?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12587290#action_12587290 ] Karl Wettin commented on LUCENE-1260: - {quote} As long as the norm remains a fixed siz

index reopen question

2008-04-09 Thread John Wang
Hi: Have been reading the 2.3.1 release code and have a few questions regarding indexReader reopen: 1) looking at the code: if (this.hasChanges || this.isCurrent()) { // the index hasn't changed - nothing to do here return this; } Shouldn't it be !this.hasChanges?

Re: [jira] Created: (LUCENE-1257) Port to Java5

2008-04-09 Thread Yonik Seeley
Just for kicks, I tried it on a 64 bit Athlon, linux_x86_64, jvm=64 bit Sun 1.6 -server. The explicit loop counter was 50% faster (for N=10... the inner loop) -Yonik On Tue, Apr 8, 2008 at 8:21 PM, Yonik Seeley <[EMAIL PROTECTED]> wrote: > On Tue, Apr 8, 2008 at 7:48 PM, robert engels <[EMAIL PRO

Re: [jira] Created: (LUCENE-1257) Port to Java5

2008-04-09 Thread robert engels
I think it is going to be highly JVM dependent. I reworked it to call each twice (and reordered the tests)... the foreach is still faster. I also ran it on Windows (under Parallels) and got similar results, but in some cases the indexed was faster. "server" times are tough to judge because n

RE: [jira] Created: (LUCENE-1257) Port to Java5

2008-04-09 Thread melix
Hi, I confirm your results. I didn't think there could be a difference using foreach constructs... Cedric Steven A Rowe wrote: > > On 04/04/2008 at 4:40 AM, Toke Eskildsen wrote: >> On Wed, 2008-04-02 at 09:30 -0400, Mark Miller wrote: >> > > - replacement of indexed for loops with for each c

Flexible indexing design (was Re: Pooling of posting objects in DocumentsWriter)

2008-04-09 Thread Michael Busch
Thanks for your quick answers. Michael McCandless wrote: Hi Michael, I've actually been working on factoring DocumentsWriter, as a first step towards flexible indexing. Cool, yeah separating the DocumentsWriter into multiple classes certainly helped understanding the complex code better.

Re: Storing phrases in index

2008-04-09 Thread Mathieu Lecarme
palexv wrote: Hello all. I have a question to advanced in lucene. I have a set of phrases which I need to store in index. Is there is a way of storing phrases as terms in index? How is the best way of writing such index? Should this field be tokenized? not tokenized What is the best wa

Storing phrases in index

2008-04-09 Thread palexv
Hello all. I have a question to advanced in lucene. I have a set of phrases which I need to store in index. Is there is a way of storing phrases as terms in index? How is the best way of writing such index? Should this field be tokenized? What is the best way of searching phrases by mask in suc

RE: [jira] Created: (LUCENE-1257) Port to Java5

2008-04-09 Thread Steven A Rowe
Hi Toke, On 04/09/2008 at 2:43 AM, Toke Eskildsen wrote: > On Tue, 2008-04-08 at 18:48 -0500, robert engels wrote: > > That is opposite of my testing:... > > > > The 'foreach' is consistently faster. The time difference is > > independent of the size of the array. What I know about JVM > > implem

Re: StandardTokenizerConstants in 2.3

2008-04-09 Thread Antony Bowesman
Thanks Mike/Hoss for the clarification. Antony Michael McCandless wrote: Chris Hostetter wrote: : > But, StandardTokenizer is public? It "exports" those constants for you? : : Really? Sorry, but I can't find them - in 2.3.1 sources, there are no : references to those statics. Javadocs h

[jira] Commented: (LUCENE-1262) NullPointerException from FieldsReader after problem reading the index

2008-04-09 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1262?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12587117#action_12587117 ] Michael McCandless commented on LUCENE-1262: Those stack traces look like 2.1

[jira] Updated: (LUCENE-1150) The token types of the standard tokenizer is not accessible

2008-04-09 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1150?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless updated LUCENE-1150: --- Fix Version/s: 2.3.2 Backported fix to 2.3.2. > The token types of the standard tok

Re: StandardTokenizerConstants in 2.3

2008-04-09 Thread Michael McCandless
Chris Hostetter wrote: : > But, StandardTokenizer is public? It "exports" those constants for you? : : Really? Sorry, but I can't find them - in 2.3.1 sources, there are no : references to those statics. Javadocs have no reference to them in : StandardTokenizer I think Michael is forg

Re: Optimise Indexing time using lucene..

2008-04-09 Thread Mathieu Lecarme
lucene4varma wrote: Hi all, I am new to lucene and am using it for text search in my web application, and for that i need to index records in database. We are using jdbc directory to store the indexes. Now the problem is when is start the process of indexing the records for the first time it