[jira] Assigned: (LUCENE-2215) paging collector

2010-01-19 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2215?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Grant Ingersoll reassigned LUCENE-2215: --- Assignee: Grant Ingersoll > paging collector > > >

[jira] Created: (LUCENE-2227) separate chararrayset interface from impl

2010-01-19 Thread Robert Muir (JIRA)
separate chararrayset interface from impl - Key: LUCENE-2227 URL: https://issues.apache.org/jira/browse/LUCENE-2227 Project: Lucene - Java Issue Type: Task Components: Analysis Affects Ve

[jira] Commented: (LUCENE-1410) PFOR implementation

2010-01-19 Thread Renaud Delbru (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1410?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12802235#action_12802235 ] Renaud Delbru commented on LUCENE-1410: --- On another aspect, why is the PFOR/FOR is e

[jira] Issue Comment Edited: (LUCENE-1410) PFOR implementation

2010-01-19 Thread Renaud Delbru (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1410?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12802235#action_12802235 ] Renaud Delbru edited comment on LUCENE-1410 at 1/19/10 1:10 PM:

[jira] Updated: (LUCENE-2111) Wrapup flexible indexing

2010-01-19 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2111?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless updated LUCENE-2111: --- Attachment: LUCENE-2111.patch Attached patch w/ various fixes: - Switch over payl

[jira] Commented: (LUCENE-2215) paging collector

2010-01-19 Thread Adam Heinz (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2215?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12802276#action_12802276 ] Adam Heinz commented on LUCENE-2215: Awesome, thanks! I'll schedule some time in the

[jira] Updated: (LUCENE-2213) Small improvements to ArrayUtil.getNextSize

2010-01-19 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2213?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless updated LUCENE-2213: --- Attachment: LUCENE-2213.patch New patch, just renaming to ArrayUtil.oversize. > Sma

[jira] Commented: (LUCENE-1410) PFOR implementation

2010-01-19 Thread Paul Elschot (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1410?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12802335#action_12802335 ] Paul Elschot commented on LUCENE-1410: -- The only reason why the number of compressed

Re: Lucene memory consumption

2010-01-19 Thread Sanne Grinovero
Hello Frederic, I'm CCing java-dev@lucene.apache.org as Michael McCandless has been very helpful on IRC in discussing the ThreadLocal implication, and it would be nice you could provide first-hand information. There's a good reading to start from at http://issues.apache.org/jira/browse/LUCENE-1383

[jira] Commented: (LUCENE-2217) SortedVIntList allocation should use ArrayUtils.getNextSize()

2010-01-19 Thread Paul Elschot (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2217?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12802449#action_12802449 ] Paul Elschot commented on LUCENE-2217: -- Btw. shouldn't IndexInput.bytes also be reall

[jira] Commented: (LUCENE-2217) SortedVIntList allocation should use ArrayUtils.getNextSize()

2010-01-19 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2217?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12802456#action_12802456 ] Michael McCandless commented on LUCENE-2217: bq. Btw. shouldn't IndexInput.byt

[jira] Commented: (LUCENE-2217) SortedVIntList allocation should use ArrayUtils.getNextSize()

2010-01-19 Thread Paul Elschot (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2217?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12802510#action_12802510 ] Paul Elschot commented on LUCENE-2217: -- Well, it's not that I'm searching, but I'll p

[jira] Commented: (LUCENE-2205) Rework of the TermInfosReader class to remove the Terms[], TermInfos[], and the index pointer long[] and create a more memory efficient data structure.

2010-01-19 Thread Deepak (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2205?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12802535#action_12802535 ] Deepak commented on LUCENE-2205: Hi Aaron I hope you will be able to post the files today

[jira] Commented: (LUCENE-1990) Add unsigned packed int impls in oal.util

2010-01-19 Thread Toke Eskildsen (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1990?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12802542#action_12802542 ] Toke Eskildsen commented on LUCENE-1990: Introducing yet another level of indirect

[jira] Commented: (LUCENE-1488) multilingual analyzer based on icu

2010-01-19 Thread Vilaythong Southavilay (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1488?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12802568#action_12802568 ] Vilaythong Southavilay commented on LUCENE-1488: I am developing an IR sys

NRT and IndexSearcher performance

2010-01-19 Thread jchang
The javadocs for IndexSearcher in Lucene 3.0.0 read: "For performance reasons it is recommended to open only one IndexSearcher and use it for all of your searches." However, to use NRT, it seems I have to do this for every search, which contradicts the advice above: IndexSearcher myIndexSear

[jira] Commented: (LUCENE-1488) multilingual analyzer based on icu

2010-01-19 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1488?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12802596#action_12802596 ] Robert Muir commented on LUCENE-1488: - Thanks for sharing those results! Yes the bigra

[jira] Updated: (LUCENE-2205) Rework of the TermInfosReader class to remove the Terms[], TermInfos[], and the index pointer long[] and create a more memory efficient data structure.

2010-01-19 Thread Aaron McCurry (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2205?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Aaron McCurry updated LUCENE-2205: -- Attachment: TermInfosReaderIndexDefault.java TermInfosReaderIndex.java

[jira] Updated: (LUCENE-2205) Rework of the TermInfosReader class to remove the Terms[], TermInfos[], and the index pointer long[] and create a more memory efficient data structure.

2010-01-19 Thread Aaron McCurry (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2205?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Aaron McCurry updated LUCENE-2205: -- Attachment: TermInfosReaderIndexSmall.java Here's the last file. I have also back patched 3.0

[jira] Issue Comment Edited: (LUCENE-2205) Rework of the TermInfosReader class to remove the Terms[], TermInfos[], and the index pointer long[] and create a more memory efficient data structure.

2010-01-19 Thread Aaron McCurry (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2205?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12802632#action_12802632 ] Aaron McCurry edited comment on LUCENE-2205 at 1/20/10 2:57 AM:

Re: NRT and IndexSearcher performance

2010-01-19 Thread Jason Rutherglen
J, The javadocs are illustrating there's no need to create new IndexSearchers for each query. Jason On Tue, Jan 19, 2010 at 5:04 PM, jchang wrote: > > The javadocs for IndexSearcher in Lucene 3.0.0 read:  "For performance > reasons it is recommended to open only one IndexSearcher and use it for

Re: NRT and IndexSearcher performance

2010-01-19 Thread John Wang
I think the question here really is the cost of creating new IndexReader instances per query. Calling IndexWriter.getReader() for each query has shown to be expensive from our benchmark and previous discussions. -John On Tue, Jan 19, 2010 at 8:12 PM, Jason Rutherglen < jason.rutherg...@gmail.com