[jira] Commented: (SOLR-1731) ArrayIndexOutOfBoundsException when highlighting

2010-07-21 Thread Leonhard Maylein (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1731?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12891018#action_12891018 ] Leonhard Maylein commented on SOLR-1731: We have the same problem whenever we search

Build failed in Hudson: Lucene-trunk #1245

2010-07-21 Thread Apache Hudson Server
See Changes: [yonik] LUCENE-2542: remove final from some TopDocsCollector methods -- [...truncated 2706 lines...] [junit] Testsuite: org.apache.lucene.search.TestPrefixQuery [jun

[jira] Updated: (LUCENE-2346) Explore other in-memory postinglist formats for realtime search

2010-07-21 Thread Michael Busch (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2346?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Busch updated LUCENE-2346: -- Fix Version/s: Realtime Branch (was: 4.0) > Explore other in-memory pos

[jira] Updated: (LUCENE-2324) Per thread DocumentsWriters that write their own private segments

2010-07-21 Thread Michael Busch (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2324?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Busch updated LUCENE-2324: -- Fix Version/s: Realtime Branch (was: 4.0) > Per thread DocumentsWriters

[jira] Updated: (LUCENE-2312) Search on IndexWriter's RAM Buffer

2010-07-21 Thread Michael Busch (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2312?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Busch updated LUCENE-2312: -- Fix Version/s: Realtime Branch (was: 4.0) > Search on IndexWriter's RAM

Hudson build is back to normal : Lucene-3.x #72

2010-07-21 Thread Apache Hudson Server
See - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org

[jira] Commented: (LUCENE-2346) Explore other in-memory postinglist formats for realtime search

2010-07-21 Thread Jason Rutherglen (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2346?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12890915#action_12890915 ] Jason Rutherglen commented on LUCENE-2346: -- Are there any additional thoughts on

Re: Sequence IDs for NRT deletes

2010-07-21 Thread Jason Rutherglen
> long[] is probably safe Yeah it's safe for most things... > short[] That could be a much better option for minimizing RAM usage, and then implement wraparound. On Wed, Jul 21, 2010 at 3:12 AM, Michael McCandless wrote: > On Tue, Jul 20, 2010 at 4:21 PM, Jason Rutherglen > wrote: >>> Right,

[jira] Commented: (LUCENE-2312) Search on IndexWriter's RAM Buffer

2010-07-21 Thread Jason Rutherglen (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2312?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12890909#action_12890909 ] Jason Rutherglen commented on LUCENE-2312: -- We need to fill in the blanks on the

[jira] Commented: (LUCENE-2324) Per thread DocumentsWriters that write their own private segments

2010-07-21 Thread Jason Rutherglen (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2324?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12890905#action_12890905 ] Jason Rutherglen commented on LUCENE-2324: -- {quote}Implement logic to discard del

Re: API changes between 2.9.2 and 2.9.3

2010-07-21 Thread Bill Janssen
Andi Vajda wrote: > > On Jul 21, 2010, at 19:59, Bill Janssen wrote: > > > Bill Janssen wrote: > > > >> What's crashing with PyLucene 2.9.3 is this code: > >> > >> for field in x.getFields(): > >> > >> where "x" is an instance of org.apache.lucene.document.Document. I > >> can > >> print

[jira] Commented: (LUCENE-2324) Per thread DocumentsWriters that write their own private segments

2010-07-21 Thread Jason Rutherglen (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2324?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12890893#action_12890893 ] Jason Rutherglen commented on LUCENE-2324: -- Looks like we're not using MergeDocID

Hudson build is back to normal : Solr-trunk #1208

2010-07-21 Thread Apache Hudson Server
See - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org

[jira] Commented: (SOLR-1804) Upgrade Carrot2 to 3.2.0

2010-07-21 Thread Stanislaw Osinski (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1804?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12890848#action_12890848 ] Stanislaw Osinski commented on SOLR-1804: - {quote} Essentially, the dependency on th

Hudson build is back to normal : Solr-3.x #66

2010-07-21 Thread Apache Hudson Server
See - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org

[jira] Resolved: (LUCENE-2542) TopDocsCollector should be abstract super class that is the real "TopDocsCollector" contract, a subclass should implement the priority-queue logic. e.g. PQTopDocsCollect

2010-07-21 Thread Yonik Seeley (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2542?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yonik Seeley resolved LUCENE-2542. -- Resolution: Fixed committed. > TopDocsCollector should be abstract super class that is the re

Re: API changes between 2.9.2 and 2.9.3

2010-07-21 Thread Bill Janssen
Bill Janssen wrote: > Bill Janssen wrote: > > > What's crashing with PyLucene 2.9.3 is this code: > > > > for field in x.getFields(): > > > > where "x" is an instance of org.apache.lucene.document.Document. I can > > print x and it looks OK, but an attempt to iterate over the list of >

[jira] Commented: (LUCENE-2324) Per thread DocumentsWriters that write their own private segments

2010-07-21 Thread Michael Busch (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2324?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12890812#action_12890812 ] Michael Busch commented on LUCENE-2324: --- {quote} We need to update the indexing chai

Build failed in Hudson: Lucene-trunk #1244

2010-07-21 Thread Apache Hudson Server
See Changes: [rmuir] LUCENE-2514: consume tokenstreams in QP like the indexer: dont create intermediate string [uschindler] revert accidental commit by buschmi [buschmi] LUCENE-2324: Committing second version of the patch to

[jira] Commented: (LUCENE-2324) Per thread DocumentsWriters that write their own private segments

2010-07-21 Thread Jason Rutherglen (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2324?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12890790#action_12890790 ] Jason Rutherglen commented on LUCENE-2324: -- We need to update the indexing chain

[jira] Commented: (LUCENE-2324) Per thread DocumentsWriters that write their own private segments

2010-07-21 Thread Jason Rutherglen (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2324?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12890789#action_12890789 ] Jason Rutherglen commented on LUCENE-2324: -- Michael, thanks for posting and commi

Re: API changes between 2.9.2 and 2.9.3

2010-07-21 Thread Bill Janssen
> I'm going back to 2.9.2 :-). For some reason, 2.9.2 installs JCC 2.4.1. Is that right? Shouldn't it be 2.5.1? Bill holmes : /tmp/pylucene-2.9.2-1/jcc 99 % sudo python setup.py install sudo python setup.py install running install running bdist_egg running egg_info writing JCC.egg-info/PKG-INF

[jira] Commented: (SOLR-1804) Upgrade Carrot2 to 3.2.0

2010-07-21 Thread Stanislaw Osinski (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1804?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12890757#action_12890757 ] Stanislaw Osinski commented on SOLR-1804: - {quote} Hi Stanislaw: this looks cool! So

[jira] Commented: (SOLR-1804) Upgrade Carrot2 to 3.2.0

2010-07-21 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1804?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12890748#action_12890748 ] Robert Muir commented on SOLR-1804: --- Hi Stanislaw: this looks cool! So, carrot2 jars don't

Re: API changes between 2.9.2 and 2.9.3

2010-07-21 Thread Bill Janssen
Thomas Koch wrote: > > ... > > I realize that PyLucene doesn't make that easy because it doesn't warn > > about deprecated API use. > > > [Thomas Koch] Well this is a general drawback in Python as interpreted > language I guess - wrong interfaces are only detected at runtime and are > thus harde

[jira] Commented: (LUCENE-2514) Change Term to use bytes

2010-07-21 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2514?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12890740#action_12890740 ] Robert Muir commented on LUCENE-2514: - Committed LUCENE-2514_qp.patch revision 966254

[jira] Commented: (SOLR-1240) Numerical Range faceting

2010-07-21 Thread Yonik Seeley (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1240?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12890728#action_12890728 ] Yonik Seeley commented on SOLR-1240: Thanks for the example, makes it so much easier to

[jira] Updated: (SOLR-1804) Upgrade Carrot2 to 3.2.0

2010-07-21 Thread Stanislaw Osinski (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1804?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Stanislaw Osinski updated SOLR-1804: Attachment: SOLR-1804-carrot2-3.4.0-dev-libs.zip Libs required for the Carrot2 3.4.0 update.

[jira] Updated: (SOLR-1804) Upgrade Carrot2 to 3.2.0

2010-07-21 Thread Stanislaw Osinski (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1804?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Stanislaw Osinski updated SOLR-1804: Attachment: SOLR-1804-carrot2-3.4.0-dev.patch Hi, As we're near the 3.4.0 release of Carrot

[jira] Commented: (LUCENE-1799) Unicode compression

2010-07-21 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1799?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12890718#action_12890718 ] Robert Muir commented on LUCENE-1799: - bq. But... ICU's license is compatible w/ ASL (

[jira] Commented: (LUCENE-1799) Unicode compression

2010-07-21 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1799?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12890709#action_12890709 ] Michael McCandless commented on LUCENE-1799: {quote} > Is there any reason no

[jira] Commented: (LUCENE-1799) Unicode compression

2010-07-21 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1799?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12890706#action_12890706 ] Robert Muir commented on LUCENE-1799: - bq. Is there any reason not to make BOCU-1 Luce

[jira] Commented: (LUCENE-1799) Unicode compression

2010-07-21 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1799?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12890703#action_12890703 ] Michael McCandless commented on LUCENE-1799: Is there any reason not to make B

[jira] Commented: (LUCENE-1799) Unicode compression

2010-07-21 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1799?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12890654#action_12890654 ] Robert Muir commented on LUCENE-1799: - bq. You can use any Charset to encode your term

[jira] Commented: (LUCENE-2514) Change Term to use bytes

2010-07-21 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2514?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12890646#action_12890646 ] Robert Muir commented on LUCENE-2514: - bq. This would also mean the BOCU-1 encoding co

[jira] Commented: (LUCENE-2324) Per thread DocumentsWriters that write their own private segments

2010-07-21 Thread Michael Busch (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2324?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12890643#action_12890643 ] Michael Busch commented on LUCENE-2324: --- OK, I committed to the branch. I'll try to

[jira] Updated: (LUCENE-2324) Per thread DocumentsWriters that write their own private segments

2010-07-21 Thread Michael Busch (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2324?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Busch updated LUCENE-2324: -- Attachment: lucene-2324.patch Finally a new version of the patch! (Sorry for keeping you guys

Re: Sequence IDs for NRT deletes

2010-07-21 Thread Michael McCandless
On Tue, Jul 20, 2010 at 4:21 PM, Jason Rutherglen wrote: >> Right, much less GC if app frequently reopens.  But a 32X increase in >> RAM usage is not trivial; I think we shouldn't enable it by default? > > Right, the RAM usage is quite high!  Is there a more compact > representation we could use?

[jira] Commented: (LUCENE-1799) Unicode compression

2010-07-21 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1799?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12890628#action_12890628 ] Michael McCandless commented on LUCENE-1799: This is fabulous! And a great ex

[jira] Commented: (LUCENE-2514) Change Term to use bytes

2010-07-21 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2514?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12890630#action_12890630 ] Michael McCandless commented on LUCENE-2514: +1 to commit This would also mea

Re: is there any resource for improve lucene index/search performance

2010-07-21 Thread Michael McCandless
Lucene's dev list and the issue tracking system is the place for ideas on improving indexing/search performance. We are always looking to improve performance. Switching to int mult, using bitmaps, both sound interesting :) Mike On Tue, Jul 20, 2010 at 10:59 PM, Li Li wrote: > Or where to find

[jira] Updated: (LUCENE-1799) Unicode compression

2010-07-21 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1799?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe Schindler updated LUCENE-1799: -- Attachment: LUCENE-1799.patch A new patch that completely separates the BOCU factory from the

[jira] Updated: (LUCENE-1799) Unicode compression

2010-07-21 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1799?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe Schindler updated LUCENE-1799: -- Attachment: LUCENE-1799.patch Here is a 100% legally valid implementation: - Linking to icu4j