[jira] Created: (LUCENENET-372) NLS pack for Lucene.NET: BR, CJK, CN, CZ, DE, FR, NL, RU analyzers

2010-07-21 Thread Pasha Bizhan (JIRA)
NLS pack for Lucene.NET: BR, CJK, CN, CZ, DE, FR, NL, RU analyzers -- Key: LUCENENET-372 URL: https://issues.apache.org/jira/browse/LUCENENET-372 Project: Lucene.Net Issue

RE: API changes between 2.9.2 and 2.9.3

2010-07-21 Thread Thomas Koch
... I realize that PyLucene doesn't make that easy because it doesn't warn about deprecated API use. [Thomas Koch] Well this is a general drawback in Python as interpreted language I guess - wrong interfaces are only detected at runtime and are thus harder to test (unless you describe the

RE: API changes between 2.9.2 and 2.9.3

2010-07-21 Thread Andi Vajda
On Wed, 21 Jul 2010, Thomas Koch wrote: But I understand now that as long as you remove deprecated code from 2.9 it *should* work with 2.9 and 3.0 as well! Right? Correct. e.g. methodHits search(Query query) Is now deprecated as Hits will be removed in Lucene 3.0 2.9 already supports

Re: API changes between 2.9.2 and 2.9.3

2010-07-21 Thread Aric Coady
On Jul 21, 2010, at 12:18 AM, Thomas Koch wrote: The question remains if it's feasible to support 2.x *and* 3.x - as Bill mentioned ... I'd like to make it work on both. - me too. I did fear that this makes things much more complicated and you end up with code if

Re: API changes between 2.9.2 and 2.9.3

2010-07-21 Thread Bill Janssen
What's crashing with PyLucene 2.9.3 is this code: for field in x.getFields(): where x is an instance of org.apache.lucene.document.Document. I can print x and it looks OK, but an attempt to iterate over the list of fields seems broken. Is this another iterator change? Bill Thread 14

Re: API changes between 2.9.2 and 2.9.3

2010-07-21 Thread Andi Vajda
On Jul 21, 2010, at 19:59, Bill Janssen jans...@parc.com wrote: Bill Janssen jans...@parc.com wrote: What's crashing with PyLucene 2.9.3 is this code: for field in x.getFields(): where x is an instance of org.apache.lucene.document.Document. I can print x and it looks OK, but an

Re: API changes between 2.9.2 and 2.9.3

2010-07-21 Thread Andi Vajda
On Jul 21, 2010, at 20:38, Bill Janssen jans...@parc.com wrote: Bill Janssen jans...@parc.com wrote: Bill Janssen jans...@parc.com wrote: What's crashing with PyLucene 2.9.3 is this code: for field in x.getFields(): where x is an instance of org.apache.lucene.document.Document. I

Re: API changes between 2.9.2 and 2.9.3

2010-07-21 Thread Christian Heimes
Presumably that's no longer the case with JCC 2.6. Probably should be updated to whatever the current version does. Or perhaps versioned and checked into the source tree. It could be related to the --no-generics addition. Have you tried to recompile PyLucene with the option JCCFLAGS=?

Re: API changes between 2.9.2 and 2.9.3

2010-07-21 Thread Andi Vajda
On Jul 21, 2010, at 23:10, Bill Janssen jans...@parc.com wrote: Andi Vajda va...@apache.org wrote: On Jul 21, 2010, at 19:59, Bill Janssen jans...@parc.com wrote: Bill Janssen jans...@parc.com wrote: What's crashing with PyLucene 2.9.3 is this code: for field in x.getFields():

Re: API changes between 2.9.2 and 2.9.3

2010-07-21 Thread Bill Janssen
Andi Vajda va...@apache.org wrote: Porting your stuff to 3.0 is thus highly recommended instead of complaining about broken (my bad) long- deprecated APIs. Hey, take 2.9.3 down, and announce no further pylucene support for 2.x, and I'll stop talking about it. Bill

[jira] Updated: (LUCENE-2542) TopDocsCollector should be abstract super class that is the real TopDocsCollector contract, a subclass should implement the priority-queue logic. e.g. PQTopDocsCollecto

2010-07-21 Thread Shai Erera (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2542?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shai Erera updated LUCENE-2542: --- Fix Version/s: 3.1 Affects Version/s: (was: 3.0.2) Lucene Fields: [New, Patch

[jira] Commented: (LUCENE-2542) TopDocsCollector should be abstract super class that is the real TopDocsCollector contract, a subclass should implement the priority-queue logic. e.g. PQTopDocsCollec

2010-07-21 Thread Shai Erera (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2542?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12890575#action_12890575 ] Shai Erera commented on LUCENE-2542: Patch looks good. I think it's ready to commit,

[jira] Commented: (LUCENE-2514) Change Term to use bytes

2010-07-21 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2514?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12890630#action_12890630 ] Michael McCandless commented on LUCENE-2514: +1 to commit This would also

[jira] Commented: (LUCENE-1799) Unicode compression

2010-07-21 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1799?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12890628#action_12890628 ] Michael McCandless commented on LUCENE-1799: This is fabulous! And a great

Re: Sequence IDs for NRT deletes

2010-07-21 Thread Michael McCandless
On Tue, Jul 20, 2010 at 4:21 PM, Jason Rutherglen jason.rutherg...@gmail.com wrote: Right, much less GC if app frequently reopens.  But a 32X increase in RAM usage is not trivial; I think we shouldn't enable it by default? Right, the RAM usage is quite high!  Is there a more compact

[jira] Updated: (LUCENE-2324) Per thread DocumentsWriters that write their own private segments

2010-07-21 Thread Michael Busch (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2324?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Busch updated LUCENE-2324: -- Attachment: lucene-2324.patch Finally a new version of the patch! (Sorry for keeping you guys

[jira] Commented: (LUCENE-2324) Per thread DocumentsWriters that write their own private segments

2010-07-21 Thread Michael Busch (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2324?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12890643#action_12890643 ] Michael Busch commented on LUCENE-2324: --- OK, I committed to the branch. I'll try

[jira] Commented: (LUCENE-2514) Change Term to use bytes

2010-07-21 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2514?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12890646#action_12890646 ] Robert Muir commented on LUCENE-2514: - bq. This would also mean the BOCU-1 encoding

[jira] Commented: (LUCENE-1799) Unicode compression

2010-07-21 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1799?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12890654#action_12890654 ] Robert Muir commented on LUCENE-1799: - bq. You can use any Charset to encode your

[jira] Commented: (LUCENE-1799) Unicode compression

2010-07-21 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1799?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12890703#action_12890703 ] Michael McCandless commented on LUCENE-1799: Is there any reason not to make

[jira] Commented: (LUCENE-1799) Unicode compression

2010-07-21 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1799?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12890706#action_12890706 ] Robert Muir commented on LUCENE-1799: - bq. Is there any reason not to make BOCU-1

[jira] Commented: (LUCENE-1799) Unicode compression

2010-07-21 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1799?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12890718#action_12890718 ] Robert Muir commented on LUCENE-1799: - bq. But... ICU's license is compatible w/ ASL

[jira] Updated: (SOLR-1804) Upgrade Carrot2 to 3.2.0

2010-07-21 Thread Stanislaw Osinski (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1804?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Stanislaw Osinski updated SOLR-1804: Attachment: SOLR-1804-carrot2-3.4.0-dev.patch Hi, As we're near the 3.4.0 release of

[jira] Updated: (SOLR-1804) Upgrade Carrot2 to 3.2.0

2010-07-21 Thread Stanislaw Osinski (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1804?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Stanislaw Osinski updated SOLR-1804: Attachment: SOLR-1804-carrot2-3.4.0-dev-libs.zip Libs required for the Carrot2 3.4.0

[jira] Commented: (SOLR-1240) Numerical Range faceting

2010-07-21 Thread Yonik Seeley (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1240?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12890728#action_12890728 ] Yonik Seeley commented on SOLR-1240: Thanks for the example, makes it so much easier to

[jira] Commented: (LUCENE-2514) Change Term to use bytes

2010-07-21 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2514?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12890740#action_12890740 ] Robert Muir commented on LUCENE-2514: - Committed LUCENE-2514_qp.patch revision 966254

Re: API changes between 2.9.2 and 2.9.3

2010-07-21 Thread Bill Janssen
Thomas Koch k...@orbiteam.de wrote: ... I realize that PyLucene doesn't make that easy because it doesn't warn about deprecated API use. [Thomas Koch] Well this is a general drawback in Python as interpreted language I guess - wrong interfaces are only detected at runtime and are thus

[jira] Commented: (SOLR-1804) Upgrade Carrot2 to 3.2.0

2010-07-21 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1804?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12890748#action_12890748 ] Robert Muir commented on SOLR-1804: --- Hi Stanislaw: this looks cool! So, carrot2 jars don't

[jira] Commented: (SOLR-1804) Upgrade Carrot2 to 3.2.0

2010-07-21 Thread Stanislaw Osinski (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1804?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12890757#action_12890757 ] Stanislaw Osinski commented on SOLR-1804: - {quote} Hi Stanislaw: this looks cool!

Re: API changes between 2.9.2 and 2.9.3

2010-07-21 Thread Bill Janssen
I'm going back to 2.9.2 :-). For some reason, 2.9.2 installs JCC 2.4.1. Is that right? Shouldn't it be 2.5.1? Bill holmes : /tmp/pylucene-2.9.2-1/jcc 99 % sudo python setup.py install sudo python setup.py install running install running bdist_egg running egg_info writing

[jira] Commented: (LUCENE-2324) Per thread DocumentsWriters that write their own private segments

2010-07-21 Thread Jason Rutherglen (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2324?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12890789#action_12890789 ] Jason Rutherglen commented on LUCENE-2324: -- Michael, thanks for posting and

[jira] Commented: (LUCENE-2324) Per thread DocumentsWriters that write their own private segments

2010-07-21 Thread Jason Rutherglen (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2324?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12890790#action_12890790 ] Jason Rutherglen commented on LUCENE-2324: -- We need to update the indexing chain

Build failed in Hudson: Lucene-trunk #1244

2010-07-21 Thread Apache Hudson Server
See http://hudson.zones.apache.org/hudson/job/Lucene-trunk/1244/changes Changes: [rmuir] LUCENE-2514: consume tokenstreams in QP like the indexer: dont create intermediate string [uschindler] revert accidental commit by buschmi [buschmi] LUCENE-2324: Committing second version of the patch to

[jira] Commented: (LUCENE-2324) Per thread DocumentsWriters that write their own private segments

2010-07-21 Thread Michael Busch (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2324?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12890812#action_12890812 ] Michael Busch commented on LUCENE-2324: --- {quote} We need to update the indexing

Re: API changes between 2.9.2 and 2.9.3

2010-07-21 Thread Bill Janssen
Bill Janssen jans...@parc.com wrote: Bill Janssen jans...@parc.com wrote: What's crashing with PyLucene 2.9.3 is this code: for field in x.getFields(): where x is an instance of org.apache.lucene.document.Document. I can print x and it looks OK, but an attempt to iterate

[jira] Resolved: (LUCENE-2542) TopDocsCollector should be abstract super class that is the real TopDocsCollector contract, a subclass should implement the priority-queue logic. e.g. PQTopDocsCollect

2010-07-21 Thread Yonik Seeley (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2542?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yonik Seeley resolved LUCENE-2542. -- Resolution: Fixed committed. TopDocsCollector should be abstract super class that is the

Hudson build is back to normal : Solr-3.x #66

2010-07-21 Thread Apache Hudson Server
See http://hudson.zones.apache.org/hudson/job/Solr-3.x/66/changes - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org

[jira] Commented: (SOLR-1804) Upgrade Carrot2 to 3.2.0

2010-07-21 Thread Stanislaw Osinski (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1804?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12890848#action_12890848 ] Stanislaw Osinski commented on SOLR-1804: - {quote} Essentially, the dependency on

Hudson build is back to normal : Solr-trunk #1208

2010-07-21 Thread Apache Hudson Server
See http://hudson.zones.apache.org/hudson/job/Solr-trunk/1208/changes - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org

[jira] Commented: (LUCENE-2324) Per thread DocumentsWriters that write their own private segments

2010-07-21 Thread Jason Rutherglen (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2324?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12890893#action_12890893 ] Jason Rutherglen commented on LUCENE-2324: -- Looks like we're not using

Re: API changes between 2.9.2 and 2.9.3

2010-07-21 Thread Bill Janssen
Andi Vajda va...@apache.org wrote: On Jul 21, 2010, at 19:59, Bill Janssen jans...@parc.com wrote: Bill Janssen jans...@parc.com wrote: What's crashing with PyLucene 2.9.3 is this code: for field in x.getFields(): where x is an instance of

[jira] Commented: (LUCENE-2324) Per thread DocumentsWriters that write their own private segments

2010-07-21 Thread Jason Rutherglen (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2324?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12890905#action_12890905 ] Jason Rutherglen commented on LUCENE-2324: -- {quote}Implement logic to discard

[jira] Commented: (LUCENE-2312) Search on IndexWriter's RAM Buffer

2010-07-21 Thread Jason Rutherglen (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2312?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12890909#action_12890909 ] Jason Rutherglen commented on LUCENE-2312: -- We need to fill in the blanks on the

Re: Sequence IDs for NRT deletes

2010-07-21 Thread Jason Rutherglen
long[] is probably safe Yeah it's safe for most things... short[] That could be a much better option for minimizing RAM usage, and then implement wraparound. On Wed, Jul 21, 2010 at 3:12 AM, Michael McCandless luc...@mikemccandless.com wrote: On Tue, Jul 20, 2010 at 4:21 PM, Jason

[jira] Commented: (LUCENE-2346) Explore other in-memory postinglist formats for realtime search

2010-07-21 Thread Jason Rutherglen (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2346?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12890915#action_12890915 ] Jason Rutherglen commented on LUCENE-2346: -- Are there any additional thoughts on

Hudson build is back to normal : Lucene-3.x #72

2010-07-21 Thread Apache Hudson Server
See http://hudson.zones.apache.org/hudson/job/Lucene-3.x/72/ - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org

[jira] Updated: (LUCENE-2312) Search on IndexWriter's RAM Buffer

2010-07-21 Thread Michael Busch (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2312?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Busch updated LUCENE-2312: -- Fix Version/s: Realtime Branch (was: 4.0) Search on IndexWriter's RAM

[jira] Updated: (LUCENE-2324) Per thread DocumentsWriters that write their own private segments

2010-07-21 Thread Michael Busch (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2324?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Busch updated LUCENE-2324: -- Fix Version/s: Realtime Branch (was: 4.0) Per thread DocumentsWriters

[jira] Updated: (LUCENE-2346) Explore other in-memory postinglist formats for realtime search

2010-07-21 Thread Michael Busch (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2346?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Busch updated LUCENE-2346: -- Fix Version/s: Realtime Branch (was: 4.0) Explore other in-memory

Build failed in Hudson: Lucene-trunk #1245

2010-07-21 Thread Apache Hudson Server
See http://hudson.zones.apache.org/hudson/job/Lucene-trunk/1245/changes Changes: [yonik] LUCENE-2542: remove final from some TopDocsCollector methods -- [...truncated 2706 lines...] [junit] Testsuite: org.apache.lucene.search.TestPrefixQuery