Re: lucene and solr trunk

2010-03-17 Thread Ian Holsman
what other libraries do is have a 'core' or a 'common' bit.. which is what the lucene library really is. looking at http://svn.apache.org/repos/asf/lucene/ today I see that nearly, but it's called 'java'. maybe just renaming 'java' to 'core' or 'common' (hadoop uses common) might make sense a

[jira] Created: (LUCENE-2329) Use parallel arrays instead of PostingList objects

2010-03-17 Thread Michael Busch (JIRA)
Use parallel arrays instead of PostingList objects -- Key: LUCENE-2329 URL: https://issues.apache.org/jira/browse/LUCENE-2329 Project: Lucene - Java Issue Type: Improvement Components

Re: #lucene IRC log [was: RE: lucene and solr trunk]

2010-03-17 Thread Ian Holsman
+1 I'd like to see the IRC logs added to things like http://search-lucene.com/ and http://www.lucidimagination.com/search/?q=IRC&Search=Search while it might not be great for decision making.. it is amazing for helping debug comm

[jira] Commented: (LUCENE-2323) reorganize contrib modules

2010-03-17 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2323?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12846752#action_12846752 ] Robert Muir commented on LUCENE-2323: - bq. But I don't think we're talking about massi

[jira] Commented: (LUCENE-2323) reorganize contrib modules

2010-03-17 Thread Shai Erera (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2323?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12846744#action_12846744 ] Shai Erera commented on LUCENE-2323: Robert - I think that's exactly what I was propos

[jira] Reopened: (LUCENE-2326) Remove SVN.exe and revision numbers from build.xml by svn-copy the backwards branch and linking snowball tests by svn:externals

2010-03-17 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2326?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Muir reopened LUCENE-2326: - This use of svn:externals causes a problem for snowball, it does not always fetch the correct revis

[jira] Commented: (LUCENE-2323) reorganize contrib modules

2010-03-17 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2323?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12846719#action_12846719 ] Robert Muir commented on LUCENE-2323: - bq. it could be that I thought it was a really

[jira] Commented: (LUCENE-2323) reorganize contrib modules

2010-03-17 Thread Hoss Man (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2323?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12846712#action_12846712 ] Hoss Man commented on LUCENE-2323: -- bq. I didn't know this was the goal, if what you say

[jira] Commented: (LUCENE-2323) reorganize contrib modules

2010-03-17 Thread Mark Miller (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2323?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12846711#action_12846711 ] Mark Miller commented on LUCENE-2323: - This reorg is a great step for contrib

[jira] Commented: (LUCENE-2323) reorganize contrib modules

2010-03-17 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2323?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12846709#action_12846709 ] Robert Muir commented on LUCENE-2323: - {quote} agreed ... IIRC the idea in this discus

[jira] Commented: (LUCENE-2323) reorganize contrib modules

2010-03-17 Thread Hoss Man (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2323?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12846707#action_12846707 ] Hoss Man commented on LUCENE-2323: -- bq. Perhaps I want to refactor some code among our 7

[jira] Commented: (LUCENE-2323) reorganize contrib modules

2010-03-17 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2323?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12846702#action_12846702 ] Robert Muir commented on LUCENE-2323: - Hoss Man, the only problem I have with what you

[jira] Commented: (LUCENE-2323) reorganize contrib modules

2010-03-17 Thread Hoss Man (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2323?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12846700#action_12846700 ] Hoss Man commented on LUCENE-2323: -- I personally don't have a strong opinion on this, but

Re: lucene and solr trunk

2010-03-17 Thread Chris Hostetter
: build and nicely gets all dependencies to Lucene and Tika whenever I build : or release, no problem there and certainly no need to have it merged into : Lucene's svn! The key distinction is that Solr is already in "Lucene's svn" -- The question is how to reorg things in a way that makes it easier

[jira] Commented: (LUCENE-2320) Add MergePolicy to IndexWriterConfig

2010-03-17 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2320?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12846665#action_12846665 ] Michael McCandless commented on LUCENE-2320: Shai this patch looks good -- tha

[jira] Commented: (LUCENE-2320) Add MergePolicy to IndexWriterConfig

2010-03-17 Thread Mark Miller (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2320?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12846622#action_12846622 ] Mark Miller commented on LUCENE-2320: - +1 - I've had to do this in the past too. Just

[jira] Commented: (LUCENE-2320) Add MergePolicy to IndexWriterConfig

2010-03-17 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2320?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12846621#action_12846621 ] Michael McCandless commented on LUCENE-2320: I think it's OK to add stubs to s

[jira] Commented: (LUCENE-2323) reorganize contrib modules

2010-03-17 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2323?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12846620#action_12846620 ] Robert Muir commented on LUCENE-2323: - bq. I do want to propose to omit the component

[jira] Commented: (LUCENE-2320) Add MergePolicy to IndexWriterConfig

2010-03-17 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2320?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12846619#action_12846619 ] Uwe Schindler commented on LUCENE-2320: --- In that case just remove the test in backwa

[jira] Commented: (LUCENE-2320) Add MergePolicy to IndexWriterConfig

2010-03-17 Thread Shai Erera (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2320?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12846615#action_12846615 ] Shai Erera commented on LUCENE-2320: Uwe, I'm pretty familiar w/ how backwards goes ..

[jira] Commented: (LUCENE-2320) Add MergePolicy to IndexWriterConfig

2010-03-17 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2320?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12846614#action_12846614 ] Uwe Schindler commented on LUCENE-2320: --- It's normally not the idea of backwards test

[jira] Updated: (LUCENE-2320) Add MergePolicy to IndexWriterConfig

2010-03-17 Thread Shai Erera (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2320?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shai Erera updated LUCENE-2320: --- Attachment: LUCENE-2320.patch Sorry ... I generated the patch on the wrong backwards folder (the one

[jira] Commented: (LUCENE-2323) reorganize contrib modules

2010-03-17 Thread Kay Kay (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2323?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12846606#action_12846606 ] Kay Kay commented on LUCENE-2323: - When we talk about reorganization - it would be useful

[jira] Commented: (LUCENE-2312) Search on IndexWriter's RAM Buffer

2010-03-17 Thread Jason Rutherglen (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2312?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12846604#action_12846604 ] Jason Rutherglen commented on LUCENE-2312: -- Previously there was a discussion abo

[jira] Commented: (LUCENE-2323) reorganize contrib modules

2010-03-17 Thread Shai Erera (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2323?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12846602#action_12846602 ] Shai Erera commented on LUCENE-2323: Robert - I think that's great reorganization. I

[jira] Updated: (LUCENE-2320) Add MergePolicy to IndexWriterConfig

2010-03-17 Thread Shai Erera (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2320?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shai Erera updated LUCENE-2320: --- Attachment: LUCENE-2320.patch Updating to the latest revision. This should be ok now. > Add MergePo

[jira] Updated: (LUCENE-2324) Per thread DocumentsWriters that write their own private segments

2010-03-17 Thread Michael Busch (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2324?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Busch updated LUCENE-2324: -- Attachment: lucene-2324-no-pooling.patch All tests pass but I have to review if with the chang

[jira] Commented: (LUCENE-2324) Per thread DocumentsWriters that write their own private segments

2010-03-17 Thread Michael Busch (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2324?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12846586#action_12846586 ] Michael Busch commented on LUCENE-2324: --- bq. Michael, Agreed, can you outline how yo

[jira] Updated: (LUCENE-2320) Add MergePolicy to IndexWriterConfig

2010-03-17 Thread Shai Erera (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2320?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shai Erera updated LUCENE-2320: --- Attachment: LUCENE-2320.patch Attached patch w/ removing the IW-related ctors from MPs, as well as f

Re: IndexWriter.synced field accumulates data

2010-03-17 Thread Gregor Kaczor
Follow-up in https://issues.apache.org/jira/browse/LUCENE-2328 Original Message > Date: Wed, 17 Mar 2010 14:30:25 -0500 > From: Michael McCandless > To: java-dev@lucene.apache.org > Subject: Re: IndexWriter.synced field accumulates data > You're right! > > Really we should

[jira] Created: (LUCENE-2328) IndexWriter.synced field accumulates data leading to a Memory Leak

2010-03-17 Thread Gregor Kaczor (JIRA)
IndexWriter.synced field accumulates data leading to a Memory Leak --- Key: LUCENE-2328 URL: https://issues.apache.org/jira/browse/LUCENE-2328 Project: Lucene - Java Issue Type

Re: IndexWriter.synced field accumulates data

2010-03-17 Thread Gregor Kaczor
I will open an issue. Actually it's not the size of occupied RAM. The leak is the problem. Original Message > Date: Wed, 17 Mar 2010 14:30:25 -0500 > From: Michael McCandless > To: java-dev@lucene.apache.org > Subject: Re: IndexWriter.synced field accumulates data > You're rig

[jira] Commented: (LUCENE-2312) Search on IndexWriter's RAM Buffer

2010-03-17 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2312?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12846546#action_12846546 ] Michael McCandless commented on LUCENE-2312: intUptoStart is used in THPF.writ

Re: IndexWriter.synced field accumulates data

2010-03-17 Thread Michael McCandless
You're right! Really we should delete from sync'd when we delete the files. We need to tie into IndexFileDeleter for that, maybe moving this set into there. Though in practice the amount of actual RAM used should rarely be an issue? But we should fix it... Can you open an issue? Mike On Wed,
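The idea in the message above (drop a file name from the synced set at the moment the file is deleted) can be sketched roughly as follows. This is only an illustration under stated assumptions: SyncedFileTracker, markSynced and filesDeleted are hypothetical names invented here, not Lucene classes or methods, and the actual fix tracked in LUCENE-2328 may look quite different.

```java
import java.util.Collection;
import java.util.Collections;
import java.util.HashSet;
import java.util.Set;

/**
 * Hypothetical stand-in for the bookkeeping discussed in the thread.
 * It is NOT a Lucene class; it only illustrates removing names from
 * the "synced" set when the corresponding files are deleted.
 */
public class SyncedFileTracker {
    // Names of files that have already been synced to stable storage.
    private final Set<String> synced = new HashSet<>();

    /** Record that a file has been synced. */
    public synchronized void markSynced(String fileName) {
        synced.add(fileName);
    }

    /** True if the file is currently recorded as synced. */
    public synchronized boolean isSynced(String fileName) {
        return synced.contains(fileName);
    }

    /**
     * Hook to be called by whatever component deletes index files
     * (assumed here; the thread suggests tying into IndexFileDeleter).
     * Without this call the set only ever grows.
     */
    public synchronized void filesDeleted(Collection<String> fileNames) {
        synced.removeAll(fileNames);
    }

    /** Number of names currently tracked. */
    public synchronized int size() {
        return synced.size();
    }

    public static void main(String[] args) {
        SyncedFileTracker tracker = new SyncedFileTracker();
        // Mimic the reported workload: many index/delete cycles.
        for (int i = 0; i < 60000; i++) {
            String name = "_" + i + ".cfs"; // made-up file name
            tracker.markSynced(name);
            tracker.filesDeleted(Collections.singletonList(name));
        }
        // prints "tracked after 60000 cycles: 0"
        System.out.println("tracked after 60000 cycles: " + tracker.size());
    }
}
```

If the filesDeleted call is left out, the set holds one entry per file ever synced, which matches the slow accumulation described in the original report.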

[jira] Commented: (LUCENE-2323) reorganize contrib modules

2010-03-17 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2323?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12846543#action_12846543 ] Michael McCandless commented on LUCENE-2323: bq. Here are my initial thoughts

[jira] Commented: (LUCENE-2305) Introduce Version in more places long before 4.0

2010-03-17 Thread Mark Miller (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2305?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12846542#action_12846542 ] Mark Miller commented on LUCENE-2305: - Ah, yes - I didn't remember your comment right:

[jira] Commented: (LUCENE-2312) Search on IndexWriter's RAM Buffer

2010-03-17 Thread Jason Rutherglen (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2312?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12846540#action_12846540 ] Jason Rutherglen commented on LUCENE-2312: -- I think the DW index reader needs to

[jira] Commented: (LUCENE-2305) Introduce Version in more places long before 4.0

2010-03-17 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2305?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12846537#action_12846537 ] Michael McCandless commented on LUCENE-2305: Hmm... I think true is likely the

[jira] Commented: (LUCENE-2312) Search on IndexWriter's RAM Buffer

2010-03-17 Thread Jason Rutherglen (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2312?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12846526#action_12846526 ] Jason Rutherglen commented on LUCENE-2312: -- Mike, can you clarify why intUptos an

[jira] Commented: (LUCENE-2323) reorganize contrib modules

2010-03-17 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2323?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12846522#action_12846522 ] Robert Muir commented on LUCENE-2323: - Here are my initial thoughts on this. I don't

IndexWriter.synced field accumulates data

2010-03-17 Thread Gregor Kaczor
I am running into a strange OutOfMemoryError. My small test application indexes and deletes a few files. This is repeated 60k times. Optimization is run after every 2k indexed files. Index size is 50KB. I analyzed the heap dump file and realized that the IndexWriter.synced field

[jira] Created: (LUCENE-2327) IndexOutOfBoundsException in FieldInfos.java

2010-03-17 Thread Shane (JIRA)
IndexOutOfBoundsException in FieldInfos.java Key: LUCENE-2327 URL: https://issues.apache.org/jira/browse/LUCENE-2327 Project: Lucene - Java Issue Type: Bug Components: Index Affects

[jira] Commented: (LUCENE-2305) Introduce Version in more places long before 4.0

2010-03-17 Thread Mark Miller (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2305?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12846462#action_12846462 ] Mark Miller commented on LUCENE-2305: - Hmm - if I remember right, this is one I brough

[jira] Commented: (LUCENE-2305) Introduce Version in more places long before 4.0

2010-03-17 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2305?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12846461#action_12846461 ] Michael McCandless commented on LUCENE-2305: Sigh, yes, adding Version to MP m

[jira] Commented: (LUCENE-2305) Introduce Version in more places long before 4.0

2010-03-17 Thread Shai Erera (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2305?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12846455#action_12846455 ] Shai Erera commented on LUCENE-2305: While working on LUCENE-2320, I've noticed these

[jira] Commented: (LUCENE-2320) Add MergePolicy to IndexWriterConfig

2010-03-17 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2320?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12846451#action_12846451 ] Michael McCandless commented on LUCENE-2320: Patch looks good Shai! I'd rathe

Re: Proposed: New logged IRC channel: #lucene_dev

2010-03-17 Thread Shai Erera
I personally prefer that discussions happen on the list/JIRA. It is ok if some discussions outside these two come in to list/JIRA as a thread/issue. Such discussions are not limited to only IRC, but also a phone call, email, ApacheCon etc. However once this has been raised w/ the community, then an

Proposed: New logged IRC channel: #lucene_dev

2010-03-17 Thread Steven A Rowe
As I mentioned in another thread on this list, I'm interested in setting up a permanent, linkable-to archive (a.k.a. log) for the lucene IRC channel. On #lucene, some devs don't want to be logged, and so will not participate on a logged IRC channel. Other devs want logging, to be able to point

[jira] Updated: (LUCENE-2280) IndexWriter.optimize() throws NullPointerException

2010-03-17 Thread Ritesh Nigam (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2280?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ritesh Nigam updated LUCENE-2280: - Attachment: lucene.zip Lucene infostream log file. > IndexWriter.optimize() throws NullPointerE

[jira] Commented: (LUCENE-2280) IndexWriter.optimize() throws NullPointerException

2010-03-17 Thread Ritesh Nigam (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2280?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12846408#action_12846408 ] Ritesh Nigam commented on LUCENE-2280: -- Yesterday the search indexer crashed again for my

[jira] Resolved: (LUCENE-2326) Remove SVN.exe and revision numbers from build.xml by svn-copy the backwards branch and linking snowball tests by svn:externals

2010-03-17 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2326?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe Schindler resolved LUCENE-2326. --- Resolution: Fixed Lucene Fields: [New, Patch Available] (was: [New]) Committed revis

Re: lucene and solr trunk

2010-03-17 Thread Stefan Trcek
On Tuesday 16 March 2010 14:12:20 Mark Miller wrote: > On 03/16/2010 09:05 AM, Andrzej Bialecki wrote: > > > > You could have used git instead. There is a good integration > > between git and svn, and it's much easier (a giant > > understatement...) to handle branching and merging in git, both > >

[jira] Commented: (LUCENE-2326) Remove SVN.exe and revision numbers from build.xml by svn-copy the backwards branch and linking snowball tests by svn:externals

2010-03-17 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2326?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12846362#action_12846362 ] Uwe Schindler commented on LUCENE-2326: --- Will commit soon to trunk and merge to flex

[jira] Updated: (LUCENE-2326) Remove SVN.exe and revision numbers from build.xml by svn-copy the backwards branch and linking snowball tests by svn:externals

2010-03-17 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2326?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe Schindler updated LUCENE-2326: -- Attachment: LUCENE-2326.patch New patch, which has some optimizations. It now also allows to r

Re: lucene and solr trunk

2010-03-17 Thread Earwin Burrfoot
Some of these people got traumatized by maven, now they can only think in terms of "mash everything together and sprinkle with hand-downloaded dependency jars". No offence : ) I, personally, prefer side-by-side layouts. You can add new stuff, and wire dependencies to the old one, without reorganiz

Re: lucene and solr trunk

2010-03-17 Thread Wouter Heijke
I'm just a surprised observer who doesn't seem to get all the trouble and need for this svn merge. I have my own private Solr-like framework around Lucene. It uses maven to build and nicely gets all dependencies to Lucene and Tika whenever I build or release, no problem there and certainly no ne