RE: Lucene 2.3 RC 1 available for testing

2008-01-07 Thread Uwe Schindler
Hi Michael, I tried the new 2.3 version today for my package "panFMP" (www.panFMP.org). In principle it works, but the package checks on startup for the minimum requirements of used libraries (in my case Lucene 2.2). This is done by java.lang.Package.isCompatibleWith(). According to the Sun Java A
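For readers unfamiliar with this kind of startup check, here is a minimal sketch of how a minimum-version test might look. The class and method names (VersionCheck, isAtLeast, packageIsAtLeast) are hypothetical and are not taken from panFMP. The helper compares dotted numeric versions component by component, which is similar to what Package.isCompatibleWith() does with a package's specification version; a version string with non-numeric parts makes the comparison throw NumberFormatException.

```java
final class VersionCheck {
    private VersionCheck() {}

    /**
     * Compares two dotted numeric version strings component by component.
     * Missing components count as 0, so "2.2" and "2.2.0" are equal.
     * Throws NumberFormatException if a component is not a plain integer.
     */
    static boolean isAtLeast(String actual, String required) {
        String[] a = actual.split("\\.");
        String[] r = required.split("\\.");
        int n = Math.max(a.length, r.length);
        for (int i = 0; i < n; i++) {
            int av = i < a.length ? Integer.parseInt(a[i]) : 0;
            int rv = i < r.length ? Integer.parseInt(r[i]) : 0;
            if (av != rv) {
                return av > rv;
            }
        }
        return true;
    }

    /**
     * Guarded wrapper around Package.isCompatibleWith(). Returns false when
     * the package is unknown, carries no specification version in its
     * manifest, or the version string cannot be parsed.
     */
    static boolean packageIsAtLeast(Package pkg, String required) {
        if (pkg == null || pkg.getSpecificationVersion() == null) {
            return false;
        }
        try {
            return pkg.isCompatibleWith(required);
        } catch (NumberFormatException e) {
            return false;
        }
    }
}
```

A caller could pass the Package object of any class from the library in question, for example SomeLibraryClass.class.getPackage(), together with the minimum version string such as "2.2".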

FieldSortedHitQueue.fillFields() not visible

2008-01-07 Thread Antony Bowesman
I'm implementing a HitCollector to do sorting and will use FieldSortedHitQueue, but for some reason the fillFields() method is package private. Judging from the comments to the method, I don't need it, but if I do later on, I can't, unless of course I extend the class and copy the existing code

[jira] Commented: (LUCENE-566) Esperanto Analyzer

2008-01-07 Thread Hoss Man (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-566?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12556820#action_12556820 ] Hoss Man commented on LUCENE-566: - FWIW: the source repository moved, but was pretty easy t

[jira] Resolved: (LUCENE-566) Esperanto Analyzer

2008-01-07 Thread Michael Busch (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-566?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Busch resolved LUCENE-566. -- Resolution: Incomplete I'm resolving this issue because no code is attached and the provided li

Lucene 2.3 RC 1 available for testing

2008-01-07 Thread Michael Busch
Hi all, I just created the release artifacts (incl. maven artifacts) from the 2.3 branch and uploaded the files to http://people.apache.org/~buschmi/staging_area/lucene_2_3/rc1/. Let's try to use the next days for testing to ensure that we find serious bugs or build problems as early as possible.

Build failed in Hudson: Lucene-Nightly #331

2008-01-07 Thread hudson
See http://lucene.zones.apache.org:8080/hudson/job/Lucene-Nightly/331/changes -- [...truncated 764 lines...] A contrib/snowball/src/test/org/apache/lucene/analysis/snowball AU contrib/snowball/src/test/org/apache/lucene/analysis/snowball/Test

PLEASE READ: 2.3 branch created, feature freeze in effect

2008-01-07 Thread Michael Busch
Hi Team, thanks everyone for committing the outstanding issues on time! As announced I created a Lucene 2.3 branch today from trunk revision r609842 here: https://svn.apache.org/repos/asf/lucene/java/branches/lucene_2_3/ As of now a feature freeze on this branch is in effect. In detail this mean

[jira] Commented: (LUCENE-508) SegmentTermEnum.next() doesn't maintain prevBuffer at end

2008-01-07 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-508?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12556763#action_12556763 ] Michael McCandless commented on LUCENE-508: --- You're welcome! And sorry it took s

[jira] Commented: (LUCENE-508) SegmentTermEnum.next() doesn't maintain prevBuffer at end

2008-01-07 Thread Steven Tamm (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-508?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12556717#action_12556717 ] Steven Tamm commented on LUCENE-508: Thanks! -Steven > SegmentTermEnum.next() doesn

[jira] Resolved: (LUCENE-751) cannot retrieve term positions from indices using ISOLatin1AccentFilter

2008-01-07 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-751?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless resolved LUCENE-751. --- Resolution: Incomplete Not enough detail on the issue, no test showing the issue, and

[jira] Resolved: (LUCENE-1054) TermDocs.skipTo fails with an ArrayOutOfBoundsException from BitVector

2008-01-07 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1054?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless resolved LUCENE-1054. Resolution: Duplicate This issue was fixed by the same fix for LUCENE-951. > Term

[jira] Resolved: (LUCENE-1014) IndexWriter.optimize() does not respect maxMergeDocs

2008-01-07 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1014?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless resolved LUCENE-1014. Resolution: Won't Fix I don't think we can cleanly fix this w/o breaking backwards

[jira] Resolved: (LUCENE-508) SegmentTermEnum.next() doesn't maintain prevBuffer at end

2008-01-07 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-508?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless resolved LUCENE-508. --- Resolution: Fixed Fix Version/s: 2.3 I just committed this. Thanks Steven! >

Considerations for o.a.l.store.Directory refactoring

2008-01-07 Thread Matt Reynolds
During some recent testing and work with Lucene, my coworkers and I have noticed some things in Lucene's guts that I wanted to ask about. Directory is currently an abstract class that claims in its javadoc that "Directory is a flat list of files", then goes on to describe non-"flat list of files"

[jira] Assigned: (LUCENE-508) SegmentTermEnum.next() doesn't maintain prevBuffer at end

2008-01-07 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-508?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless reassigned LUCENE-508: - Assignee: Michael McCandless > SegmentTermEnum.next() doesn't maintain prevBuffer

[jira] Commented: (LUCENE-508) SegmentTermEnum.next() doesn't maintain prevBuffer at end

2008-01-07 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-508?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12556691#action_12556691 ] Michael McCandless commented on LUCENE-508: --- I have a unit test showing this ...

[jira] Resolved: (LUCENE-1037) Corrupt index: term out of order after forced stop during indexing

2008-01-07 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1037?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless resolved LUCENE-1037. Resolution: Duplicate I think very likely this is a dup of LUCENE-1044: on a hard

Re: Lucene lock file created

2008-01-07 Thread Michael McCandless
This is really a java-user sort of question. Likely you had your JVM crash / get killed while it was indexing? Which then leaves the lock file in the temp directory. Lucene's locking is based on existence of that lock file. If there is a time in your app (eg on startup) when you are cert
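To illustrate why a killed JVM leaves the index locked, here is a small sketch of existence-based file locking in plain Java. It is not Lucene's lock implementation; the class SimpleFileLock and its methods are hypothetical names chosen for this example. The lock counts as held for as long as the file exists, so a process that dies without releasing it leaves a stale lock that has to be removed explicitly.

```java
import java.io.File;
import java.io.IOException;
import java.io.UncheckedIOException;

final class SimpleFileLock {
    private final File lockFile;

    SimpleFileLock(File lockFile) {
        this.lockFile = lockFile;
    }

    /** Tries to take the lock; returns false if the lock file already exists. */
    boolean obtain() {
        try {
            // createNewFile() creates the file only if it does not exist yet.
            return lockFile.createNewFile();
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    /** The lock is considered held whenever the file exists. */
    boolean isLocked() {
        return lockFile.exists();
    }

    /** Normal release by the lock holder. */
    void release() {
        lockFile.delete();
    }

    /**
     * Removes a stale lock left behind by a crashed process. Only safe when
     * the caller is certain that no other process is still writing.
     */
    static void forceUnlock(File lockFile) {
        lockFile.delete();
    }
}
```

An application that knows it is the only writer could call forceUnlock() once on startup and then obtain the lock normally.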

RE: Fullwidth alphanumeric characters, plus a question on Korean ranges

2008-01-07 Thread Steven A Rowe
Hi Daniel, I think this discussion belongs on java-dev, so I'm replying there. On 01/06/2008 at 7:47 PM, Daniel Noll wrote: > We discovered [in StandardTokenizer.jj] that fullwidth letters are > not treated as and fullwidth digits are not treated as . IMHO, this should be fixed in the JFlex ver

[jira] Updated: (LUCENE-1121) Use nio.transferTo when copying large blocks of bytes

2008-01-07 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1121?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless updated LUCENE-1121: --- Attachment: LUCENE-1121.patch Attached patch. All tests pass. We shouldn't push th

[jira] Created: (LUCENE-1121) Use nio.transferTo when copying large blocks of bytes

2008-01-07 Thread Michael McCandless (JIRA)
Use nio.transferTo when copying large blocks of bytes - Key: LUCENE-1121 URL: https://issues.apache.org/jira/browse/LUCENE-1121 Project: Lucene - Java Issue Type: Improvement Comp
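As background for this issue, the following is a minimal sketch of copying a file with FileChannel.transferTo(). It is not the attached patch; the class name ChannelCopy is hypothetical, and the code uses modern Java (java.nio.file and try-with-resources) rather than the Java version Lucene targeted at the time. transferTo() may move fewer bytes than requested, so the call is repeated until the whole file has been transferred.

```java
import java.io.IOException;
import java.io.UncheckedIOException;
import java.nio.channels.FileChannel;
import java.nio.file.Path;
import java.nio.file.StandardOpenOption;

final class ChannelCopy {
    private ChannelCopy() {}

    /** Copies source to target via transferTo(); returns the bytes copied. */
    static long copy(Path source, Path target) {
        try (FileChannel in = FileChannel.open(source, StandardOpenOption.READ);
             FileChannel out = FileChannel.open(target,
                     StandardOpenOption.CREATE,
                     StandardOpenOption.WRITE,
                     StandardOpenOption.TRUNCATE_EXISTING)) {
            long size = in.size();
            long position = 0;
            while (position < size) {
                // A single call may transfer less than the requested count.
                long n = in.transferTo(position, size - position, out);
                if (n <= 0) {
                    break; // no progress; stop instead of spinning
                }
                position += n;
            }
            return position;
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }
}
```

Whether this beats a buffered read/write loop depends on the operating system and file size, so it is worth measuring before relying on it.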

[jira] Commented: (LUCENE-1117) Intermittent thread safety issue with EnwikiDocMaker

2008-01-07 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1117?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12556602#action_12556602 ] Grant Ingersoll commented on LUCENE-1117: - OK, that works! Thanks, Mike > Interm

[jira] Commented: (LUCENE-1117) Intermittent thread safety issue with EnwikiDocMaker

2008-01-07 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1117?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12556598#action_12556598 ] Michael McCandless commented on LUCENE-1117: I think you just need to call doc

[jira] Commented: (LUCENE-1120) Use bulk-byte-copy when merging term vectors

2008-01-07 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1120?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12556593#action_12556593 ] Michael McCandless commented on LUCENE-1120: {quote} I wonder how accurate the

[jira] Commented: (LUCENE-1117) Intermittent thread safety issue with EnwikiDocMaker

2008-01-07 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1117?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12556591#action_12556591 ] Grant Ingersoll commented on LUCENE-1117: - I am now getting: Exception in thread "

[jira] Commented: (LUCENE-1117) Intermittent thread safety issue with EnwikiDocMaker

2008-01-07 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1117?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12556594#action_12556594 ] Grant Ingersoll commented on LUCENE-1117: - This is what I am doing: EnwikiDocMaker

[jira] Commented: (LUCENE-1120) Use bulk-byte-copy when merging term vectors

2008-01-07 Thread Michael Busch (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1120?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12556590#action_12556590 ] Michael Busch commented on LUCENE-1120: --- {quote} Indexing all of Wikipedia, with ter

[jira] Commented: (LUCENE-1120) Use bulk-byte-copy when merging term vectors

2008-01-07 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1120?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12556589#action_12556589 ] Michael McCandless commented on LUCENE-1120: {quote} Considering the file form

2.3 branch today?

2008-01-07 Thread Michael Busch
Hi Team, as announced last week I'm planning to create a 2.3 branch today. Then we'll have a code freeze on the branch for ~10days. Any objections? -Michael

[jira] Commented: (LUCENE-1120) Use bulk-byte-copy when merging term vectors

2008-01-07 Thread Michael Busch (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1120?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12556586#action_12556586 ] Michael Busch commented on LUCENE-1120: --- {quote} I think we should commit this for 2

[jira] Updated: (LUCENE-1120) Use bulk-byte-copy when merging term vectors

2008-01-07 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1120?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless updated LUCENE-1120: --- Attachment: LUCENE-1120.patch Attached patch. All tests pass. (Note that the TestB

[jira] Created: (LUCENE-1120) Use bulk-byte-copy when merging term vectors

2008-01-07 Thread Michael McCandless (JIRA)
Use bulk-byte-copy when merging term vectors Key: LUCENE-1120 URL: https://issues.apache.org/jira/browse/LUCENE-1120 Project: Lucene - Java Issue Type: Improvement Components: Index

Re: Too ambitious : but wanting to know the exact procedure.

2008-01-07 Thread Paul Elschot
Hi, In case you want to understand what happens when a document field is stored and indexed, try and add your example to the file formats as documented here: http://lucene.apache.org/java/docs/fileformats.html When that clears up your mind to an extent that was not available before, and you'd lik