Re: [PATCH] Bug on BrazilianAnalyzer

2008-11-24 Thread Adriano Crestani
Hi Rafael, I kind of agree with you. Practically all the StemFilters have the same logic, they might be combined into only one class. All StemFilters seems to have a setStemmer already, we could keep that and also allow to pass the stemmer as a constructor paramenter, like you said. I think you ca

[jira] Updated: (LUCENE-1467) Consolidate Solr's and Lucene's OpenBitSet classes

2008-11-24 Thread Michael Busch (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1467?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Busch updated LUCENE-1467: -- Attachment: lucene-1467.patch Simple patch that adds two methods to OpenBitSetIterator: nextDo

Re: Mark Miller as core Lucene committer

2008-11-24 Thread Mark Miller
Guess I'd forgotten: I'm a Vermonter stuck in the flat lands of Connecticut. I graduated from the University of Vermont in 2005 at the tail end of a rather unfruitful 17 years of education. I think like 6 other computer science majors graduated with me in my class (UVM has a pop of around 10,0

[jira] Commented: (LUCENE-1458) Further steps towards flexible indexing

2008-11-24 Thread Marvin Humphrey (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1458?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12650412#action_12650412 ] Marvin Humphrey commented on LUCENE-1458: - > Be careful: it's the seeking that ki

[jira] Commented: (LUCENE-1469) isValid should be invoked after analyze rather than before it so it can validate the output of analyze

2008-11-24 Thread Vincent Li (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1469?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12650390#action_12650390 ] Vincent Li commented on LUCENE-1469: On second thought - it might be a better idea to

[jira] Created: (LUCENE-1469) isValid should be invoked after analyze rather than before it so it can validate the output of analyze

2008-11-24 Thread Vincent Li (JIRA)
isValid should be invoked after analyze rather than before it so it can validate the output of analyze -- Key: LUCENE-1469 URL: https://issues.apache.org/jira/brow

[jira] Updated: (LUCENE-1465) NearSpansOrdered.getPayload does not return the payload from the minimum match span

2008-11-24 Thread Mark Miller (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1465?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Mark Miller updated LUCENE-1465: Attachment: LUCENE-1465.patch That still wasn't quite right. A third test and a third fix. I am pr

[jira] Commented: (LUCENE-1458) Further steps towards flexible indexing

2008-11-24 Thread Marvin Humphrey (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1458?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12650380#action_12650380 ] Marvin Humphrey commented on LUCENE-1458: - >> I suppose we genericize this by addi

[jira] Commented: (LUCENE-1458) Further steps towards flexible indexing

2008-11-24 Thread Marvin Humphrey (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1458?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12650377#action_12650377 ] Marvin Humphrey commented on LUCENE-1458: - >> Hmm, maybe we can conflate this with

[jira] Commented: (LUCENE-1451) Can't create NIOFSDirectory w/o setting a system property

2008-11-24 Thread Doug Cutting (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1451?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12650344#action_12650344 ] Doug Cutting commented on LUCENE-1451: -- A bit of history, if any care. Originally Lu

[jira] Updated: (LUCENE-1465) NearSpansOrdered.getPayload does not return the payload from the minimum match span

2008-11-24 Thread Mark Miller (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1465?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Mark Miller updated LUCENE-1465: Attachment: LUCENE-1465.patch Bah. Its even worse than that. Even after you get down to a min matc

[jira] Commented: (LUCENE-1458) Further steps towards flexible indexing

2008-11-24 Thread Marvin Humphrey (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1458?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12650316#action_12650316 ] Marvin Humphrey commented on LUCENE-1458: - > Nathan Kurz and I brainstormed th

[jira] Updated: (LUCENE-1465) NearSpansOrdered.getPayload does not return the payload from the minimum match span

2008-11-24 Thread Mark Miller (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1465?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Mark Miller updated LUCENE-1465: Attachment: LUCENE-1465.patch > NearSpansOrdered.getPayload does not return the payload from the m

[jira] Commented: (LUCENE-1461) Cached filter for a single term field

2008-11-24 Thread Tim Sturge (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1461?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12650308#action_12650308 ] Tim Sturge commented on LUCENE-1461: Looking at FieldCache and FieldDocSortedHitQueue

[jira] Commented: (LUCENE-1461) Cached filter for a single term field

2008-11-24 Thread Tim Sturge (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1461?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12650303#action_12650303 ] Tim Sturge commented on LUCENE-1461: Paul, Mike, FieldCache.StringIndex doesn't behav

[jira] Updated: (LUCENE-1461) Cached filter for a single term field

2008-11-24 Thread Tim Sturge (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1461?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Sturge updated LUCENE-1461: --- Attachment: RangeMultiFilter.java This is a version of RangeMultiFilter built on top of FieldCache.

[jira] Commented: (LUCENE-1465) NearSpansOrdered.getPayload does not return the payload from the minimum match span

2008-11-24 Thread Mark Miller (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1465?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12650290#action_12650290 ] Mark Miller commented on LUCENE-1465: - I plan on committing this soon. This is a real

Re: Mark Miller as core Lucene committer

2008-11-24 Thread Michael Busch
Hey Mark, there's this tradition that new committers write a short introduction about themselves. Let's keep this tradition up! :) -Michael Mark Miller wrote: Thanks all. Happy to be part of the excellent Lucene community.

[jira] Updated: (LUCENE-1465) NearSpansOrdered.getPayload does not return the payload from the minimum match span

2008-11-24 Thread Mark Miller (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1465?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Mark Miller updated LUCENE-1465: Fix Version/s: 2.9 > NearSpansOrdered.getPayload does not return the payload from the minimum > m

[jira] Commented: (LUCENE-1464) FSDirectory.getDirectory always creates index path

2008-11-24 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1464?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12650199#action_12650199 ] Michael McCandless commented on LUCENE-1464: I suppose we could consider addin

[jira] Commented: (LUCENE-1468) FSDirectory.list() is inconsistent

2008-11-24 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1468?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12650197#action_12650197 ] Michael McCandless commented on LUCENE-1468: I would tend to agree -- Director

[jira] Assigned: (LUCENE-1468) FSDirectory.list() is inconsistent

2008-11-24 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1468?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless reassigned LUCENE-1468: -- Assignee: Michael McCandless > FSDirectory.list() is inconsistent > --

RE: Indexing Open office documents

2008-11-24 Thread ganesh H D
Hi, open office documents are getting indexed but when i search for the words of those documents i am not seeing the correct result. regards, ganesh Uwe Schindler wrote: > > For converting full text to plain text for indexing look at Apache TIKA, > which has an converter for OpenDocument: http

Re: Indexing Open office documents

2008-11-24 Thread ganesh H D
Hi, open office documents are getting indexed but when i search for the words of those documents i am not seeing the correct result. regards, ganesh ganesh H D wrote: > > Hi, > > I have been working on Apache Lucene from past 3 days. I tried to deploy > the sample application which we get from

[jira] Updated: (LUCENE-1468) FSDirectory.list() is inconsistent

2008-11-24 Thread Marcel Reutegger (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1468?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcel Reutegger updated LUCENE-1468: - Attachment: DirectoryTest.java Test cases to illustrate the issue. > FSDirectory.list()

[jira] Created: (LUCENE-1468) FSDirectory.list() is inconsistent

2008-11-24 Thread Marcel Reutegger (JIRA)
FSDirectory.list() is inconsistent -- Key: LUCENE-1468 URL: https://issues.apache.org/jira/browse/LUCENE-1468 Project: Lucene - Java Issue Type: Bug Components: Store Affects Versions: 2.4, 2.3.2