ParseException

2009-08-28 Thread utuncdemir
Hello, When a search term contains the word 'OR', Lucene throws a ParseException, but here the term 'OR' is not meant to be interpreted as a boolean condition in the query. The need is simple: to get the Lucene document which has text such as ' Holland OR Germany '. In other words, when I search for

RE: CachingTokenFilter extensibility and LUCENE-1685

2009-08-28 Thread Uwe Schindler
Hi David, What exactly is your problem? Even the old 2.4 CachingTokenFilter did not expose its internal structures, so overriding would not change its internal implementation. The only change now is that *all* TokenFilters in core have final implementations, which is a consequence of the new

[jira] Closed: (LUCENE-1269) Analysers and Filters should not be final

2009-08-28 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1269?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe Schindler closed LUCENE-1269. - Resolution: Won't Fix See LUCENE-1753 for an explanation of why TokenStreams and TokenFilters

[jira] Commented: (LUCENE-1521) fdx size mismatch exception in StoredFieldsWriter.closeDocStore() when closing index with 500M documents

2009-08-28 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1521?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12748745#action_12748745 ] Michael McCandless commented on LUCENE-1521: Is there anything in your env

[jira] Created: (LUCENE-1869) when checking tvx/fdx size mismatch, also include whether the file exists

2009-08-28 Thread Michael McCandless (JIRA)
when checking tvx/fdx size mismatch, also include whether the file exists - Key: LUCENE-1869 URL: https://issues.apache.org/jira/browse/LUCENE-1869 Project: Lucene - Java

[jira] Updated: (LUCENE-1869) when checking tvx/fdx size mismatch, also include whether the file exists

2009-08-28 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1869?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless updated LUCENE-1869: --- Attachment: LUCENE-1869.patch Attached patch. The patch is trivial (only changes

Fwd: [jira] Updated: (LUCENE-1869) when checking tvx/fdx size mismatch, also include whether the file exists

2009-08-28 Thread Michael McCandless
Mark is it OK to commit this for 2.9? It just improves debugging when users hit the existing fdx/tvx mismatch exception. Mike -- Forwarded message -- From: Michael McCandless (JIRA) j...@apache.org Date: Fri, Aug 28, 2009 at 5:03 AM Subject: [jira] Updated: (LUCENE-1869) when

[jira] Commented: (LUCENE-1458) Further steps towards flexible indexing

2009-08-28 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1458?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12748765#action_12748765 ] Michael McCandless commented on LUCENE-1458: bq. Maybe we should break this

[jira] Commented: (LUCENE-1870) dists include analyzer contrib in src dist but not binary dist

2009-08-28 Thread Mark Miller (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1870?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12748801#action_12748801 ] Mark Miller commented on LUCENE-1870: - this one snuck by my dists include analyzer

[jira] Created: (LUCENE-1870) dists include analyzer contrib in src dist but not binary dist

2009-08-28 Thread Mark Miller (JIRA)
dists include analyzer contrib in src dist but not binary dist -- Key: LUCENE-1870 URL: https://issues.apache.org/jira/browse/LUCENE-1870 Project: Lucene - Java Issue Type: Bug

Re: Fwd: [jira] Updated: (LUCENE-1869) when checking tvx/fdx size mismatch, also include whether the file exists

2009-08-28 Thread Mark Miller
You won't see me complain. - Mark Michael McCandless wrote: Mark is it OK to commit this for 2.9? It just improves debugging when users hit the existing fdx/tvx mismatch exception. Mike -- Forwarded message -- From: Michael McCandless (JIRA) j...@apache.org Date: Fri,

[jira] Commented: (LUCENE-1870) dists include analyzer contrib in src dist but not binary dist

2009-08-28 Thread Mark Miller (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1870?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12748827#action_12748827 ] Mark Miller commented on LUCENE-1870: - okay - I think I've addressed it all - will

[jira] Updated: (LUCENE-1862) duplicate package.html files in queryParser and analsysis.cn packages

2009-08-28 Thread Mark Miller (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1862?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Mark Miller updated LUCENE-1862: Priority: Minor (was: Major) duplicate package.html files in queryParser and analsysis.cn

[jira] Assigned: (LUCENE-1868) update NOTICE.txt

2009-08-28 Thread Mark Miller (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1868?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Mark Miller reassigned LUCENE-1868: --- Assignee: Mark Miller update NOTICE.txt - Key:

[jira] Resolved: (LUCENE-1868) update NOTICE.txt

2009-08-28 Thread Mark Miller (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1868?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Mark Miller resolved LUCENE-1868. - Resolution: Fixed Thanks a lot Robert! update NOTICE.txt -

[jira] Resolved: (LUCENE-1867) replace collation/lib/icu4j.jar with a smaller icu jar

2009-08-28 Thread Mark Miller (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1867?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Mark Miller resolved LUCENE-1867. - Resolution: Fixed Assignee: Mark Miller (was: Robert Muir) Thanks Robert! replace

[jira] Assigned: (LUCENE-1870) dists include analyzer contrib in src dist but not binary dist

2009-08-28 Thread Mark Miller (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1870?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Mark Miller reassigned LUCENE-1870: --- Assignee: Mark Miller dists include analyzer contrib in src dist but not binary dist

[jira] Resolved: (LUCENE-1870) dists include analyzer contrib in src dist but not binary dist

2009-08-28 Thread Mark Miller (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1870?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Mark Miller resolved LUCENE-1870. - Resolution: Fixed Okay, I think it's good - that misc issue was actually in the last release so

RE: CachingTokenFilter extensibility and LUCENE-1685

2009-08-28 Thread David Kaelbling
Hi Uwe, The problem is that I need to have a random access token stream for other reasons, and don't want CachingTokenFilter to buffer up a redundant copy of it. In existing releases I subclass it to override all the methods to use my store, and ignore the LinkedList cache member. The old

Re: CachingTokenFilter extensibility and LUCENE-1685

2009-08-28 Thread Mark Miller
bq. If there were some way to tell WeightedSpanTermExtractor not to wrap the stream (a new TokenStream.isCachingTokens() method, checking for a new CachedTokenStream interface rather than for CachingTokenFilter, some attribute, anything! :-) then I could still work with the public API. I didn't

Re: ParseException

2009-08-28 Thread Adriano Crestani
Hi, You can escape the query parser keywords: Holland \OR Germany This way the query parser will interpret it as a term instead of a boolean operator. Regards, Adriano Crestani On Fri, Aug 28, 2009 at 3:51 AM, utuncdemir cam...@gmail.com wrote: Hello When search term has word 'OR',
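
The escaping suggested above can be applied to user input before it is handed to the query parser. The following is a minimal sketch in plain Java, not part of Lucene: the class name KeywordEscaper and the method escapeBooleanKeywords are hypothetical, and it assumes the operator keywords to protect are the upper-case whole words AND, OR and NOT.

```java
import java.util.regex.Pattern;

public class KeywordEscaper {
    // Assumption: the operator keywords are the upper-case whole words AND, OR, NOT.
    private static final Pattern KEYWORD = Pattern.compile("\\b(AND|OR|NOT)\\b");

    /** Prefixes each whole-word operator keyword with a backslash (hypothetical helper). */
    static String escapeBooleanKeywords(String userInput) {
        // In the replacement string, "\\\\" is one literal backslash and "$1" is the matched keyword.
        return KEYWORD.matcher(userInput).replaceAll("\\\\$1");
    }

    public static void main(String[] args) {
        // prints: Holland \OR Germany
        System.out.println(escapeBooleanKeywords("Holland OR Germany"));
    }
}
```

Words that merely contain a keyword, such as "ORlando", are left alone because the pattern only matches at word boundaries.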

[jira] Created: (LUCENE-1871) Highlighter wraps caching token filters that are not CachingTokenFilter in CachingTokenFilter

2009-08-28 Thread Mark Miller (JIRA)
Highlighter wraps caching token filters that are not CachingTokenFilter in CachingTokenFilter - Key: LUCENE-1871 URL: https://issues.apache.org/jira/browse/LUCENE-1871

[jira] Updated: (LUCENE-1871) Highlighter wraps caching token filters that are not CachingTokenFilter in CachingTokenFilter

2009-08-28 Thread Mark Miller (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1871?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Mark Miller updated LUCENE-1871: Attachment: LUCENE-1871.patch Allows you to take ownership of providing an efficiently resettable

[jira] Commented: (LUCENE-1871) Highlighter wraps caching token filters that are not CachingTokenFilter in CachingTokenFilter

2009-08-28 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1871?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12748896#action_12748896 ] Uwe Schindler commented on LUCENE-1871: --- Are we also in feature freeze for contrib?

[jira] Commented: (LUCENE-1871) Highlighter wraps caching token filters that are not CachingTokenFilter in CachingTokenFilter

2009-08-28 Thread Mark Miller (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1871?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12748897#action_12748897 ] Mark Miller commented on LUCENE-1871: - With consensus on my side, I have no problem

Lucene 2.9 RC2

2009-08-28 Thread Mark Miller
Looks like people.apache.org just started working for me again. I'm going to go out for lunch and then put up RC2 when I get back. Unless someone objects before then, I'm going to include LUCENE-1871 in RC2. -- - Mark http://www.lucidimagination.com

[jira] Commented: (LUCENE-1871) Highlighter wraps caching token filters that are not CachingTokenFilter in CachingTokenFilter

2009-08-28 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1871?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12748901#action_12748901 ] Uwe Schindler commented on LUCENE-1871: --- I would like to have this in, because there

RE: Lucene 2.9 RC2

2009-08-28 Thread Uwe Schindler
Unless someone objects before then, I'm going to include LUCENE-1871 in RC2. +1

RE: CachingTokenFilter extensibility and LUCENE-1685

2009-08-28 Thread David Kaelbling
Uwe, I kind of like the idea of changing WeightedSpanTermExtractor to test for !(tokenStream instanceof RandomAccess) :-) - David -- David Kaelbling Senior Software Engineer Black Duck Software, Inc. dkaelbl...@blackducksoftware.com T +1.781.810.2041 F +1.781.891.5145
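
The instanceof test floated above could look roughly like the sketch below. It is not Lucene code and not the change made in LUCENE-1871: Stream, CachingWrapper, OneShotStream, StoreBackedStream and wrapIfNeeded are hypothetical stand-ins, and it assumes "RandomAccess" refers to the java.util.RandomAccess marker interface.

```java
import java.util.RandomAccess;

public class WrapCheck {
    /** Hypothetical stand-in for a token stream type. */
    interface Stream {}

    /** Hypothetical stand-in for a caching wrapper such as CachingTokenFilter. */
    static final class CachingWrapper implements Stream {
        final Stream inner;
        CachingWrapper(Stream inner) { this.inner = inner; }
    }

    /** A plain stream with no cache of its own. */
    static final class OneShotStream implements Stream {}

    /** A stream backed by its own store, tagged with the marker interface. */
    static final class StoreBackedStream implements Stream, RandomAccess {}

    /** Wraps the stream in a cache unless it is already cached or marked RandomAccess. */
    static Stream wrapIfNeeded(Stream stream) {
        if (stream instanceof CachingWrapper || stream instanceof RandomAccess) {
            return stream; // no redundant second copy of the tokens
        }
        return new CachingWrapper(stream);
    }

    public static void main(String[] args) {
        System.out.println(wrapIfNeeded(new OneShotStream()) instanceof CachingWrapper);     // true
        System.out.println(wrapIfNeeded(new StoreBackedStream()) instanceof CachingWrapper); // false
    }
}
```

A marker interface keeps the check independent of any concrete caching class, which is the property the message asks for.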

Re: CachingTokenFilter extensibility and LUCENE-1685

2009-08-28 Thread Mark Miller
In the longer term, I think we do something that is more automatic and correct - but for now, adding this brute force option is best I think. David Kaelbling wrote: Uwe, I kind of like the idea of changing WeightedSpanTermExtractor to test for !(tokenStream instanceof RandomAccess) :-) -

[jira] Updated: (LUCENE-1871) Highlighter wraps caching token filters that are not CachingTokenFilter in CachingTokenFilter

2009-08-28 Thread Mark Miller (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1871?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Mark Miller updated LUCENE-1871: Lucene Fields: [New, Patch Available] (was: [New]) Fix Version/s: 2.9 Highlighter wraps

[jira] Resolved: (LUCENE-1871) Highlighter wraps caching token filters that are not CachingTokenFilter in CachingTokenFilter

2009-08-28 Thread Mark Miller (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1871?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Mark Miller resolved LUCENE-1871. - Resolution: Fixed Highlighter wraps caching token filters that are not CachingTokenFilter in

[jira] Updated: (LUCENE-1342) 64bit JVM crashes on Linux

2009-08-28 Thread Jason Rutherglen (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1342?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Rutherglen updated LUCENE-1342: - Attachment: jvmerror.log Here's the JVM error I'm seeing on Amazon EC2: java version

Re: [jira] Updated: (LUCENE-1342) 64bit JVM crashes on Linux

2009-08-28 Thread Ted Dunning
That looks like a slightly musty JVM. The current stable is 1.6_16, I think. That said, we use 1.6u12 with good results on EC2. What OS/version are you running? (we used alestic hardy heron ubuntu distros since they provide very simple configuration without building a new AMI) On Fri, Aug 28,