Custom Filter

2010-03-24 Thread Siraj Haider
Hello there, I am getting an exception when running queries with the new getDocIdSet() in my custom filter. Following is the code for my getDocIdSet() function: public DocIdSet getDocIdSet(IndexReader reader) throws IOException { OpenBitSet bitSet = new OpenBitSet(reader.maxDoc()); for (in

Re: Custom Filter

2010-03-25 Thread Ian Lea
Could this maybe have something to do with per-segment readers, as mentioned in recent message from Daniel? Posting lucene version and the full stack trace dump is always a good idea. -- Ian. On Wed, Mar 24, 2010 at 6:56 PM, Siraj Haider wrote: > Hello there, > I am getting exception when run

Re: Custom Filter

2010-03-25 Thread Siraj Haider
I figured this one out... it was due to a mistake in my code... sorry for trouble. -siraj On 3/25/2010 5:48 AM, Ian Lea wrote: Could this maybe have something to do with per-segment readers, as mentioned in recent message from Daniel? Posting lucene version and the full stack trace dump is al

question on custom filter

2009-07-20 Thread OBender
Hi All! Let's say I have a filter that produces new tokens based on the original ones. How bad will it be if my filter sets the start of each token to 0 and the end to the length of the token? An example (based on the phrase "How are you?"): Original token: [you?] (8,12) New tokens: [you]

Problem with Custom Filter

2007-01-26 Thread Paul Lynch
Hi, I am going mad trying to find out what I am doing wrong with my custom filter implementation (almost an exact copy of SpecialsFilter from LIA). I have put together a quick sample to illustrate my problem, if some kind soul has 2 minutes to take a quick look and tell me where I am being so

Re: question on custom filter

2009-07-20 Thread Robert Muir
Obender, I don't think it's as difficult as you think. Your filter does not need to be aware of this issue at all. In Unicode, right-to-left languages are encoded in the data in logical order. The rendering system is what converts it to right-to-left display for RTL languages. For example in Ar

RE: question on custom filter

2009-07-20 Thread OBender
ssage- From: Robert Muir [mailto:rcm...@gmail.com] Sent: Monday, July 20, 2009 1:43 PM To: java-user@lucene.apache.org Subject: Re: question on custom filter Obender, I don't think its as difficult as you think. Your filter does not need to be aware of this issue at all. In unicode, rig

Re: question on custom filter

2009-07-20 Thread Robert Muir
s and indexes). > > Am I missing something? > > -Original Message- > From: Robert Muir [mailto:rcm...@gmail.com] > Sent: Monday, July 20, 2009 1:43 PM > To: java-user@lucene.apache.org > Subject: Re: question on custom filter > > Obender, I don't think its as dif

RE: question on custom filter

2009-07-20 Thread OBender
ly 20, 2009 2:07 PM To: java-user@lucene.apache.org Subject: Re: question on custom filter Obender, This is not true. the text you pasted is the following in unicode: \N{HEBREW LETTER TET} \N{HEBREW LETTER VAV} \N{HEBREW POINT HOLAM} \N{HEBREW LETTER BET} \N{SPACE} \N{HEBREW LETTER AYIN} \N{HEBREW

RE: question on custom filter

2009-07-20 Thread OBender
Hold on a second, the phrase that you included a link to is not in the correct order of words! -Original Message- From: Robert Muir [mailto:rcm...@gmail.com] Sent: Monday, July 20, 2009 2:07 PM To: java-user@lucene.apache.org Subject: Re: question on custom filter Obender, This is not

RE: question on custom filter

2009-07-20 Thread OBender
: Re: question on custom filter Obender, This is not true. the text you pasted is the following in unicode: \N{HEBREW LETTER TET} \N{HEBREW LETTER VAV} \N{HEBREW POINT HOLAM} \N{HEBREW LETTER BET} \N{SPACE} \N{HEBREW LETTER AYIN} \N{HEBREW POINT SEGOL} \N{HEBREW LETTER RESH} \N{HEBREW POINT SEGOL

Re: question on custom filter

2009-07-20 Thread Robert Muir
ser@lucene.apache.org > Subject: Re: question on custom filter > > Obender, This is not true. > the text you pasted is the following in unicode: > > \N{HEBREW LETTER TET} > \N{HEBREW LETTER VAV} > \N{HEBREW POINT HOLAM} > \N{HEBREW LETTER BET} > \N{SPACE} > \N{HEBREW LETTER

RE: question on custom filter

2009-07-20 Thread OBender
che.org Subject: Re: question on custom filter Obender, I think something in your environment / display environment might be causing some confusion. Are you using microsoft windows? If so, please verify that support for right-to-left languages is enabled [control panel/regional and language options].

Re: question on custom filter

2009-07-20 Thread Robert Muir
riginal Message- > From: Robert Muir [mailto:rcm...@gmail.com] > Sent: Monday, July 20, 2009 2:26 PM > To: java-user@lucene.apache.org > Subject: Re: question on custom filter > > Obender, I think something in your environment / display environment > might be causing some confu

Re: question on custom filter

2009-07-20 Thread Robert Muir
al String fieldName, final Reader >> reader ) >>        { >>                TokenStream ts  = new WhitespaceTokenizer( reader ); >>                ts                      = new XFilter( ts ); >> >>                return ts; >>        } >> } >> >> -Original M

RE: question on custom filter

2009-07-20 Thread OBender
3:03 PM To: java-user@lucene.apache.org Subject: Re: question on custom filter Obender, i ran your code and it did what I expected (but not what you pasted): First token is: (טוֹב,0,4) Second token is: (עֶרֶב,5,10) I also loaded up your SimpleWhitespaceAnalyzer in Luke, with the same results. On M
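The offsets reported above, (0,4) and (5,10), follow directly from the logical (storage) order of the characters and can be checked without Lucene. The following is a minimal sketch in plain Java, not the poster's XFilter or Lucene's WhitespaceTokenizer; the class and method names are hypothetical, and the phrase is written with Unicode escapes (TET, VAV, HOLAM, BET, space, AYIN, SEGOL, RESH, SEGOL, BET, as listed earlier in the thread) so that no editor or mail client can reorder it visually.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper for illustration only; it is not part of Lucene.
class LogicalOrderOffsets {
    // The Hebrew phrase in logical order, spelled with escapes.
    static final String TEXT =
        "\u05D8\u05D5\u05B9\u05D1"           // first word: 4 chars
        + " "
        + "\u05E2\u05B6\u05E8\u05B6\u05D1";  // second word: 5 chars

    /** Splits on whitespace and returns the {start, end} offsets of each token. */
    static List<int[]> whitespaceOffsets(String text) {
        List<int[]> offsets = new ArrayList<>();
        int start = -1;
        for (int i = 0; i <= text.length(); i++) {
            boolean boundary = i == text.length()
                || Character.isWhitespace(text.charAt(i));
            if (!boundary && start < 0) {
                start = i;                          // a token begins here
            } else if (boundary && start >= 0) {
                offsets.add(new int[] {start, i});  // the token ends before i
                start = -1;
            }
        }
        return offsets;
    }

    public static void main(String[] args) {
        for (int[] o : whitespaceOffsets(TEXT)) {
            System.out.println("(" + o[0] + "," + o[1] + ")");
        }
    }
}
```

The offsets count chars in storage order, so they stay the same regardless of whether a viewer draws the text left-to-right or right-to-left.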

Re: question on custom filter

2009-07-20 Thread Robert Muir
onday, July 20, 2009 3:03 PM > To: java-user@lucene.apache.org > Subject: Re: question on custom filter > > Obender, i ran your code and it did what I expected (but not what you pasted): > > First token is: (טוֹב,0,4) > Second token is: (עֶרֶב,5,10) > > I also loaded up your Simp

RE: question on custom filter

2009-07-20 Thread OBender
I've checked, and it appears to be enabled. -Original Message- From: Robert Muir [mailto:rcm...@gmail.com] Sent: Monday, July 20, 2009 3:18 PM To: java-user@lucene.apache.org Subject: Re: question on custom filter Obender, based on your previous comments (that you see text display

Re: question on custom filter

2009-07-20 Thread Robert Muir
From: Robert Muir [mailto:rcm...@gmail.com] > Sent: Monday, July 20, 2009 3:18 PM > To: java-user@lucene.apache.org > Subject: Re: question on custom filter > > Obender, based on your previous comments (that you see text displayed > in the wrong order), I again recommend that you en

RE: question on custom filter

2009-07-20 Thread OBender
u :) ? -Original Message- From: Robert Muir [mailto:rcm...@gmail.com] Sent: Monday, July 20, 2009 3:34 PM To: java-user@lucene.apache.org Subject: Re: question on custom filter Obender, I think your input is incorrect. The hebrew text you pasted in your example appears incorrect. Its gonna be h

RE: question on custom filter

2009-07-20 Thread OBender
t (left to right, e.g., EREV left most then TOV). Should UTF-8 CSV file preserve the natural (language specific) order of words? -Original Message- From: Robert Muir [mailto:rcm...@gmail.com] Sent: Monday, July 20, 2009 3:49 PM To: java-user@lucene.apache.org Subject: Re: question on cus

RE: question on custom filter

2009-07-20 Thread OBender
Never mind, I think I got it. -Original Message- From: OBender [mailto:osya_ben...@hotmail.com] Sent: Monday, July 20, 2009 4:42 PM To: java-user@lucene.apache.org Subject: RE: question on custom filter No, it got reversed in the e-mail. Funny though, when I insert it into Excel it

Re: Problem with Custom Filter

2007-01-26 Thread Erick Erickson
07, Paul Lynch <[EMAIL PROTECTED]> wrote: Hi, I am going mad trying to find out what I am doing wrong with my custom filter implementation (almost an exact copy of SpecialsFilter from LIA). I have put together a quick sample to illustrate my problem, if some kind soul has 2 minutes to take a

Re: Problem with Custom Filter

2007-01-26 Thread Paul Lynch
: Problem with Custom Filter I think you're only setting one bit in your filter. Your docs array is only one cell long, and your termDocs.read reads up to the length of docs (exactly one in this case) entries. So, you're getting only one doc ID. And setting it. Even if you made yo
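The pitfall described above (one read call into a one-cell array yields one doc ID) can be illustrated without Lucene. The sketch below is an assumption-laden stand-in: DocSource is a hypothetical interface that mimics only the "fill up to docs.length entries, return how many were read, return 0 when exhausted" contract the message refers to; it is not Lucene's TermDocs, and the poster's actual filter code is not shown in this snippet.

```java
import java.util.BitSet;

// Illustration only: DocSource is a hypothetical stand-in, not a Lucene type.
class BatchedReadSketch {
    interface DocSource {
        /** Fills docs/freqs with up to docs.length entries; returns the count, 0 when exhausted. */
        int read(int[] docs, int[] freqs);
    }

    /** Toy source backed by a fixed array of matching doc IDs. */
    static DocSource over(final int[] matches) {
        return new DocSource() {
            private int pos = 0;
            public int read(int[] docs, int[] freqs) {
                int n = Math.min(docs.length, matches.length - pos);
                for (int i = 0; i < n; i++) {
                    docs[i] = matches[pos + i];
                    freqs[i] = 1;
                }
                pos += n;
                return n;
            }
        };
    }

    /** The pitfall: a single read into a one-cell buffer sets at most one bit. */
    static BitSet collectOnce(DocSource source, int maxDoc) {
        BitSet bits = new BitSet(maxDoc);
        int[] docs = new int[1];
        int[] freqs = new int[1];
        if (source.read(docs, freqs) > 0) {
            bits.set(docs[0]);
        }
        return bits;
    }

    /** Calls read repeatedly until it returns 0, setting one bit per doc ID. */
    static BitSet collectAll(DocSource source, int maxDoc, int bufferSize) {
        BitSet bits = new BitSet(maxDoc);
        int[] docs = new int[bufferSize];
        int[] freqs = new int[bufferSize];
        int count;
        while ((count = source.read(docs, freqs)) > 0) {
            for (int i = 0; i < count; i++) {
                bits.set(docs[i]);
            }
        }
        return bits;
    }

    public static void main(String[] args) {
        int[] matches = {2, 5, 7};
        System.out.println(collectOnce(over(matches), 10));
        System.out.println(collectAll(over(matches), 10, 1));
    }
}
```

Looping until read returns 0 works with any buffer size; a larger buffer only reduces the number of calls.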

Custom Filter for Splitting CamelCase?

2011-11-29 Thread Stephen Thomas
List, I have written my own CustomAnalyzer, as follows: public TokenStream tokenStream(String fieldName, Reader reader) { // TODO: add calls to RemovePuncation, and SplitIdentifiers here // First, convert to lower case TokenStream

RE: Custom Filter for Splitting CamelCase?

2011-11-29 Thread Uwe Schindler
e.org > Subject: Custom Filter for Splitting CamelCase? > > List, > > I have written my own CustomAnalyzer, as follows: > > public TokenStream tokenStream(String fieldName, Reader reader) { > > // TODO: add calls to RemovePuncation, and SplitIdentifiers >

Re: Custom Filter for Splitting CamelCase?

2011-11-29 Thread Stephen Thomas
>> -Original Message- >> From: stephen.warner.tho...@gmail.com >> [mailto:stephen.warner.tho...@gmail.com] On Behalf Of Stephen Thomas >> Sent: Tuesday, November 29, 2011 5:20 PM >> To: java-user@lucene.apache.org >> Subject: Custom Filter for Splitting CamelCase?

RE: Custom Filter for Splitting CamelCase?

2011-11-29 Thread Uwe Schindler
yzersTokenizersTokenFilters#solr.WordDelimiterFilterFactory Uwe - Uwe Schindler H.-H.-Meier-Allee 63, D-28213 Bremen http://www.thetaphi.de eMail: u...@thetaphi.de > -Original Message- > From: stephen.warner.tho...@gmail.com > [mailto:stephen.warner.tho...@gmail.com] On Be
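To make the splitting step concrete, here is a minimal sketch in plain Java of breaking an identifier at lower-to-upper case boundaries. It is not the WordDelimiterFilter referenced above and not the poster's CustomAnalyzer; the class name and the regular expression are assumptions chosen for illustration, and acronym runs such as "XMLParser" are deliberately left unsplit.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;

// Hypothetical helper for illustration; not a Lucene or Solr class.
class CamelCaseSplitSketch {
    /** Splits at each lowercase-or-digit to uppercase boundary, then lowercases the parts. */
    static List<String> split(String identifier) {
        List<String> parts = new ArrayList<>();
        // Zero-width split: the lookbehind sees [a-z0-9], the lookahead sees [A-Z].
        for (String part : identifier.split("(?<=[a-z0-9])(?=[A-Z])")) {
            if (!part.isEmpty()) {
                parts.add(part.toLowerCase(Locale.ROOT));
            }
        }
        return parts;
    }

    public static void main(String[] args) {
        System.out.println(split("getDocIdSet"));
        System.out.println(split("tokenStream"));
    }
}
```

One ordering point follows from the technique itself: the split relies on case information, so it has to run before any lowercasing step in the analysis chain, not after it.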

Hits issue or custom filter issue?

2005-09-13 Thread Jeff Rodenburg
nitial thought is the problem lies in the custom filter I've created. myCustomFilter extends Filter, and I'm following the BitSet comparative example as found in the LIA book. I've done nothing in myCustomFilter regarding caching. I'm doubting this is a bug, but rather something I've overlooked. thanks, jeff r.

Re: Hits issue or custom filter issue?

2005-09-13 Thread Chris Hostetter
: Hits h1 = oMultiSearcher.Search(new FilteredQuery(combinedQuery, new : myCustomFilter(1))); : Hits h2 = oMultiSearcher.Search(new FilteredQuery(combinedQuery, new : myCustomFilter(2))); ...do you get the same results if you use... Hits h1 = oMultiSearcher.search(combinedQuery, myCustomFilte

Re: Hits issue or custom filter issue?

2005-09-13 Thread Jeff Rodenburg
Might be the same issue, haven't been able to determine during a step-through on the code exec. You're right, no need to add a new FilteredQuery to the statement, just a search on combinedQuery with a new myCustomFilter. Unfortunately, no joy; same response. -- j On 9/13/05, Chris Hostetter <[E

Re: Hits issue or custom filter issue?

2005-09-13 Thread Chris Hostetter
AIL PROTECTED] : To: Chris Hostetter <[EMAIL PROTECTED]> : Cc: java-user@lucene.apache.org : Subject: Re: Hits issue or custom filter issue? : : Might be the same issue, haven't been able to determine during a : step-through on the code exec. : You're right, no need to add a new Filtered

Re: Hits issue or custom filter issue?

2005-09-13 Thread Chris Hostetter
your problem. : Date: Tue, 13 Sep 2005 17:22:49 -0700 : From: Jeff Rodenburg <[EMAIL PROTECTED]> : Reply-To: java-user@lucene.apache.org, [EMAIL PROTECTED] : To: Chris Hostetter <[EMAIL PROTECTED]> : Cc: java-user@lucene.apache.org : Subject: Re: Hits issue or custom filter issue? : :

Re: Hits issue or custom filter issue?

2005-09-13 Thread Jeff Rodenburg
e) that they have greater-than-zero scores. b) Is it possible that your filter isn't doing anything at all, ie: is there > any execution path that may return a BitSet in which every bit is set? It's either doing nothing or not working at all. This seems to be the likely culprit. St

Re: Hits issue or custom filter issue?

2005-09-14 Thread Jeff Rodenburg
Good call, Chris. I followed the BitSet comparison route and found that the custom filter was working exactly as it should, but *I* wasn't passing it correct data. Rookie mistake. Doh! I hate it when that happens. -- j On 9/13/05, Jeff Rodenburg <[EMAIL PROTECTED]> wrote: >

Custom Filter implementations - necessary to check if doc is deleted ?

2006-12-08 Thread Øyvind Stegard
Hi, I have a question that is probably easy to answer for many of you. I'm using some custom Filters with Lucene, mostly implemented by using TermEnum/TermDocs and checking some condition. Is it necessary to check the deleted-status of documents that the filter includes (and never actually include

Re: Custom Filter implementations - necessary to check if doc is deleted ?

2006-12-08 Thread Chris Hostetter
: Is it necessary to check the deleted-status of documents that the filter : includes (and never actually include deleted documents), or is this done ... : I'm trying to implement an inverted version of a filter, simply by : flipping all the bits in the BitSet, after the filter has fini
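The bit arithmetic behind the question can be sketched with java.util.BitSet alone. Flipping every bit in the range [0, maxDoc) also sets the bits of any deleted documents, because they were not in the original set; whether Lucene requires a filter to clear them is exactly what the thread asks, and the reply is truncated here, so the masking step below is shown only as an option. The class and method names and the separate `deleted` set are hypothetical.

```java
import java.util.BitSet;

// Illustration of the bit operations only; not a Lucene Filter implementation.
class InvertedFilterBits {
    /** Returns a copy of accepted with every bit in [0, maxDoc) flipped. */
    static BitSet invert(BitSet accepted, int maxDoc) {
        BitSet inverted = (BitSet) accepted.clone();
        inverted.flip(0, maxDoc);
        return inverted;
    }

    /** Inverts, then clears the bits of documents marked as deleted. */
    static BitSet invertExcluding(BitSet accepted, BitSet deleted, int maxDoc) {
        BitSet inverted = invert(accepted, maxDoc);
        inverted.andNot(deleted);
        return inverted;
    }

    public static void main(String[] args) {
        BitSet accepted = new BitSet(5);
        accepted.set(1);
        accepted.set(3);
        BitSet deleted = new BitSet(5);
        deleted.set(2);
        System.out.println(invert(accepted, 5));
        System.out.println(invertExcluding(accepted, deleted, 5));
    }
}
```

Both methods work on a clone, so the original filter's bits are left untouched if they are cached elsewhere.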

Re: Custom Filter implementations - necessary to check if doc is deleted ?

2006-12-09 Thread Øyvind Stegard
2006-12-08 Fri 15:41 -0800, Chris Hostetter: > : Is it necessary to check the deleted-status of documents that the filter > : includes (and never actually include deleted documents), or is this done > ... > : I'm trying to implement an inverted version of a filter, simply by > : flipping a