RE: Duplicate Hits

2005-02-01 Thread Jerry Jalenak
l with this issue through some sort of filter on the query side, provided it doesn't impact performance to much. Thanks. Jerry Jalenak Senior Programmer / Analyst, Web Publishing LabOne, Inc. 10101 Renner Blvd. Lenexa, KS 66219 (913) 577-1496 [EMAIL PROTECTED] -Original Message- F

RE: Duplicate Hits

2005-02-01 Thread Jerry Jalenak
Just to make sure I understand Do you keep an IndexReader open at the same time you are running the IndexWriter? From what I can see in the JavaDocs, it looks like only IndexReader (or IndexSearch) can peek into the index and see if a document exists or not Thanks! Jerry Jalenak Senior

RE: Duplicate Hits

2005-02-01 Thread Jerry Jalenak
Nice idea John - one I hadn't considered. Once you have the checksum, do you 'check' in the index first before storing the second document? Or do you filter on the query side? Jerry Jalenak Senior Programmer / Analyst, Web Publishing LabOne, Inc. 10101 Renner Blvd. Lenexa, KS

RE: Duplicate Hits

2005-02-01 Thread Jerry Jalenak
forth, etc) documents. If the indexed fields have changed, then I want to index the 'new' document, and keep it. Given Erik's response of 'don't put duplicate documents in the index', how can I accomplish this in the IndexWriter? Jerry Jalenak Senior Programmer / An

Duplicate Hits

2005-02-01 Thread Jerry Jalenak
Is there a way to eliminate duplicate hits being returned from the index? Jerry Jalenak Senior Programmer / Analyst, Web Publishing LabOne, Inc. 10101 Renner Blvd. Lenexa, KS 66219 (913) 577-1496 [EMAIL PROTECTED] This transmission (and any information attached to it) may be confidential and

RE: Index Layout Question

2005-01-27 Thread Jerry Jalenak
That's good to know. I'm indexing on 11 fields (9 keyword, 2 text). The documents themselves are between 1K to 2K in size. Is there a point at which IndexSearcher performance begins to fall off? (in term of # of index records?) Jerry Jalenak Senior Programmer / Analyst, Web Publish

Index Layout Question

2005-01-27 Thread Jerry Jalenak
), but I'm not sure if the performance gains are really there. Would one monolithic index be better? Thanks. Jerry Jalenak Senior Programmer / Analyst, Web Publishing LabOne, Inc. 10101 Renner Blvd. Lenexa, KS 66219 (913) 577-1496 [EMAIL PROTECTED] This transmission (and any inform

RE: [HOWTO] Setting BooleanQuery MaxClauseCount

2005-01-26 Thread Jerry Jalenak
Never mind. These types of questions is what occurs when one is trying to do too many things at the same time. Jerry Jalenak Senior Programmer / Analyst, Web Publishing LabOne, Inc. 10101 Renner Blvd. Lenexa, KS 66219 (913) 577-1496 [EMAIL PROTECTED] -Original Message- From: Jerry

[HOWTO] Setting BooleanQuery MaxClauseCount

2005-01-26 Thread Jerry Jalenak
Is there a way to set the maxClauseCount field of BooleanQuery when using QueryParser? Jerry Jalenak Senior Programmer / Analyst, Web Publishing LabOne, Inc. 10101 Renner Blvd. Lenexa, KS 66219 (913) 577-1496 [EMAIL PROTECTED] This transmission (and any information attached to it) may be

RE: Filtering w/ Multiple Terms

2005-01-24 Thread Jerry Jalenak
). Everything is good now Thanks! Jerry Jalenak Senior Programmer / Analyst, Web Publishing LabOne, Inc. 10101 Renner Blvd. Lenexa, KS 66219 (913) 577-1496 [EMAIL PROTECTED] > -Original Message- > From: Erik Hatcher [mailto:[EMAIL PROTECTED] > Sent: Monday, January 24, 200

RE: Filtering w/ Multiple Terms

2005-01-24 Thread Jerry Jalenak
h AND (account:0011) and get hits back. When I add the filter back in (which should take care of the account:0011 part of the query), and enter only smith as my query, I get 0 hits. Jerry Jalenak Senior Programmer / Analyst, Web Publishing LabOne, Inc. 10101 Renner Blvd. Lenexa, KS 66219 (913) 577

RE: Filtering w/ Multiple Terms

2005-01-24 Thread Jerry Jalenak
n Setting bit on Leaving AccountFilter... Leaving AccountFilter... Leaving AccountFilter... ... Found 0 matching documents in 1000 ms Can anyone tell me what I've done wrong? Jerry Jalenak Senior Programmer / Analyst, Web Publishing LabOne, Inc. 10101 Renner Blvd. Lenexa, KS 66219 (913) 577-1496 [EMAI

RE: Filtering w/ Multiple Terms

2005-01-21 Thread Jerry Jalenak
OK. But isn't there a limit on the number of BooleanQueries that can be combined with AND / OR / etc? Jerry Jalenak Senior Programmer / Analyst, Web Publishing LabOne, Inc. 10101 Renner Blvd. Lenexa, KS 66219 (913) 577-1496 [EMAIL PROTECTED] > -Original Message- >

Filtering w/ Multiple Terms

2005-01-20 Thread Jerry Jalenak
lter, how are others handling this? Thanks! Jerry Jalenak Senior Programmer / Analyst, Web Publishing LabOne, Inc. 10101 Renner Blvd. Lenexa, KS 66219 (913) 577-1496 [EMAIL PROTECTED] This transmission (and any information attached to it) may be confidential and is intended solely for the use

RE: [newbie] Confused about PrefixQuery

2005-01-19 Thread Jerry Jalenak
Sorry. Thought Luke came bundled with Lucene, and I was just missing it.. Jerry Jalenak Senior Programmer / Analyst, Web Publishing LabOne, Inc. 10101 Renner Blvd. Lenexa, KS 66219 (913) 577-1496 [EMAIL PROTECTED] > -Original Message- > From: Erik Hatcher [mailto:[EMAIL PRO

RE: [newbie] Confused about PrefixQuery

2005-01-19 Thread Jerry Jalenak
Never mind. Stupid, stupid assumption on my part with the data. Thanks anyway. Jerry Jalenak Senior Programmer / Analyst, Web Publishing LabOne, Inc. 10101 Renner Blvd. Lenexa, KS 66219 (913) 577-1496 [EMAIL PROTECTED] > -Original Message- > From: Jerry Jalenak [mailto:

RE: [newbie] Confused about PrefixQuery

2005-01-19 Thread Jerry Jalenak
Erik, Thanks for reply. Some lists want all the info, some don't. Just thought I'd try to provide as much info as possible 8-) That being said, where do I find Luke? Jerry Jalenak Senior Programmer / Analyst, Web Publishing LabOne, Inc. 10101 Renner Blvd. Lenexa, KS 66219 (913

[newbie] Confused about PrefixQuery

2005-01-19 Thread Jerry Jalenak
528960058, DOB = 19010101, Collected = 20050118, Created = 20050119 Hit 7: Specimen = 38247027, Account = 23SQ, Status = N, Name = ROBERT BASTOW, SSN = 528960058, DOB = 19010101, Collected = 20050118, Created = 20050119 Hit 8: Specimen = 38247027, Account = 23SQ, Status = N, Name = ROBERT BASTOW, S