Search Architect Job

2004-02-05 Thread search architect
Hello Everybody, This may not be the right group for posting a job, but I couldn't find a better one. My friend's company is looking for a full time Technical Architect/designer. I don't know the exact job title. Here is the one line description of who they are looking for. It's a small sized c

Re: Newbie Phrase Query question

2004-02-05 Thread Erik Hatcher
On Feb 5, 2004, at 8:19 PM, Scott Smith wrote: There is a minor issue I found that I think works as documented, but wonder why it's that way. If you enter a search string that's a hyphenated word such as "fred-bill" (w/o the quotes), the QueryParser generates a search string to find all documents

RE: Newbie Phrase Query question

2004-02-05 Thread Scott Smith
Actually, I found your "QueryParser Rules" article the most useful. It explained a number of things that I had puzzled about. Query.toString() helped also. So, obvious in hindsight, an exact phrase match still goes through the tokenizer. If there are stop words, or you're stemming, etc., you n
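A minimal sketch of the point made in this message, that a quoted phrase is run through the same analysis as the indexed text. This is a toy model in Python, not Lucene's analyzer: the stop-word list, `analyze`, and `phrase_matches` are hypothetical names made up for illustration, and token positions are ignored.

```python
import re

# Hypothetical stop-word list, for illustration only.
STOP_WORDS = {"a", "an", "and", "of", "the", "to"}


def analyze(text):
    """Toy analyzer: lowercase, split on non-alphanumerics, drop stop words."""
    tokens = re.findall(r"[a-z0-9]+", text.lower())
    return [t for t in tokens if t not in STOP_WORDS]


def phrase_matches(phrase, document):
    """True if the analyzed phrase occurs as a contiguous run of analyzed document tokens."""
    p, d = analyze(phrase), analyze(document)
    if not p:
        return False
    return any(d[i:i + len(p)] == p for i in range(len(d) - len(p) + 1))


# A hyphenated word is split into two tokens by this toy analyzer,
# so it behaves like a two-word phrase.
print(analyze("fred-bill"))
# Stop words vanish from both sides, so these "exact" phrases still match.
print(phrase_matches("king of the hill", "King Hill"))
```

Under these assumptions the phrase is only "exact" with respect to the tokens that survive analysis, which is why printing the parsed query helps when results look surprising.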

Re: Query question

2004-02-05 Thread Erik Hatcher
On Feb 5, 2004, at 3:27 PM, Justin Woody wrote: If I search the index for "building" it comes back fine (2 records) or "builder" (1 record), but if I search for "build*" I only receive one record, in my example, the second record. The client would like all 3 records to come back. Is there a way I ca

Re: [OT] Re: Need Advices and Help

2004-02-05 Thread Caroline Jen
Hi, I did not explain my problem well, which resulted in some misunderstanding. I also do not have any experience with Lucene. My Project: build a message inbox for all "registered members" of a web site. Messages: Messages are sent and received "within" the web site. Messages ar

Re: SQLDirectory

2004-02-05 Thread Doug Cutting
Philippe Laflamme wrote: I've worked on an implementation for Postgres. I used the Large Object API provided by the Postgres JDBC driver. It works fine but I doubt it is very scalable because the number of open connections during indexing can become very high. Lucene opens many different files when

Re: Using Explain and fieldNorm

2004-02-05 Thread Doug Cutting
Using the terminology in http://jakarta.apache.org/lucene/docs/api/org/apache/lucene/search/Similarity.html, fieldNorm is defined as getBoost(t.field in d) * lengthNorm(t.field in d). These two values are multiplied into a single value at index time, and it is unfortunately impossible to separa
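A small worked sketch of why the product cannot be split back into its two factors. It assumes the default length normalization is 1/sqrt(number of terms in the field), which is not stated in this message; `length_norm` and `field_norm` are hypothetical helper names, and the single-byte encoding Lucene applies to the stored norm is left out.

```python
import math


def length_norm(num_terms):
    """Assumed default length normalization: 1 / sqrt(number of terms in the field)."""
    return 1.0 / math.sqrt(num_terms)


def field_norm(boost, num_terms):
    """fieldNorm as defined above: field boost multiplied by the length norm."""
    return boost * length_norm(num_terms)


# Two different (boost, length) pairs collapse to the same stored value.
a = field_norm(1.0, 4)    # 1.0 * 1/sqrt(4)
b = field_norm(2.0, 16)   # 2.0 * 1/sqrt(16)
print(a, b)
```

Since only the product is kept, an Explain output showing a given fieldNorm cannot say whether it came from a short unboosted field or a longer boosted one.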

Query question

2004-02-05 Thread Justin Woody
Hello all, I am a relatively new user to Lucene. I have indexed several database tables, but I am having problems returning the search results I would like. Here is the example: I have an unStored index on the description column. Description examples: department a building 4 building 9 department 2 s

File locking problem still exists

2004-02-05 Thread Ardor Wei
Hi, we have 2 threads: 1 writes to general index (D:\lucent\index) and the other writes to real-time index (D:\lucene\realtimeIndex). The threads are actually running a loop (writing to the directory once every minute). We still always have problems like "can't delete _f17" and "can't find _f17". Our

RE: FUTEX LOCK - IBM jre probs

2004-02-05 Thread Dan Quaroni
I've tried the IBM JRE on a number of occasions and have always found that it didn't behave the way the Sun one does... In the end I gave up on it. -Original Message- From: John Moylan [mailto:[EMAIL PROTECTED] Sent: Thursday, February 05, 2004 10:10 AM To: Lucene Users List Subject: FUT

Using Explain and fieldNorm

2004-02-05 Thread Grant Ingersoll
Hi, Was wondering what the fieldNorm section means when using the Explain functionality? How does this relate to the scoring algorithm given in the Similarity javadocs? Thanks, Grant

FUTEX LOCK - IBM jre probs

2004-02-05 Thread John Moylan
Hi Folks, I'm getting a FUTEX lock when I use the indexer with IBMJava2-JRE-1.4.1-8 (the JRE that comes with RedHat AS3). Sun's 1.4 SDK works fine. Best Regards, John -- John Moylan -- ePublishing Radio Telefis Eireann, Montrose House, Donnybrook, Dublin 4, Eire t:+353 1 20

Re: ANNOUNCE: Plucene

2004-02-05 Thread Charlie Smith
Can we post to this forum for help with Plucene? There is reference to a simplanalyzer. Is there another that would be used to parse MS Word or WordPerfect files? >>> [EMAIL PROTECTED] 2/2/2004 5:54:03 PM >>> Plucene 1.0 has just been released to CPAN, and is available at http://search.cpan.org

Re: ANNOUNCE: Plucene

2004-02-05 Thread Charlie Smith
Can we post to this forum for help with Plucene? There is reference to a simplanalyzer. Is there another that would be used to parse MS Word or WordPerfect files? >>>Tony wrote on 2/2/2004 5:54:03 PM >>> Plucene 1.0 has just been released to CPAN, and is available at http://search.cpan.org/dist

[OT] Re: Need Advices and Help

2004-02-05 Thread petite_abeille
On Feb 05, 2004, at 13:01, Otis Gospodnetic wrote: I believe it would be the value of a 'Message-ID' or 'Reference' or 'Reference-ID' message header. However, I remember reading that mail readers are not very good at sticking to a standard (some RFC, I guess), so they don't always provide the corr

Re: ANNOUNCE: Plucene

2004-02-05 Thread Otis Gospodnetic
Good news, I was looking forward to the Perl port. I added it to the list of Lucene ports on Lucene site. Otis --- Tony Bowden <[EMAIL PROTECTED]> wrote: > > Plucene 1.0 has just been released to CPAN, and is available at > http://search.cpan.org/dist/Plucene/ > > This is a port of Lucene to

Re: Need Advices and Help

2004-02-05 Thread Otis Gospodnetic
I believe it would be the value of a 'Message-ID' or 'Reference' or 'Reference-ID' message header. However, I remember reading that mail readers are not very good at sticking to a standard (some RFC, I guess), so they don't always provide the correct ID, or they store it under non-standard names,
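A minimal sketch of reading the threading headers from a message, using Python's standard `email` module. It assumes the header names standardized for Internet mail (`Message-ID`, `In-Reply-To`, `References`), which differ slightly from the names recalled in this message; the sample message, its addresses, and the `thread_ids` helper are made up for illustration.

```python
from email import message_from_string

# Made-up sample message; the IDs and subject are placeholders.
RAW = """\
Message-ID: <reply-1@example.org>
In-Reply-To: <root-1@example.org>
References: <root-1@example.org>
Subject: Re: hello

body
"""


def thread_ids(raw):
    """Return the IDs needed to place a message in a thread.

    Missing headers come back as None (or an empty list), which is the
    case to plan for when mail clients omit or rename them.
    """
    msg = message_from_string(raw)
    refs = (msg.get("References") or "").split()
    return {
        "id": msg.get("Message-ID"),
        "parent": msg.get("In-Reply-To"),
        "references": refs,
    }


print(thread_ids(RAW))
```

Values like these could be stored as untokenized fields alongside each indexed message so that replies can be looked up by the ID of their parent, with a fallback for messages whose headers are absent.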

Re: Need Advices and Help

2004-02-05 Thread Otis Gospodnetic
I believe it would be something like Message-ID or --- Caroline Jen <[EMAIL PROTECTED]> wrote: > I am trying to build message inboxes for all > registered members of a web site. Therefore, each > thread (i.e. under a certain discussion topic) can > have several postings. And each registered memb