Re: Starts With x and Ends With x Queries

2005-02-04 Thread Peter Pimley
I sent this to the wrong address. Sorry. Peter Pimley wrote: Well done. - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]

Re: Starts With x and Ends With x Queries

2005-02-04 Thread Peter Pimley
Well done. I was so annoyed with the humiliation-for-kicks this afternoon that I just practised my self-destruction technicques with some friends this evening ;) As for configuration, java.lang.system.getenv will give you access to an environment variable. http://java.sun.com/j2se/1.5.0/docs/

Source code for an accent-removal filter

2005-02-01 Thread Peter Pimley
Hi. In December I made some posts concerning a filter that could work by getting the unicode name of a character and trying to figure out the closest latin equivalent. For example, if it encountered character 00C1 LATIN CAPITAL LETTER A WITH ACUTE, it would be clever enough to replace that wit

Re: lucene integration with relational database

2005-01-15 Thread Peter Pimley
sunil goyal wrote: But can i do for instance a unified query where i want to take certain parameters (non-textual e.g. age < 30 ) from relational databases and keywords from the lucene index ? When I have had to do this, I've done the lucene search first, and then manually filtered out the hits

(Offtopic) The unicode name for a character

2004-12-22 Thread Peter Pimley
Hi everyone, The Question: In Java generally, Is there an easy way to get the unicode name of a character? (e.g. "LATIN SMALL LETTER A" from 'a') The Reasoning (for those who are interested): The documents I'm indexing have quite a lot of characters that are basically variations on the basic A-

Re: boosting challenge

2004-11-29 Thread Peter Pimley
Frank Morton wrote: Apologies to all. Worse than RTFM. A post-sorting was being done to the search results, alphabetizing by name. Doh... I was going to say that I'm doing exactly what you describe with weighting one field more than the other and it works just fine. Still, you've found the probl

Re: Index in RAM - is it realy worthy?

2004-11-24 Thread Peter Pimley
[EMAIL PROTECTED] wrote: As I experiensed on my 2 cpu box, during the query execution both processors were realy busy. The question is would it accelerate speed if I get 4 cpu box, 10 cpu... I mean real performance boost (at least factor 10), not just %-ge. I'm no expert on this, but I would

Re: QueryParser: "[stopword] AND something" throws Exception

2004-11-12 Thread Peter Pimley
Thanks for pointing that out, and sorry for reporting a duplicate bug. I went here: http://jakarta.apache.org/site/binindex.cgi and the lucene link about halfway down the page links to 1.4-final. I didn't find my way to the page that announces 1.4.2. I'll install 1.4.2 on Monday morning, which

QueryParser: "[stopword] AND something" throws Exception

2004-11-12 Thread Peter Pimley
ortunately I am up against a deadline right now so I can't fix this myself. I'm just going to filter out stop words before feeding them to the query parser. I'll try to have a look at it in roughly 2 weeks time if nobody else has solved it. Peter Pimley, Semantico Here is the st

LetterTokenizer to allow digits

2004-11-05 Thread Peter Pimley
scussed earlier. I googled for the relevant terms and found nothing. Thanks, Peter Pimley, Semantico. - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]

Correct way to search across all fields

2004-09-17 Thread Peter Pimley
Have I missed the correct way to do this? Thanks in advance, Peter Pimley - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]

Re: (n00b) Meaning of Hits.id (int)

2004-09-09 Thread Peter Pimley
Oh, it's that simple. :) Thanks for that! Peter Morus Walter wrote: It's lucenes internal id or document number which allows you to access the document and its stored fields. See IndexSearcher.doc(int i) or IndexReader.document(int n) The docs just don't name the parameter 'id'. -

(n00b) Meaning of Hits.id (int)

2004-09-09 Thread Peter Pimley
n't find any mention anywhere else about Document ids. Could anybody explain what this is? Many Thanks in Advance, Peter Pimley, Semantico - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]