wild card with keyword fileld

2005-07-18 Thread Rahul D Thakare
  Hi, I am using Field.Keyword for indexing multi-word keyword (eg: MAIN LOGIG). Also used keywordAnalyzer, but wild card search is not coming up. Is there anything which I need to do in addition or, wild card search is not possible with keyword field. thanks and regards, Rahul Thakare..

Re: n-gram indexing

2005-07-18 Thread Chris Lamprecht
Can you run a phrase query with high slop factor? like "united states of america"~99 This will match documents with all the terms anywhere in the document. But it will also give higher scores when the terms are closer together, using edit distance I believe. -chris On 7/18/05, Rajesh Mun

Re: Best Practices for Distributing Lucene Indexing and Searching

2005-07-18 Thread Chris Lamprecht
See the paper at: http://labs.google.com/papers/mapreduce.html "MapReduce is a programming model and an associated implementation for processing and generating large data sets. Users specify a map function that processes a key/value pair to generate a set of intermediate key/value pairs, and a re

Re: n-gram indexing

2005-07-18 Thread Andy Roberts
On Monday 18 Jul 2005 22:06, Rajesh Munavalli wrote: > Intution behind adding n-grams is to boost naturally occurring larger > phrases versus using phrase queries. For example, if I am searching for > "united states of america", I want the search results to return the > documents ordered as follows

RE: n-gram indexing

2005-07-18 Thread Chris Hostetter
Your intuition is right, but i can't think of any reason why you need to add the n-grams at indexing time -- or why using phrase queries would be a bad thing in this case. When you get a multiword query, construct the n-grams of the query words as multiple phrase queries and search for a BooleanQ

RE: n-gram indexing

2005-07-18 Thread Rajesh Munavalli
Intution behind adding n-grams is to boost naturally occurring larger phrases versus using phrase queries. For example, if I am searching for "united states of america", I want the search results to return the documents ordered as follows Rank 1 - Documents containing all the words occurring toget

Re: n-gram indexing

2005-07-18 Thread Andy Roberts
On Monday 18 Jul 2005 21:27, Rajesh Munavalli wrote: > At what point do I add n-grams? Does the order in which I add n-grams > affect exact phrase queries later? My questions are > > (1) Should I add all the 1-grams followed by 2-grams followed by > 3-grams..etc sentence by sentence OR > > (2) Add

Re: Static index on CD, how much forward compatible is the index

2005-07-18 Thread Otis Gospodnetic
I was going to make the same suggestion. That's how I'd do it. For index format changes, check CHANGES.txt file (should be linked from Lucene's home page). Otis --- Chris Hostetter <[EMAIL PROTECTED]> wrote: > Why not put the application on the CD as well? .. or at a minimum, > the > lucene JAR

Re: Static index on CD, how much forward compatible is the index

2005-07-18 Thread Chris Hostetter
Why not put the application on the CD as well? .. or at a minimum, the lucene JAR. That way you never have to worry about the possibility of it changing. : Date: Mon, 18 Jul 2005 16:32:33 -0400 : From: "Tardif, Sebastien" <[EMAIL PROTECTED]> : Reply-To: java-user@lucene.apache.org : To: java-user

Static index on CD, how much forward compatible is the index

2005-07-18 Thread Tardif, Sebastien
Otis, Let say I have a static index on CD, and my customer will be using this CD for the next 10 years. I will also probably fix bug in the application that use the CD and probably want to upgrade Lucene library at some point. How much backward compatible is Lucene with old index when Lucene relea

Writting a custom query/indexSearcher

2005-07-18 Thread Robichaud, Jean-Philippe
Hi everyone. I need a special query type that looks like a phrase query but with special logic inside (like allowing inversions of certain terms only and not of others, special score manipulation on certain 'events', ...) I wonder what approach I should take? How does someone build a custom q

n-gram indexing

2005-07-18 Thread Rajesh Munavalli
At what point do I add n-grams? Does the order in which I add n-grams affect exact phrase queries later? My questions are (1) Should I add all the 1-grams followed by 2-grams followed by 3-grams..etc sentence by sentence OR (2) Add all the 1 grams of entire document first before starting 2-grams

Re: BOOLEAN OPERATOR HOWTO

2005-07-18 Thread Erik Hatcher
On Jul 18, 2005, at 8:12 AM, Karthik N S wrote: I have 2 Questions. But there were no question marks! I don't understand your questions at all, sorry, but I'll see if I can decipher it somewhat 1) The Search Criteria src requires to automatically fill " " between Search words

Re: BOOLEAN OPERATOR HOWTO

2005-07-18 Thread Nader Henein
Sounds like straight forward string manipulation, just (a) tokenize your string into a collection, and then (b) manipulate the collection to reach your desired stream, then (c) join it back into a string. Oh and once again Erik is spelled consistently with a 'k' not a 'c', you got the first one

Re: Index locked exception while updating index

2005-07-18 Thread Harini Raghavan
Chris, The reason I am creating the IndexWriter for every call to addDocument is because the update to index gets triggered by JMS events, so I will not know at what point to close the IndexWriter. Your suggestion to have a separate management thread to close the writer sounds good, I will try

BOOLEAN OPERATOR HOWTO

2005-07-18 Thread Karthik N S
  Hi Lucene Apologies.. I have 2 Questions. 1) The Search Criteria  src  requires to automatically fill   "  "  between  Search words with a Boolean Operator   "  AND ".   2) The Search Criteria  src  requires to automatically recognise the existing  Boolean Query  ' AN

RE: Best Practices for Distributing Lucene Indexing and Searching

2005-07-18 Thread Peter Gelderbloem
I am thinking of having a cluster of one indexer and a few searchers 1 to n. The indexer will consist of a number of stages as defined in SEDA. I must still do this decomposition. the resulting index will be published via message q to the searchers that will stop doing searches long enough to upda

Re: Index locked exception while updating index

2005-07-18 Thread Chris Hostetter
I freely admit that i wasn't paying much attention to the begining of this thread, so maybe there is some subtlety i'm missing, but i don't understand why your "addDocument" method needs to be synchronized. I believe the reason you are getting the lock exception is because you are opening a seper