Hi,
I am using Field.Keyword for indexing multi-word keyword (eg: MAIN LOGIG).
Also used keywordAnalyzer, but wild card search is not coming up. Is there
anything which I need to do in addition or, wild card search is not possible
with keyword field.
thanks and regards,
Rahul Thakare..
Can you run a phrase query with high slop factor? like
"united states of america"~99
This will match documents with all the terms anywhere in the document.
But it will also give higher scores when the terms are closer
together, using edit distance I believe.
-chris
On 7/18/05, Rajesh Mun
See the paper at: http://labs.google.com/papers/mapreduce.html
"MapReduce is a programming model and an associated implementation for
processing and generating large data sets. Users specify a map
function that processes a key/value pair to generate a set of
intermediate key/value pairs, and a re
On Monday 18 Jul 2005 22:06, Rajesh Munavalli wrote:
> Intution behind adding n-grams is to boost naturally occurring larger
> phrases versus using phrase queries. For example, if I am searching for
> "united states of america", I want the search results to return the
> documents ordered as follows
Your intuition is right, but i can't think of any reason why you need to
add the n-grams at indexing time -- or why using phrase queries would be a
bad thing in this case. When you get a multiword query, construct the
n-grams of the query words as multiple phrase queries and search for a
BooleanQ
Intution behind adding n-grams is to boost naturally occurring larger
phrases versus using phrase queries. For example, if I am searching for
"united states of america", I want the search results to return the
documents ordered as follows
Rank 1 - Documents containing all the words occurring toget
On Monday 18 Jul 2005 21:27, Rajesh Munavalli wrote:
> At what point do I add n-grams? Does the order in which I add n-grams
> affect exact phrase queries later? My questions are
>
> (1) Should I add all the 1-grams followed by 2-grams followed by
> 3-grams..etc sentence by sentence OR
>
> (2) Add
I was going to make the same suggestion. That's how I'd do it.
For index format changes, check CHANGES.txt file (should be linked from
Lucene's home page).
Otis
--- Chris Hostetter <[EMAIL PROTECTED]> wrote:
> Why not put the application on the CD as well? .. or at a minimum,
> the
> lucene JAR
Why not put the application on the CD as well? .. or at a minimum, the
lucene JAR. That way you never have to worry about the possibility of it
changing.
: Date: Mon, 18 Jul 2005 16:32:33 -0400
: From: "Tardif, Sebastien" <[EMAIL PROTECTED]>
: Reply-To: java-user@lucene.apache.org
: To: java-user
Otis,
Let say I have a static index on CD, and my customer will be using this
CD for the next 10 years. I will also probably fix bug in the
application that use the CD and probably want to upgrade Lucene library
at some point. How much backward compatible is Lucene with old index
when Lucene relea
Hi everyone.
I need a special query type that looks like a phrase query but with special
logic inside (like allowing inversions of certain terms only and not of
others, special score manipulation on certain 'events', ...) I wonder what
approach I should take? How does someone build a custom q
At what point do I add n-grams? Does the order in which I add n-grams
affect exact phrase queries later? My questions are
(1) Should I add all the 1-grams followed by 2-grams followed by
3-grams..etc sentence by sentence OR
(2) Add all the 1 grams of entire document first before starting 2-grams
On Jul 18, 2005, at 8:12 AM, Karthik N S wrote:
I have 2 Questions.
But there were no question marks! I don't understand your questions
at all, sorry, but I'll see if I can decipher it somewhat
1) The Search Criteria src requires to automatically fill " "
between Search words
Sounds like straight forward string manipulation, just (a) tokenize your string
into a collection, and then (b) manipulate the collection to reach your desired
stream, then (c) join it back into a string. Oh and once again Erik is spelled
consistently with a 'k' not a 'c', you got the first one
Chris,
The reason I am creating the IndexWriter for every call to addDocument is
because the update to index gets triggered by JMS events, so I will not know
at what point to close the IndexWriter.
Your suggestion to have a separate management thread to close the writer
sounds good, I will try
Hi
Lucene
Apologies..
I have 2
Questions.
1)
The Search Criteria src requires to automatically
fill " " between Search words with a Boolean
Operator " AND ".
2)
The Search Criteria src requires to automatically
recognise the existing Boolean Query ' AN
I am thinking of having a cluster of one indexer and a few searchers 1
to n.
The indexer will consist of a number of stages as defined in SEDA. I
must still do this decomposition. the resulting index will be published
via message q to the searchers that will stop doing searches long enough
to upda
I freely admit that i wasn't paying much attention to the begining of this
thread, so maybe there is some subtlety i'm missing, but i don't
understand why your "addDocument" method needs to be synchronized.
I believe the reason you are getting the lock exception is because you are
opening a seper
18 matches
Mail list logo