Re: EdgeNGram relevancy

2010-11-16 Thread Robert Gründler
it seems adding the '+' (required) operator to each term in a multi-term query does the trick: http://lucene.apache.org/java/2_4_0/queryparsersyntax.html#+ ie: edgytext2:(+Martin +Sco) -robert On Nov 16, 2010, at 8:52 PM, Robert Gründler wrote: > thanks for the explanation. > > the result

Re: EdgeNGram relevancy

2010-11-16 Thread Robert Gründler
thanks for the explanation. the results for the autocompletion are pretty good now, but we still have a small problem. When there are hits in the "edgytext2" fields, results which only have hits in the "edgytext" field should not be returned at all. Example: Query: "Martin Sco" Current Resu

Re: EdgeNGram relevancy

2010-11-11 Thread Jonathan Rochkind
Without the parens, the "edgytext:" only applied to "Mr", the default field still applied to "Scorcese". The double quotes are neccesary in the second case (rather than parens), because on a non-tokenized field because the standard query parser will "pre-tokenize" on whitespace before sending

Re: EdgeNGram relevancy

2010-11-11 Thread Robert Gründler
> > Did you run your query without using () and "" operators? If yes can you try > this? > &q=edgytext:(Mr Scorsese) OR edgytext2:"Mr Scorsese"^2.0 I didn't use () and "" in my query before. Using the query with those operators works now, stopwords are thrown out as the should, thanks. However,

Re: EdgeNGram relevancy

2010-11-11 Thread Andy
Ah I see. Thanks for the explanation. Could you set the defaultOperator to "AND"? That way both "Bill" and "Cl" must be a match and that would exclude "Clyde Phillips". --- On Thu, 11/11/10, Robert Gründler wrote: > From: Robert Gründler > Su

Re: EdgeNGram relevancy

2010-11-11 Thread Robert Gründler
itespaceTokenizerFactory. And query both these fields with >> an OR operator. >> >> edgytext:(Bill Cl) OR edgytext2:"Bill Cl" >> >> You can even apply boost so that begins with matches comes >> first. >> >> --- On Thu, 11/11/10, Robert Gründle

Re: EdgeNGram relevancy

2010-11-11 Thread Andy
t begins with matches comes > first. > > --- On Thu, 11/11/10, Robert Gründler > wrote: > > > From: Robert Gründler > > Subject: EdgeNGram relevancy > > To: solr-user@lucene.apache.org > > Date: Thursday, November 11, 2010, 5:51 PM > > Hi, > > &g

Re: EdgeNGram relevancy

2010-11-11 Thread Nick Martin
On 12 Nov 2010, at 01:46, Ahmet Arslan wrote: >> This setup now makes troubles regarding StopWords, here's >> an example: >> >> Let's say the index contains 2 Strings: "Mr Martin >> Scorsese" and "Martin Scorsese". "Mr" is in the stopword >> list. >> >> Query: edgytext:Mr Scorsese OR edgytext2

Re: EdgeNGram relevancy

2010-11-11 Thread Ahmet Arslan
> This setup now makes troubles regarding StopWords, here's > an example: > > Let's say the index contains 2 Strings: "Mr Martin > Scorsese" and "Martin Scorsese". "Mr" is in the stopword > list. > > Query: edgytext:Mr Scorsese OR edgytext2:Mr Scorsese^2.0 > > This way, the only result i get is

Re: EdgeNGram relevancy

2010-11-11 Thread Robert Gründler
l" > > You can even apply boost so that begins with matches comes first. > > --- On Thu, 11/11/10, Robert Gründler wrote: > >> From: Robert Gründler >> Subject: EdgeNGram relevancy >> To: solr-user@lucene.apache.org >> Date: Thursday, November 11

Re: EdgeNGram relevancy

2010-11-11 Thread Ahmet Arslan
10, Robert Gründler wrote: > From: Robert Gründler > Subject: EdgeNGram relevancy > To: solr-user@lucene.apache.org > Date: Thursday, November 11, 2010, 5:51 PM > Hi, > > consider the following fieldtype (used for > autocompletion): > >   positio

EdgeNGram relevancy

2010-11-11 Thread Robert Gründler
Hi, consider the following fieldtype (used for autocompletion): This works fine as long as the query string is a single word. For multiple words, the ranking is weird though. Example: Que