Re: Text search NGram

2016-03-16 Thread Emir Arnautovic
ir.arnauto...@sematext.com] Sent: Wednesday, March 16, 2016 3:41 PM To: solr-user@lucene.apache.org Subject: Re: Text search NGram Hi Rajesh, It seems that title length is not different enough to have different fieldNorm - in all titles it is 0.5 so all documents for exact match query result in same score.

Re: Text search NGram

2016-03-16 Thread Emir Arnautovic
y anyone other than the intended person(s) is prohibited. -Original Message- From: Emir Arnautovic [mailto:emir.arnauto...@sematext.com] Sent: Wednesday, March 16, 2016 2:39 PM To: solr-user@lucene.apache.org Subject: Re: Text search NGram Hi Rajesh, Did you reindex afters setting omitNor

Re: Text search NGram

2016-03-16 Thread Emir Arnautovic
alent Measurement products and services. If you have received this e-mail in error, please notify the sender and immediately, destroy all copies of this email and its attachments. The publication, copying, in whole or in part, or use or dissemination in any other way of this e-mail and attachmen

RE: Text search NGram

2016-03-16 Thread G, Rajesh
o CEB and/or its subsidiaries, including CEB subsidiaries that offer SHL Talent Measurement products and services. If you have received this e-mail in error, please notify the sender and immediately, destroy all copies of this email and its attachments. The publication, copying, in whole or in par

Re: Text search NGram

2016-03-07 Thread Jack Krupansky
re intended only for the use of the > addressee(s) and may contain confidential and legally privileged > information belonging to CEB and/or its subsidiaries, including CEB > subsidiaries that offer SHL Talent Measurement products and services. If > you have received this e-mail in erro

RE: Text search NGram

2016-03-07 Thread G, Rajesh
by anyone other than the intended person(s) is prohibited. -Original Message- From: Jack Krupansky [mailto:jack.krupan...@gmail.com] Sent: Monday, March 7, 2016 8:24 PM To: solr-user@lucene.apache.org Subject: Re: Text search NGram The charFilter isn't doing anything useful -

Re: Text search NGram

2016-03-07 Thread Jack Krupansky
The charFilter isn't doing anything useful - the white space tokenzier will ignore extra white space anyway. -- Jack Krupansky On Mon, Mar 7, 2016 at 5:44 AM, G, Rajesh wrote: > Hi Team, > > We have the blow type and we have indexed the value "title": "Microsoft > Visual Studio 2006" and "titl

RE: Text search NGram

2016-03-07 Thread G, Rajesh
: Emir Arnautovic [mailto:emir.arnauto...@sematext.com] Sent: Monday, March 7, 2016 8:16 PM To: solr-user@lucene.apache.org Subject: Re: Text search NGram Not sure I understood question. What I meant is you to try setting omitNorms="false" to your txt_token field type if you want to stick

Re: Text search NGram

2016-03-07 Thread Emir Arnautovic
n part, or use or dissemination in any other way of this e-mail and attachments by anyone other than the intended person(s) is prohibited. -Original Message- From: G, Rajesh [mailto:r...@cebglobal.com] Sent: Monday, March 7, 2016 7:50 PM To: solr-user@lucene.apache.org Subject: RE: Text se

Re: Text search NGram

2016-03-07 Thread Emir Arnautovic
age- From: Emir Arnautovic [mailto:emir.arnauto...@sematext.com] Sent: Monday, March 7, 2016 7:36 PM To: solr-user@lucene.apache.org Subject: Re: Text search NGram Hi Rajesh, It is most likely related to norms - you can try setting omitNorms="true" and reindexing content. Anyway, it is no

RE: Text search NGram

2016-03-07 Thread G, Rajesh
intended person(s) is prohibited. -Original Message- From: G, Rajesh [mailto:r...@cebglobal.com] Sent: Monday, March 7, 2016 7:50 PM To: solr-user@lucene.apache.org Subject: RE: Text search NGram Hi Emir, Thanks for you email. Can you please help me to understand what do you mean by &quo

RE: Text search NGram

2016-03-07 Thread G, Rajesh
of this e-mail and attachments by anyone other than the intended person(s) is prohibited. -Original Message- From: Emir Arnautovic [mailto:emir.arnauto...@sematext.com] Sent: Monday, March 7, 2016 7:36 PM To: solr-user@lucene.apache.org Subject: Re: Text search NGram Hi Rajesh, It is m

Re: Text search NGram

2016-03-07 Thread Emir Arnautovic
Hi Rajesh, It is most likely related to norms - you can try setting omitNorms="true" and reindexing content. Anyway, it is not common to use just ngrams for matching content - in such case you can expect more unexpected ordering/results. You should combine ngrams fields with normally tokenized

RE: Text search NGram

2016-03-07 Thread G, Rajesh
Dalal [mailto:binoydala...@gmail.com] Sent: Monday, March 7, 2016 5:12 PM To: solr-user@lucene.apache.org Subject: Re: Text search NGram What query parser are you using? Additionally, run the same query with &debugQuery=true and see how your results are being scored to find out why the ms vs

Re: Text search NGram

2016-03-07 Thread Binoy Dalal
What query parser are you using? Additionally, run the same query with &debugQuery=true and see how your results are being scored to find out why the ms vs 2006 shows up before 2005. On Mon, 7 Mar 2016, 16:14 G, Rajesh, wrote: > Hi Team, > > We have the blow type and we have indexed the value

Text search NGram

2016-03-07 Thread G, Rajesh
Hi Team, We have the blow type and we have indexed the value "title": "Microsoft Visual Studio 2006" and "title": "Microsoft Visual Studio 8.0.61205.56 (2005)" When I search for title:(Microsoft Visual AND Studio AND 2005) I get Microsoft Visual Studio 8.0.61205.56 (2005) as the second record