Re: Phrase Exact Match with Margin of Error

2017-06-15 Thread Susheel Kumar
Agree, that's the challenge. Since ComplexPhraseQuery parser needs terms analyzed/tokenized and if don't, it can't really operate at individual tokens with fuzzy or wildcard matches. The solution I can think of is to execute query against both the fields (KeywordTokenized..) and Non-KeywordTokeniz

Re: Phrase Exact Match with Margin of Error

2017-06-15 Thread simon
I think that's because the KeywordTokenizer by definition produces a single token (not a phrase). Perhaps you could create two fields by a copyField - the one you already have(field1), and one tokenized using StandardTokenizer or WhiteSpaceTokenizer(field2) which will produce a phrase with multipl

Re: Phrase Exact Match with Margin of Error

2017-06-15 Thread Max Bridgewater
Thanks Susheel. The challenge is that if I search for the word "between" alone, I still get plenty of results. In a way I want the query to match the document title exactly (up to a few characters) and the document title match the query exactly (up to a few characters). KeywordTokenizer allows tha

Re: Phrase Exact Match with Margin of Error

2017-06-15 Thread Susheel Kumar
CompledPhraseQuery parser is what you need to look https://cwiki.apache.org/confluence/display/solr/Other+Parsers#OtherParsers-ComplexPhraseQueryParser. See below for e.g. http://localhost:8983/solr/techproducts/select?debugQuery=on&indent=on&q=manu:%22Bridge%20the%20gat~1%20between%20your%20ski

Phrase Exact Match with Margin of Error

2017-06-15 Thread Max Bridgewater
Hi, I am trying to do phrase exact match. For this, I use KeywordTokenizerFactory. This basically does what I want to do. My field type is defined as follows: In addition to this, I want to tolerate typos of two or three lett