It's worth to look into <explain> to check particular scoring values. But for most suspect is the reducing precision when float norms are stored in byte vals. See javadoc for DefaultSimilarity.encodeNormValue(float)
On Mon, Dec 8, 2014 at 5:49 PM, S.L <simpleliving...@gmail.com> wrote: > I have two documents doc1 and doc2 and each one of those has a field called > phoneName. > > doc1:phoneName:"Details about Apple iPhone 4s - 16GB - White (Verizon) > Smartphone Factory Unlocked" > > doc2:phoneName:"Apple iPhone 4S 16GB for Net10, No Contract, White" > > Here if I search for > > q=iphone+4s+16gb&qf=phoneName&mm=1&pf=phoneName&ps=1&pf2=phoneName&pf3=phoneName&stopwords=true&lowercaseOperators=true > > Doc1 and Doc2 both have the same identical score , but since the field > phoneName in the doc2 has shorter length I would expect it to have a higher > score , but both have an identical score of 9.961212. > > The phoneName filed is defined as follows.As we can see no where am I > specifying omitNorms=True, still the behavior seems to be that the length > norm is not functioning at all. Can some one let me know whats the issue > here ? > > <field name="phoneName" type="text_en_splitting" indexed="true" > stored="true" required="true" /> > <fieldType name="text_en_splitting" class="solr.TextField" > positionIncrementGap="100" autoGeneratePhraseQueries="true"> > <analyzer type="index"> > <tokenizer class="solr.WhitespaceTokenizerFactory" /> > <!-- in this example, we will only use synonyms at query > time <filter > class="solr.SynonymFilterFactory" > synonyms="index_synonyms.txt" ignoreCase="true" > expand="false"/> --> > <!-- Case insensitive stop word removal. add > enablePositionIncrements=true > in both the index and query analyzers to leave a 'gap' > for more accurate > phrase queries. --> > <filter class="solr.StopFilterFactory" ignoreCase="true" > words="lang/stopwords_en.txt" > enablePositionIncrements="true" /> > <filter class="solr.WordDelimiterFilterFactory" > generateWordParts="1" generateNumberParts="1" > catenateWords="1" > catenateNumbers="1" catenateAll="0" > splitOnCaseChange="1" /> > <filter class="solr.LowerCaseFilterFactory" /> > <filter class="solr.KeywordMarkerFilterFactory" > protected="protwords.txt" /> > <filter class="solr.PorterStemFilterFactory" /> > </analyzer> > <analyzer type="query"> > <tokenizer class="solr.WhitespaceTokenizerFactory" /> > <filter class="solr.SynonymFilterFactory" > synonyms="synonyms.txt" > ignoreCase="true" expand="true" /> > <filter class="solr.StopFilterFactory" ignoreCase="true" > words="lang/stopwords_en.txt" > enablePositionIncrements="true" /> > <filter class="solr.WordDelimiterFilterFactory" > generateWordParts="1" generateNumberParts="1" > catenateWords="0" > catenateNumbers="0" catenateAll="0" > splitOnCaseChange="1" /> > <filter class="solr.LowerCaseFilterFactory" /> > <filter class="solr.KeywordMarkerFilterFactory" > protected="protwords.txt" /> > <filter class="solr.PorterStemFilterFactory" /> > </analyzer> > </fieldType> > -- Sincerely yours Mikhail Khludnev Principal Engineer, Grid Dynamics <http://www.griddynamics.com> <mkhlud...@griddynamics.com>