http://wiki.apache.org/lucene-java/ConceptsAndDefinitions

On Thu, Feb 18, 2010 at 7:13 AM, Raimon Bosch <raimon.bo...@gmail.com> wrote:
>
>
> I am not an expert in the Lucene scoring formula, but omitNorms=false makes
> the scoring a little more complex, because it takes index-time boosts for
> fields and documents into account. If I'm not wrong (if I am, please correct
> me), with omitNorms=false the score takes into account queryNorm(q) and
> norm(t,d) from the formula:
>
>   score(q,d) = coord(q,d) · queryNorm(q)
>                · ∑ (t in q) ( tf(t in d) · idf(t)^2 · t.getBoost() · norm(t,d) )
>
> See
> http://lucene.apache.org/java/2_4_0/api/org/apache/lucene/search/Similarity.html,
> and
> http://old.nabble.com/scores-are-the-same-for-many-diferent-documents-td27623039.html#a27623039
>
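
To make the role of norm(t,d) in the formula above concrete, here is a minimal
sketch in plain Java. It does not call Lucene: the class name ScoreSketch, the
method score, and every number in main are made up for illustration. Modeling
a field without norms by passing 1.0 for each term is an assumption of this
sketch, not something taken from the thread.

```java
public class ScoreSketch {
    /**
     * score(q,d) = coord * queryNorm * sum over terms of tf * idf^2 * boost * norm.
     * Each array holds one entry per query term that matches the document.
     */
    public static double score(double coord, double queryNorm,
                               double[] tf, double[] idf,
                               double[] boost, double[] norm) {
        double sum = 0.0;
        for (int i = 0; i < tf.length; i++) {
            sum += tf[i] * idf[i] * idf[i] * boost[i] * norm[i];
        }
        return coord * queryNorm * sum;
    }

    public static void main(String[] args) {
        // Hypothetical per-term factors for a two-term query.
        double[] tf = {1.0, 2.0};
        double[] idf = {2.0, 1.5};
        double[] boost = {1.0, 1.0};

        // Norms kept: each term's contribution is scaled by norm(t,d).
        double[] withNorms = {0.5, 0.25};
        // Norms left out: modeled here as a constant 1.0 (assumption).
        double[] noNorms = {1.0, 1.0};

        System.out.println(score(1.0, 0.5, tf, idf, boost, withNorms)); // prints 1.5625
        System.out.println(score(1.0, 0.5, tf, idf, boost, noNorms));   // prints 4.25
    }
}
```

Because the sum is a plain product per term, a score of exactly 0.0 needs every
matching term to contribute 0, or coord or queryNorm to be 0.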
> The multiValued option lets a field hold more than one value per document.
>
> We use it in one of our indexes by modifying schema.xml to add a new field:
>
> ...
> <field name="s_similar_name" type="text" indexed="true" stored="true" multiValued="true"/>
> ...
>
> This field is filled by the processor that our own
> UpdateRequestProcessorFactory creates (written by us), from a
> comma-separated field called 's_similar_names':
> ...
> public void processAdd(AddUpdateCommand cmd) throws IOException {
>    SolrInputDocument doc = cmd.getSolrInputDocument();
>
>    // Read the comma-separated source field; it may be absent.
>    Object raw = doc.getFieldValue( "s_similar_names" );
>    if( raw != null ) {
>      String[] similarNames = raw.toString().split(",");
>      for( String similarName : similarNames ) {
>        // Skip empty entries such as the one produced by "a,,b".
>        if( !similarName.isEmpty() )
>            doc.addField( "s_similar_name", similarName );
>      }
>    }
>
>    // pass it up the chain
>    super.processAdd(cmd);
>  }
> ...
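
The splitting step in the processor above can be tried without Solr. This is
only a sketch: the class SimilarNamesSplit and the method splitNames are
hypothetical names, and trimming whitespace around each entry is an addition
that the quoted code does not do.

```java
import java.util.ArrayList;
import java.util.List;

public class SimilarNamesSplit {
    /** Splits a comma-separated value, trimming entries and skipping empty ones. */
    public static List<String> splitNames(String raw) {
        List<String> names = new ArrayList<>();
        if (raw == null) {
            return names; // a missing field yields no values
        }
        for (String part : raw.split(",")) {
            String name = part.trim(); // trimming is an assumption, not in the original
            if (!name.isEmpty()) {
                names.add(name);
            }
        }
        return names;
    }

    public static void main(String[] args) {
        // String.split drops trailing empty strings but keeps interior ones,
        // so the empty check still matters for input like "a,,b".
        System.out.println(splitNames("foo, bar,,baz,")); // prints [foo, bar, baz]
    }
}
```

Each returned element would become one value of the multiValued field.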
>
> The processor factory is registered in a chain in solrconfig.xml:
>
> ...
> <updateRequestProcessorChain name="mychain">
>   <processor class="org.apache.solr.update.processor.MyUpdateProcessorFactory"/>
>   <processor class="solr.LogUpdateProcessorFactory" />
>   <processor class="solr.RunUpdateProcessorFactory" />
> </updateRequestProcessorChain>
> ...
>
> and this chain is attached to the XmlUpdateRequestHandler in solrconfig.xml:
>
> ...
> <requestHandler name="/update" class="solr.XmlUpdateRequestHandler" >
>   <lst name="defaults">
>     <str name="update.processor">mychain</str>
>   </lst>
> </requestHandler>
> ...
>
> termVectors stores extra per-document information about a field's terms in
> the index, which saves computation time in features like MoreLikeThis. See
> http://wiki.apache.org/solr/TermVectorComponent. We don't use it.
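
Roughly speaking, a term vector records, for one field of one document, which
terms occur and how often (optionally with positions and offsets). The sketch
below shows only that idea in plain Java; it is not the Lucene or Solr API, and
the names TermVectorSketch and termFrequencies are hypothetical.

```java
import java.util.Map;
import java.util.TreeMap;

public class TermVectorSketch {
    /** Counts how often each token occurs in one document's field. */
    public static Map<String, Integer> termFrequencies(String[] tokens) {
        // TreeMap keeps the terms sorted, which makes the output stable.
        Map<String, Integer> freqs = new TreeMap<>();
        for (String token : tokens) {
            freqs.merge(token, 1, Integer::sum);
        }
        return freqs;
    }

    public static void main(String[] args) {
        String[] tokens = {"solr", "lucene", "solr", "index"};
        System.out.println(termFrequencies(tokens)); // prints {index=1, lucene=1, solr=2}
    }
}
```

Having these counts stored per document means a feature like MoreLikeThis does
not have to re-analyze the stored text to find a document's terms.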
>
>
> adeelmahmood wrote:
>>
>> I was going to ask a question about this, but you seem like you might have
>> the answer for me. What exactly does the omitNorms option do (or what is it
>> expected to do)? Also, could you please help me understand what the
>> termVectors and multiValued options do?
>> Thanks for your help.
>>
>>
>> Raimon Bosch wrote:
>>>
>>>
>>> Hi,
>>>
>>> We did some tests with omitNorms=false. On the last page of results we
>>> see some scores set to 0.0. These zero scores are problematic for our
>>> sorters.
>>>
>>> Could it be some kind of bug?
>>>
>>> Regards,
>>> Raimon Bosch.
>>>
>>
>>
>
> --
> View this message in context: 
> http://old.nabble.com/some-scores-to-0-using-omitNorns%3Dfalse-tp27637436p27637827.html
> Sent from the Solr - User mailing list archive at Nabble.com.
>
>



-- 
Lance Norskog
goks...@gmail.com
