> We do leverage synonyms but they are not appropriate for
> this case. We use synonyms for words that are truly
> synonymous for the entire index such as "inc" and
> "incorporated". Those words are always interchangeable.
> However, many of the employer alternate names are only valid
> for a single employer not for the entire index.
> 
> We do disable the lengthNorm but we benefit from tf and idf
> so disabling those would cause more harm than good.
> 
> Any other suggestions would be appreciated.

May be WDF can useful?

http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#solr.WordDelimiterFilterFactory


      

---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org
For additional commands, e-mail: java-user-h...@lucene.apache.org

Reply via email to