> We do leverage synonyms but they are not appropriate for > this case. We use synonyms for words that are truly > synonymous for the entire index such as "inc" and > "incorporated". Those words are always interchangeable. > However, many of the employer alternate names are only valid > for a single employer not for the entire index. > > We do disable the lengthNorm but we benefit from tf and idf > so disabling those would cause more harm than good. > > Any other suggestions would be appreciated.
May be WDF can useful? http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#solr.WordDelimiterFilterFactory --------------------------------------------------------------------- To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org For additional commands, e-mail: java-user-h...@lucene.apache.org