You can use Levenstein Distance algorithm inside solr without writing code by
specifing the source of terms in solrconfig.xml

<searchComponent name="spellcheck" class="solr.SpellCheckComponent">
  <lst name="spellchecker">
    <str name="classname">solr.IndexBasedSpellChecker</str>
    <str name="spellcheckIndexDir">./spellchecker</str>
    <str name="field">content</str>
    <str name="buildOnCommit">true</str>
  </lst>
</searchComponent>

This example shows the results of a simple query that defines a query using
the spellcheck.q parameter. The query also includes a spellcheck.build=true
parameter, which is needs to be called only once in order to build the
index. spellcheck.build should not be specified with for each request.

http://localhost:8983/solr/spellCheckCompRH?q=*:*&spellcheck.q=hell%20ultrashar&spellcheck=true&spellcheck.build=true

<lst name="spellcheck">
  <lst name="suggestions">
    <lst name="hell">
      <int name="numFound">1</int>
      <int name="startOffset">0</int>
      <int name="endOffset">4</int>
      <arr name="suggestion">
        <str>dell</str>
      </arr>
    </lst>
    <lst name="ultrashar">
      <int name="numFound">1</int>
      <int name="startOffset">5</int>
      <int name="endOffset">14</int>
      <arr name="suggestion">
        <str>ultrasharp</str>
      </arr>
    </lst>
  </lst>
</lst>



Once the suggestions are collected, they are ranked by the configured
distance measure (Levenstein Distance by default) and then by aggregate
frequency.



--
View this message in context: 
http://lucene.472066.n3.nabble.com/How-to-properly-use-Levenstein-distance-with-in-Java-tp4164793p4164883.html
Sent from the Solr - User mailing list archive at Nabble.com.

Reply via email to