I do not get any suggestion (when I search for "wrangle") , however I
correctly get the suggestion wrangler when I search for wranglr , I am
using the Direct and WordBreak spellcheckers in combination, I have not
tried using anything else.

Is the distance calculation of Solr different than what Levestien distance
calculation ? I have set maxEdits to 1 , assuming that this corresponds to
the maxDistance.

Thanks for your help!


On Mon, Jun 2, 2014 at 1:54 PM, david.w.smi...@gmail.com <
david.w.smi...@gmail.com> wrote:

> What do you get then?  Suggestions, but not the one you’re looking for, or
> is it deemed correctly spelled?
>
> Have you tried another spellChecker impl, for troubleshooting purposes?
>
> ~ David Smiley
> Freelance Apache Lucene/Solr Search Consultant/Developer
> http://www.linkedin.com/in/davidwsmiley
>
>
> On Sat, May 31, 2014 at 12:33 AM, S.L <simpleliving...@gmail.com> wrote:
>
> > Hi All,
> >
> > I have a small test index of 400 documents , it happens to have an entry
> > for  "wrangler", When I search for "wranglr", I correctly get the
> collation
> > suggestion as "wrangler", however when I search for "wrangle" , I do not
> > get a suggestion for "wrangler".
> >
> > The Levenstien distance between wrangle --> wrangler is same as the
> > Levestien distance between wranglr-->wrangler , I am just wondering why I
> > do not get a suggestion for wrangle.
> >
> > Below is my Direct spell checker configuration.
> >
> >     <lst name="spellchecker">
> >       <str name="name">direct</str>
> >       <str name="field">suggestAggregate</str>
> >       <str name="classname">solr.DirectSolrSpellChecker</str>
> >       <!-- the spellcheck distance measure used, the default is the
> > internal levenshtein -->
> >       <str name="distanceMeasure">internal</str>
> >       <str name="comparatorClass">score</str>
> >
> >       <!-- minimum accuracy needed to be considered a valid spellcheck
> > suggestion -->
> >       <float name="accuracy">0.7</float>
> >       <!-- the maximum #edits we consider when enumerating terms: can be
> 1
> > or 2 -->
> >       <int name="maxEdits">1</int>
> >       <!-- the minimum shared prefix when enumerating terms -->
> >       <int name="minPrefix">3</int>
> >       <!-- maximum number of inspections per result. -->
> >       <int name="maxInspections">5</int>
> >       <!-- minimum length of a query term to be considered for correction
> > -->
> >       <int name="minQueryLength">4</int>
> >       <!-- maximum threshold of documents a query term can appear to be
> > considered for correction -->
> >       <float name="maxQueryFrequency">0.01</float>
> >       <!-- uncomment this to require suggestions to occur in 1% of the
> > documents -->
> >       <!--
> >       <float name="thresholdTokenFrequency">.01</float>
> >       -->
> >     </lst>
> >
>

Reply via email to