[jira] Commented: (SOLR-81) Add Query Spellchecker functionality

Hoss Man (JIRA) Tue, 13 Mar 2007 21:44:30 -0800

    [ 
https://issues.apache.org/jira/browse/SOLR-81?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12480660
 ]


Hoss Man commented on SOLR-81:
------------------------------

Otis: haven't had a chance to look at your newest patch yet, but just to 
clarify my comment #4... In the last patch I looked at, LuceneDictionary could 
be used to build the dictionary based on a field name from the index -- but 
this will only work for simple String or TextFields.

Theoretically, someone could write a ROT132FieldType that munges up the field 
values stored in it. If you were to try to build a SpellChecker index from 
this field using just LuceneDictionary, nothing good would come of it (because 
of the way it uses the raw TermEnum). But since we have the IndexSchema, we 
can get the FieldType for the field name we want to use, and then calling the 
"indexedToReadable" method on each indexed term will give us the "plain text" 
version.
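
A rough sketch of the idea, not taken from any patch on this issue: the 
FieldType interface below is a hypothetical stand-in that models only the 
"indexedToReadable" method mentioned above, and Rot13FieldType and 
readableWords are made-up names for illustration. In Solr the FieldType would 
come from the IndexSchema and the raw terms from a TermEnum.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

public class ReadableDictionarySketch {

    /** Hypothetical stand-in: models only the method discussed in the comment. */
    interface FieldType {
        String indexedToReadable(String indexedForm);
    }

    /** Toy field type whose indexed form is ROT13 text (ROT13 is its own inverse). */
    static class Rot13FieldType implements FieldType {
        public String indexedToReadable(String indexedForm) {
            StringBuilder sb = new StringBuilder(indexedForm.length());
            for (char c : indexedForm.toCharArray()) {
                if (c >= 'a' && c <= 'z') {
                    sb.append((char) ('a' + (c - 'a' + 13) % 26));
                } else if (c >= 'A' && c <= 'Z') {
                    sb.append((char) ('A' + (c - 'A' + 13) % 26));
                } else {
                    sb.append(c);
                }
            }
            return sb.toString();
        }
    }

    /** Map each raw indexed term to its readable form before it goes into the dictionary. */
    static List<String> readableWords(Iterable<String> rawTerms, FieldType type) {
        List<String> words = new ArrayList<>();
        for (String term : rawTerms) {
            words.add(type.indexedToReadable(term));
        }
        return words;
    }

    public static void main(String[] args) {
        // Raw terms as they might sit in the index for the toy field type.
        List<String> rawTerms = Arrays.asList("yrgghpr", "gbzngb");
        System.out.println(readableWords(rawTerms, new Rot13FieldType()));
    }
}
```

Feeding the raw terms straight into the spelling dictionary would suggest 
"yrgghpr"; running them through the field type first yields "lettuce".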

It's a minor thing, but it's a good thing to take into account.

Alternately, we can just document that it doesn't make sense to use any field 
type except "StrField" (even TextField doesn't really make sense, since we 
can't anticipate what the Analyzer might have done).


> Add Query Spellchecker functionality
> ------------------------------------
>
>                 Key: SOLR-81
>                 URL: https://issues.apache.org/jira/browse/SOLR-81
>             Project: Solr
>          Issue Type: New Feature
>          Components: search
>            Reporter: Otis Gospodnetic
>            Priority: Minor
>         Attachments: SOLR-81-edgengram-ngram.patch, 
> SOLR-81-ngram-schema.patch, SOLR-81-ngram.patch, SOLR-81-ngram.patch, 
> SOLR-81-ngram.patch, SOLR-81-ngram.patch, SOLR-81-spellchecker.patch, 
> SOLR-81-spellchecker.patch, SOLR-81-spellchecker.patch
>
>
> Use the simple approach of n-gramming outside of Solr and indexing n-gram 
> documents.  For example:
> <doc>
> <field name="word">lettuce</field>
> <field name="start3">let</field>
> <field name="gram3">let ett ttu tuc uce</field>
> <field name="end3">uce</field>
> <field name="start4">lett</field>
> <field name="gram4">lett ettu ttuc tuce</field>
> <field name="end4">tuce</field>
> </doc>
> See:
> http://www.mail-archive.com/solr-user@lucene.apache.org/msg01254.html
> Java clients: SOLR-20 (add delete commit optimize), SOLR-30 (search)
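
For reference, the n-gram fields in the quoted example document could be 
produced outside of Solr along these lines. This is only a sketch: the class 
and method names are made up, and the field naming (startN, gramN, endN) 
simply follows the example above.

```java
import java.util.LinkedHashMap;
import java.util.Map;

public class NGramFieldsSketch {

    /** All n-grams of the word, in order, separated by single spaces. */
    static String grams(String word, int n) {
        StringBuilder sb = new StringBuilder();
        for (int i = 0; i + n <= word.length(); i++) {
            if (sb.length() > 0) {
                sb.append(' ');
            }
            sb.append(word, i, i + n);
        }
        return sb.toString();
    }

    /** Field name -> value for one word, for gram sizes minN..maxN. */
    static Map<String, String> fields(String word, int minN, int maxN) {
        Map<String, String> doc = new LinkedHashMap<>();
        doc.put("word", word);
        for (int n = minN; n <= maxN && n <= word.length(); n++) {
            doc.put("start" + n, word.substring(0, n));
            doc.put("gram" + n, grams(word, n));
            doc.put("end" + n, word.substring(word.length() - n));
        }
        return doc;
    }

    public static void main(String[] args) {
        // Print the fields in the same order as the example document.
        for (Map.Entry<String, String> e : fields("lettuce", 3, 4).entrySet()) {
            System.out.println(e.getKey() + " = " + e.getValue());
        }
    }
}
```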

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
