[jira] [Commented] (SOLR-3029) Poor json formatting of spelling collation info

James Dyer (Commented) (JIRA) Thu, 29 Mar 2012 08:02:49 -0700

    [ 
https://issues.apache.org/jira/browse/SOLR-3029?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13241289#comment-13241289
 ]


James Dyer commented on SOLR-3029:
----------------------------------

Yonik,

I can answer some of your questions.  I do agree the spellcheck response format 
leaves something to be desired and maybe 4.0 is a good time to break backwards 
and improve it.

{quote}
Unless order is really important, "suggestions" should be a map
{quote}
I don't see why order would matter here, although some users might like to see 
the corrections listed in the order they appeared in the query.

{quote}
same for "collation"
{quote}
The collations are ranked, so order is important.

{quote}
and "misspellingsAndCorrections"
{quote}
The order shouldn't matter unless users are picky about the corrections being 
presented in the order they occur in the query.

{quote}
why is "collation" inside "suggestions" along with other words? should this be 
one level higher?
{quote}
This always confused me too.  I agree it should be one level higher.

{quote}
why isn't this giving me multiple collations
{quote}
This is a bug.  See SOLR-2853.

{quote}
why aren't multiple suggestions returned in misspellingsAndCorrections? (and 
what's the purpose ...?)
{quote}
This is nested with the Collation and gives details, for that particular 
collation, which misspelled word got which replacement.  This makes it easy for 
clients to generate messages like "no results found for abcdefgq ...  Showing 
abcdefgx instead!"  You can suppress this information by not specifying 
"spellcheck.collateExtendedResults=true".  For users (like me) who are 
interested in the collations only and don't care about individual-word 
corrections, it would be nice if we could suppress the first section of the 
response entirely.

{quote}
I briefly tried distributed search...
{quote}
DistributedSpellCheckComponentTest is supposed to detect problems like this but 
maybe something is going on and there is a bug this test isn't catching?

For what its worth you had voiced some misgivings about the JSON format when 
the multiple-collations feature was added.  At that time I supplied a quick 
patch to address your concerns.  I'm not sure if that patch fixes the problem 
described here.  See SOLR-2010 and your comment from Oct 16, 2010 and the (now 
outdated, never committed) patch I supplied on Oct 20.  

The patch on this issue causes multiple test failures although I didn't look 
into them.




                
> Poor json formatting of spelling collation info
> -----------------------------------------------
>
>                 Key: SOLR-3029
>                 URL: https://issues.apache.org/jira/browse/SOLR-3029
>             Project: Solr
>          Issue Type: Bug
>          Components: spellchecker
>    Affects Versions: 4.0
>            Reporter: Antony Stubbs
>            Priority: Blocker
>         Attachments: SOLR-3029.patch
>
>
> {noformat}
> "spellcheck": {
>     "suggestions": [
>     "dalllas",
>     {
> <snip>
>         {
>             "word": "canallas",
>             "freq": 1
>         }
>         ]
>     },
>     "correctlySpelled",
>     false,
>     "collation",
>     "dallas"
>     ]
> }
> {noformat}
> The correctlySpelled and collation key/values are stored as consecutive 
> elements in an array - quite odd. Is there a reason isn't not a key/value map 
> like most things?

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

[jira] [Commented] (SOLR-3029) Poor json formatting of spelling collation info

Reply via email to