[ 
https://issues.apache.org/jira/browse/LUCENENET-366?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12863432#action_12863432
 ] 

Digy commented on LUCENENET-366:
--------------------------------

Hi Ben,

Your patches start to divert from the original Java code(v2.0).

What about porting the new SpellChecker from 
https://svn.apache.org/repos/asf/lucene/java/branches/lucene_2_9/contrib/spellchecker/src/
 ?
Would you try it?

DIGY

> Spellchecker issues
> -------------------
>
>                 Key: LUCENENET-366
>                 URL: https://issues.apache.org/jira/browse/LUCENENET-366
>             Project: Lucene.Net
>          Issue Type: Bug
>            Reporter: Ben West
>            Priority: Minor
>         Attachments: LuceneNet-SpellcheckFixes.patch
>
>
> There are several issues with the spellchecker:
> - It doesn't do duplicate checking across updates (so the same word is often 
> indexed many, many times)
> - The n-gram fields are stored as well as indexed, which increases the size 
> of the index by several orders of magnitude and provides no benefit
> - Some deprecated functions are used, which slows it down
> - Some methods aren't commented fully
> I will attach a patch that fixes these.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to