[jira] [Closed] (LUCENE-10626) Hunspell: add tools to aid dictionary editing: analysis introspection, stem expansion and stem/flag suggestion

2022-07-05 Thread Peter Gromov (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10626?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Gromov closed LUCENE-10626. - > Hunspell: add tools to aid dictionary editing: analysis introspection, stem > expansion and

[jira] [Resolved] (LUCENE-10626) Hunspell: add tools to aid dictionary editing: analysis introspection, stem expansion and stem/flag suggestion

2022-07-05 Thread Peter Gromov (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10626?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Gromov resolved LUCENE-10626. --- Resolution: Fixed > Hunspell: add tools to aid dictionary editing: analysis introspection,

[jira] [Created] (LUCENE-10626) Hunspell: add tools to aid dictionary editing: analysis introspection, stem expansion and stem/flag suggestion

2022-06-23 Thread Peter Gromov (Jira)
Peter Gromov created LUCENE-10626: - Summary: Hunspell: add tools to aid dictionary editing: analysis introspection, stem expansion and stem/flag suggestion Key: LUCENE-10626 URL:

[jira] [Closed] (LUCENE-10452) Hunspell: call checkCanceled less frequently to reduce the overhead

2022-03-16 Thread Peter Gromov (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10452?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Gromov closed LUCENE-10452. - > Hunspell: call checkCanceled less frequently to reduce the overhead >

[jira] [Resolved] (LUCENE-10452) Hunspell: call checkCanceled less frequently to reduce the overhead

2022-03-16 Thread Peter Gromov (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10452?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Gromov resolved LUCENE-10452. --- Fix Version/s: 10.0 (main) 9.2 Resolution: Fixed > Hunspell:

[jira] [Updated] (LUCENE-10451) Hunspell: don't perform potentially expensive spellchecking after timeout

2022-03-16 Thread Peter Gromov (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10451?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Gromov updated LUCENE-10451: -- Fix Version/s: 10.0 (main) > Hunspell: don't perform potentially expensive spellchecking

[jira] [Commented] (LUCENE-10451) Hunspell: don't perform potentially expensive spellchecking after timeout

2022-03-16 Thread Peter Gromov (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10451?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17507402#comment-17507402 ] Peter Gromov commented on LUCENE-10451: --- OK, convinced, will add. > Hunspell: don't perform

[jira] [Commented] (LUCENE-10451) Hunspell: don't perform potentially expensive spellchecking after timeout

2022-03-16 Thread Peter Gromov (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10451?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17507393#comment-17507393 ] Peter Gromov commented on LUCENE-10451: --- I will cherry-pick a bit later, thanks! As for

[jira] [Comment Edited] (LUCENE-10451) Hunspell: don't perform potentially expensive spellchecking after timeout

2022-03-15 Thread Peter Gromov (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10451?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17507227#comment-17507227 ] Peter Gromov edited comment on LUCENE-10451 at 3/15/22, 8:54 PM: -

[jira] [Commented] (LUCENE-10451) Hunspell: don't perform potentially expensive spellchecking after timeout

2022-03-15 Thread Peter Gromov (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10451?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17507227#comment-17507227 ] Peter Gromov commented on LUCENE-10451: --- [~dweiss] That'd be nice. Do you approve this backport?

[jira] [Closed] (LUCENE-10451) Hunspell: don't perform potentially expensive spellchecking after timeout

2022-03-15 Thread Peter Gromov (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10451?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Gromov closed LUCENE-10451. - > Hunspell: don't perform potentially expensive spellchecking after timeout >

[jira] [Resolved] (LUCENE-10451) Hunspell: don't perform potentially expensive spellchecking after timeout

2022-03-15 Thread Peter Gromov (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10451?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Gromov resolved LUCENE-10451. --- Fix Version/s: 9.2 Resolution: Fixed > Hunspell: don't perform potentially

[jira] [Created] (LUCENE-10452) Hunspell: call checkCanceled less frequently to reduce the overhead

2022-03-02 Thread Peter Gromov (Jira)
Peter Gromov created LUCENE-10452: - Summary: Hunspell: call checkCanceled less frequently to reduce the overhead Key: LUCENE-10452 URL: https://issues.apache.org/jira/browse/LUCENE-10452 Project:

[jira] [Created] (LUCENE-10451) Hunspell: don't perform potentially expensive spellchecking after timeout

2022-03-02 Thread Peter Gromov (Jira)
Peter Gromov created LUCENE-10451: - Summary: Hunspell: don't perform potentially expensive spellchecking after timeout Key: LUCENE-10451 URL: https://issues.apache.org/jira/browse/LUCENE-10451

[jira] [Commented] (LUCENE-10304) Make sure IDEs are usable after modules are introduced

2021-12-10 Thread Peter Gromov (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10304?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17457139#comment-17457139 ] Peter Gromov commented on LUCENE-10304: --- It's certainly not my area of expertise, sorry :( I've

[jira] [Commented] (LUCENE-9687) Hunspell support improvements

2021-05-29 Thread Peter Gromov (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-9687?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17353926#comment-17353926 ] Peter Gromov commented on LUCENE-9687: -- [~rcmuir] Thank you so much! > Hunspell support

[jira] [Commented] (LUCENE-9687) Hunspell support improvements

2021-05-29 Thread Peter Gromov (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-9687?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17353819#comment-17353819 ] Peter Gromov commented on LUCENE-9687: -- [~mikemccand] Oops, sorry, I thought I've run some sanity

[jira] [Closed] (LUCENE-9895) Hunspell: make suggest-with-timeout API public

2021-04-07 Thread Peter Gromov (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-9895?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Gromov closed LUCENE-9895. > Hunspell: make suggest-with-timeout API public > -- > >

[jira] [Resolved] (LUCENE-9895) Hunspell: make suggest-with-timeout API public

2021-04-07 Thread Peter Gromov (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-9895?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Gromov resolved LUCENE-9895. -- Fix Version/s: trunk Resolution: Fixed > Hunspell: make suggest-with-timeout API

[jira] [Closed] (LUCENE-9894) Hunspell: add user-friendly diagnostics for morph data API misuse

2021-04-07 Thread Peter Gromov (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-9894?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Gromov closed LUCENE-9894. > Hunspell: add user-friendly diagnostics for morph data API misuse >

[jira] [Resolved] (LUCENE-9894) Hunspell: add user-friendly diagnostics for morph data API misuse

2021-04-07 Thread Peter Gromov (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-9894?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Gromov resolved LUCENE-9894. -- Fix Version/s: trunk Resolution: Fixed > Hunspell: add user-friendly diagnostics for

[jira] [Created] (LUCENE-9895) Hunspell: make suggest-with-timeout API public

2021-03-30 Thread Peter Gromov (Jira)
Peter Gromov created LUCENE-9895: Summary: Hunspell: make suggest-with-timeout API public Key: LUCENE-9895 URL: https://issues.apache.org/jira/browse/LUCENE-9895 Project: Lucene - Core Issue

[jira] [Created] (LUCENE-9894) Hunspell: add user-friendly diagnostics for morph data API misuse

2021-03-30 Thread Peter Gromov (Jira)
Peter Gromov created LUCENE-9894: Summary: Hunspell: add user-friendly diagnostics for morph data API misuse Key: LUCENE-9894 URL: https://issues.apache.org/jira/browse/LUCENE-9894 Project: Lucene -

[jira] [Created] (LUCENE-9852) Make Hunspell thread-safe

2021-03-18 Thread Peter Gromov (Jira)
Peter Gromov created LUCENE-9852: Summary: Make Hunspell thread-safe Key: LUCENE-9852 URL: https://issues.apache.org/jira/browse/LUCENE-9852 Project: Lucene - Core Issue Type: Bug

[jira] [Commented] (LUCENE-9687) Hunspell support improvements

2021-03-15 Thread Peter Gromov (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-9687?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17301919#comment-17301919 ] Peter Gromov commented on LUCENE-9687: -- Thanks for thorough reviews! I'll be reading the dev

[jira] [Commented] (LUCENE-9687) Hunspell support improvements

2021-03-15 Thread Peter Gromov (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-9687?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17301696#comment-17301696 ] Peter Gromov commented on LUCENE-9687: -- With all my PRs accepted (thank you!) I'm satisfied with

[jira] [Created] (LUCENE-9842) Hunspell suggestions: speed up for non-Latin scripts

2021-03-15 Thread Peter Gromov (Jira)
Peter Gromov created LUCENE-9842: Summary: Hunspell suggestions: speed up for non-Latin scripts Key: LUCENE-9842 URL: https://issues.apache.org/jira/browse/LUCENE-9842 Project: Lucene - Core

[jira] [Updated] (LUCENE-9842) Hunspell suggestions: speed up for some non-Latin scripts

2021-03-15 Thread Peter Gromov (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-9842?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Gromov updated LUCENE-9842: - Summary: Hunspell suggestions: speed up for some non-Latin scripts (was: Hunspell suggestions:

[jira] [Created] (LUCENE-9833) Hunspell: AssertionError in WordStorage.lookupWord

2021-03-11 Thread Peter Gromov (Jira)
Peter Gromov created LUCENE-9833: Summary: Hunspell: AssertionError in WordStorage.lookupWord Key: LUCENE-9833 URL: https://issues.apache.org/jira/browse/LUCENE-9833 Project: Lucene - Core

[jira] [Created] (LUCENE-9832) Hunspell: SIOOBE in GeneratingSuggester.expandRoot

2021-03-11 Thread Peter Gromov (Jira)
Peter Gromov created LUCENE-9832: Summary: Hunspell: SIOOBE in GeneratingSuggester.expandRoot Key: LUCENE-9832 URL: https://issues.apache.org/jira/browse/LUCENE-9832 Project: Lucene - Core

[jira] [Created] (LUCENE-9831) Hunspell GeneratingSuggester: faster flag & case checks, less allocations

2021-03-10 Thread Peter Gromov (Jira)
Peter Gromov created LUCENE-9831: Summary: Hunspell GeneratingSuggester: faster flag & case checks, less allocations Key: LUCENE-9831 URL: https://issues.apache.org/jira/browse/LUCENE-9831 Project:

[jira] [Created] (LUCENE-9830) Hunspell: store word length for faster dictionary lookup/enumeration

2021-03-10 Thread Peter Gromov (Jira)
Peter Gromov created LUCENE-9830: Summary: Hunspell: store word length for faster dictionary lookup/enumeration Key: LUCENE-9830 URL: https://issues.apache.org/jira/browse/LUCENE-9830 Project: Lucene

[jira] [Created] (LUCENE-9825) Hunspell: reverse the "words" trie for faster word lookup/suggestions

2021-03-05 Thread Peter Gromov (Jira)
Peter Gromov created LUCENE-9825: Summary: Hunspell: reverse the "words" trie for faster word lookup/suggestions Key: LUCENE-9825 URL: https://issues.apache.org/jira/browse/LUCENE-9825 Project:

[jira] [Created] (LUCENE-9824) Hunspell suggestions: speed up ngram score calculation for each dictionary entry

2021-03-04 Thread Peter Gromov (Jira)
Peter Gromov created LUCENE-9824: Summary: Hunspell suggestions: speed up ngram score calculation for each dictionary entry Key: LUCENE-9824 URL: https://issues.apache.org/jira/browse/LUCENE-9824

[jira] [Created] (LUCENE-9813) Add a convenience constructor IntsRef(int[])

2021-02-26 Thread Peter Gromov (Jira)
Peter Gromov created LUCENE-9813: Summary: Add a convenience constructor IntsRef(int[]) Key: LUCENE-9813 URL: https://issues.apache.org/jira/browse/LUCENE-9813 Project: Lucene - Core Issue

[jira] [Created] (LUCENE-9812) Hunspell: honor empty stripping affixes when generating suggestions

2021-02-25 Thread Peter Gromov (Jira)
Peter Gromov created LUCENE-9812: Summary: Hunspell: honor empty stripping affixes when generating suggestions Key: LUCENE-9812 URL: https://issues.apache.org/jira/browse/LUCENE-9812 Project: Lucene

[jira] [Created] (LUCENE-9811) Hunspell suggestions: speed up ngram calculation by not searching for substrings in impossible places

2021-02-24 Thread Peter Gromov (Jira)
Peter Gromov created LUCENE-9811: Summary: Hunspell suggestions: speed up ngram calculation by not searching for substrings in impossible places Key: LUCENE-9811 URL:

[jira] [Created] (LUCENE-9810) Hunspell: when generating suggestions, skip too deep word FST subtrees

2021-02-24 Thread Peter Gromov (Jira)
Peter Gromov created LUCENE-9810: Summary: Hunspell: when generating suggestions, skip too deep word FST subtrees Key: LUCENE-9810 URL: https://issues.apache.org/jira/browse/LUCENE-9810 Project:

[jira] [Created] (LUCENE-9808) Hunspell suggestions: consider space/dash-separated words for each case variation

2021-02-24 Thread Peter Gromov (Jira)
Peter Gromov created LUCENE-9808: Summary: Hunspell suggestions: consider space/dash-separated words for each case variation Key: LUCENE-9808 URL: https://issues.apache.org/jira/browse/LUCENE-9808

[jira] [Created] (LUCENE-9806) Hunspell: speed up affix condition checking

2021-02-23 Thread Peter Gromov (Jira)
Peter Gromov created LUCENE-9806: Summary: Hunspell: speed up affix condition checking Key: LUCENE-9806 URL: https://issues.apache.org/jira/browse/LUCENE-9806 Project: Lucene - Core Issue

[jira] [Created] (LUCENE-9805) Hunspell: fix space + mixed case heuristics on suggestions

2021-02-23 Thread Peter Gromov (Jira)
Peter Gromov created LUCENE-9805: Summary: Hunspell: fix space + mixed case heuristics on suggestions Key: LUCENE-9805 URL: https://issues.apache.org/jira/browse/LUCENE-9805 Project: Lucene - Core

[jira] [Created] (LUCENE-9804) Hunspell: fix most similar dictionary entry search by reversing the comparator

2021-02-23 Thread Peter Gromov (Jira)
Peter Gromov created LUCENE-9804: Summary: Hunspell: fix most similar dictionary entry search by reversing the comparator Key: LUCENE-9804 URL: https://issues.apache.org/jira/browse/LUCENE-9804

[jira] [Created] (LUCENE-9803) Hunspell: don't check second stage suffixes if the first stage flag only occurs in prefixes

2021-02-23 Thread Peter Gromov (Jira)
Peter Gromov created LUCENE-9803: Summary: Hunspell: don't check second stage suffixes if the first stage flag only occurs in prefixes Key: LUCENE-9803 URL: https://issues.apache.org/jira/browse/LUCENE-9803

[jira] [Created] (LUCENE-9801) Hunspell suggestions: speed up expandWord by enumerating only applicable affixes

2021-02-22 Thread Peter Gromov (Jira)
Peter Gromov created LUCENE-9801: Summary: Hunspell suggestions: speed up expandWord by enumerating only applicable affixes Key: LUCENE-9801 URL: https://issues.apache.org/jira/browse/LUCENE-9801

[jira] [Created] (LUCENE-9800) Hunspell: put a time limit on suggestion calculation

2021-02-22 Thread Peter Gromov (Jira)
Peter Gromov created LUCENE-9800: Summary: Hunspell: put a time limit on suggestion calculation Key: LUCENE-9800 URL: https://issues.apache.org/jira/browse/LUCENE-9800 Project: Lucene - Core

[jira] [Created] (LUCENE-9799) Hunspell: don't check second-level affixes when the first level isn't a continuation

2021-02-22 Thread Peter Gromov (Jira)
Peter Gromov created LUCENE-9799: Summary: Hunspell: don't check second-level affixes when the first level isn't a continuation Key: LUCENE-9799 URL: https://issues.apache.org/jira/browse/LUCENE-9799

[jira] [Created] (LUCENE-9790) Hunspell: avoid slow dictionary lookup if the word's hash isn't there

2021-02-19 Thread Peter Gromov (Jira)
Peter Gromov created LUCENE-9790: Summary: Hunspell: avoid slow dictionary lookup if the word's hash isn't there Key: LUCENE-9790 URL: https://issues.apache.org/jira/browse/LUCENE-9790 Project:

[jira] [Created] (LUCENE-9787) Hunspell: speed up suggesting a bit by not creating a huge TreeSet

2021-02-18 Thread Peter Gromov (Jira)
Peter Gromov created LUCENE-9787: Summary: Hunspell: speed up suggesting a bit by not creating a huge TreeSet Key: LUCENE-9787 URL: https://issues.apache.org/jira/browse/LUCENE-9787 Project: Lucene -

[jira] [Created] (LUCENE-9786) Hunspell suggestions: try moving the last character into the middle

2021-02-18 Thread Peter Gromov (Jira)
Peter Gromov created LUCENE-9786: Summary: Hunspell suggestions: try moving the last character into the middle Key: LUCENE-9786 URL: https://issues.apache.org/jira/browse/LUCENE-9786 Project: Lucene

[jira] [Created] (LUCENE-9785) Hunspell: don't check case in compound middle and end

2021-02-18 Thread Peter Gromov (Jira)
Peter Gromov created LUCENE-9785: Summary: Hunspell: don't check case in compound middle and end Key: LUCENE-9785 URL: https://issues.apache.org/jira/browse/LUCENE-9785 Project: Lucene - Core

[jira] [Created] (LUCENE-9784) Hunspell suggestions: use US keyboard in absence of KEY option

2021-02-17 Thread Peter Gromov (Jira)
Peter Gromov created LUCENE-9784: Summary: Hunspell suggestions: use US keyboard in absence of KEY option Key: LUCENE-9784 URL: https://issues.apache.org/jira/browse/LUCENE-9784 Project: Lucene -

[jira] [Created] (LUCENE-9783) Hunspell: don't suggest more than 4 ngram corrections by default

2021-02-17 Thread Peter Gromov (Jira)
Peter Gromov created LUCENE-9783: Summary: Hunspell: don't suggest more than 4 ngram corrections by default Key: LUCENE-9783 URL: https://issues.apache.org/jira/browse/LUCENE-9783 Project: Lucene -

[jira] [Created] (LUCENE-9782) Hunspell suggestions: split by space (but not dash) also before last char

2021-02-17 Thread Peter Gromov (Jira)
Peter Gromov created LUCENE-9782: Summary: Hunspell suggestions: split by space (but not dash) also before last char Key: LUCENE-9782 URL: https://issues.apache.org/jira/browse/LUCENE-9782 Project:

[jira] [Updated] (LUCENE-9782) Hunspell suggestions: split by space (but not dash) also before last char

2021-02-17 Thread Peter Gromov (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-9782?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Gromov updated LUCENE-9782: - Description: Suggest replacing English "ina" with "in a" > Hunspell suggestions: split by space

[jira] [Created] (LUCENE-9781) Speed up BytesStore reader setPosition

2021-02-17 Thread Peter Gromov (Jira)
Peter Gromov created LUCENE-9781: Summary: Speed up BytesStore reader setPosition Key: LUCENE-9781 URL: https://issues.apache.org/jira/browse/LUCENE-9781 Project: Lucene - Core Issue Type:

[jira] [Updated] (LUCENE-9779) Hunspell: add an API to interrupt long computations

2021-02-16 Thread Peter Gromov (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-9779?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Gromov updated LUCENE-9779: - Description: For example, to ensure responsiveness, IntelliJ-based IDEs periodically call a

[jira] [Created] (LUCENE-9779) Hunspell: add an API to interrupt long computations

2021-02-16 Thread Peter Gromov (Jira)
Peter Gromov created LUCENE-9779: Summary: Hunspell: add an API to interrupt long computations Key: LUCENE-9779 URL: https://issues.apache.org/jira/browse/LUCENE-9779 Project: Lucene - Core

[jira] [Created] (LUCENE-9778) Hunspell: speed up input conversion

2021-02-16 Thread Peter Gromov (Jira)
Peter Gromov created LUCENE-9778: Summary: Hunspell: speed up input conversion Key: LUCENE-9778 URL: https://issues.apache.org/jira/browse/LUCENE-9778 Project: Lucene - Core Issue Type:

[jira] [Created] (LUCENE-9776) Hunspell: allow to inflect the last part of COMPOUNDRULE compound

2021-02-16 Thread Peter Gromov (Jira)
Peter Gromov created LUCENE-9776: Summary: Hunspell: allow to inflect the last part of COMPOUNDRULE compound Key: LUCENE-9776 URL: https://issues.apache.org/jira/browse/LUCENE-9776 Project: Lucene -

[jira] [Created] (LUCENE-9775) Hunspell: make FORCEUCASE work when the first compound word is inherently title-case

2021-02-15 Thread Peter Gromov (Jira)
Peter Gromov created LUCENE-9775: Summary: Hunspell: make FORCEUCASE work when the first compound word is inherently title-case Key: LUCENE-9775 URL: https://issues.apache.org/jira/browse/LUCENE-9775

[jira] [Created] (LUCENE-9772) Hunspell: CHECKCOMPOUNDCASE shouldn't prohibit dash-separated uppercase compounds

2021-02-15 Thread Peter Gromov (Jira)
Peter Gromov created LUCENE-9772: Summary: Hunspell: CHECKCOMPOUNDCASE shouldn't prohibit dash-separated uppercase compounds Key: LUCENE-9772 URL: https://issues.apache.org/jira/browse/LUCENE-9772

[jira] [Created] (LUCENE-9771) Hunspell: don't lookup word roots unnecessarily to check flags

2021-02-15 Thread Peter Gromov (Jira)
Peter Gromov created LUCENE-9771: Summary: Hunspell: don't lookup word roots unnecessarily to check flags Key: LUCENE-9771 URL: https://issues.apache.org/jira/browse/LUCENE-9771 Project: Lucene -

[jira] [Created] (LUCENE-9770) Hunspell: don't perform compound check recursively when looking for space-separated word pairs

2021-02-15 Thread Peter Gromov (Jira)
Peter Gromov created LUCENE-9770: Summary: Hunspell: don't perform compound check recursively when looking for space-separated word pairs Key: LUCENE-9770 URL: https://issues.apache.org/jira/browse/LUCENE-9770

[jira] [Created] (LUCENE-9769) Hunspell: KEEPCASE should take precedence over affixed forms

2021-02-12 Thread Peter Gromov (Jira)
Peter Gromov created LUCENE-9769: Summary: Hunspell: KEEPCASE should take precedence over affixed forms Key: LUCENE-9769 URL: https://issues.apache.org/jira/browse/LUCENE-9769 Project: Lucene - Core

[jira] [Updated] (LUCENE-9687) Hunspell support improvements

2021-02-12 Thread Peter Gromov (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-9687?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Gromov updated LUCENE-9687: - Description: I'd like Lucene's Hunspell support to be on a par with the native C++ Hunspell

[jira] [Commented] (LUCENE-9766) Hunspell: add API for retrieving dictionary morphological data and stemming

2021-02-12 Thread Peter Gromov (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-9766?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17283771#comment-17283771 ] Peter Gromov commented on LUCENE-9766: -- That's because I'm creating a new subtask for each PR, as

[jira] [Commented] (LUCENE-9766) Hunspell: add API for retrieving dictionary morphological data and stemming

2021-02-12 Thread Peter Gromov (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-9766?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17283749#comment-17283749 ] Peter Gromov commented on LUCENE-9766: -- This one is indeed a bit out of the current wording of

[jira] [Created] (LUCENE-9766) Hunspell: add API for retrieving dictionary morphological data and stemming

2021-02-12 Thread Peter Gromov (Jira)
Peter Gromov created LUCENE-9766: Summary: Hunspell: add API for retrieving dictionary morphological data and stemming Key: LUCENE-9766 URL: https://issues.apache.org/jira/browse/LUCENE-9766 Project:

[jira] [Created] (LUCENE-9765) Hunspell: rename SpellChecker to Hunspell, fix test name, update javadoc and CHANGES.txt

2021-02-11 Thread Peter Gromov (Jira)
Peter Gromov created LUCENE-9765: Summary: Hunspell: rename SpellChecker to Hunspell, fix test name, update javadoc and CHANGES.txt Key: LUCENE-9765 URL: https://issues.apache.org/jira/browse/LUCENE-9765

[jira] [Created] (LUCENE-9764) Hunspell: try title case as FORCEUCASE misspelled word suggestions

2021-02-11 Thread Peter Gromov (Jira)
Peter Gromov created LUCENE-9764: Summary: Hunspell: try title case as FORCEUCASE misspelled word suggestions Key: LUCENE-9764 URL: https://issues.apache.org/jira/browse/LUCENE-9764 Project: Lucene -

[jira] [Created] (LUCENE-9763) Hunspell: fix FORBIDDENWORD support

2021-02-11 Thread Peter Gromov (Jira)
Peter Gromov created LUCENE-9763: Summary: Hunspell: fix FORBIDDENWORD support Key: LUCENE-9763 URL: https://issues.apache.org/jira/browse/LUCENE-9763 Project: Lucene - Core Issue Type:

[jira] [Created] (LUCENE-9761) Hunspell: check that FLAG and SET don't occur too far in the file, cleanup

2021-02-10 Thread Peter Gromov (Jira)
Peter Gromov created LUCENE-9761: Summary: Hunspell: check that FLAG and SET don't occur too far in the file, cleanup Key: LUCENE-9761 URL: https://issues.apache.org/jira/browse/LUCENE-9761 Project:

[jira] [Created] (LUCENE-9760) Hunspell: print total memory usage in TestAllDictionaries, cleanup

2021-02-10 Thread Peter Gromov (Jira)
Peter Gromov created LUCENE-9760: Summary: Hunspell: print total memory usage in TestAllDictionaries, cleanup Key: LUCENE-9760 URL: https://issues.apache.org/jira/browse/LUCENE-9760 Project: Lucene -

[jira] [Created] (LUCENE-9759) Hunspell: add more to TestHunspellRepositoryTestCases.EXPECTED_FAILURES

2021-02-10 Thread Peter Gromov (Jira)
Peter Gromov created LUCENE-9759: Summary: Hunspell: add more to TestHunspellRepositoryTestCases.EXPECTED_FAILURES Key: LUCENE-9759 URL: https://issues.apache.org/jira/browse/LUCENE-9759 Project:

[jira] [Updated] (LUCENE-9758) Hunspell: support NOSUGGEST option

2021-02-10 Thread Peter Gromov (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-9758?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Gromov updated LUCENE-9758: - Description: don't suggest NOSUGGEST and SUBSTANDARD dictionary entries and their derivations

[jira] [Created] (LUCENE-9758) Hunspell: support NOSUGGEST option

2021-02-10 Thread Peter Gromov (Jira)
Peter Gromov created LUCENE-9758: Summary: Hunspell: support NOSUGGEST option Key: LUCENE-9758 URL: https://issues.apache.org/jira/browse/LUCENE-9758 Project: Lucene - Core Issue Type:

[jira] [Created] (LUCENE-9753) Hunspell: disallow compounds with parts present in dictionary space-separated

2021-02-09 Thread Peter Gromov (Jira)
Peter Gromov created LUCENE-9753: Summary: Hunspell: disallow compounds with parts present in dictionary space-separated Key: LUCENE-9753 URL: https://issues.apache.org/jira/browse/LUCENE-9753

[jira] [Created] (LUCENE-9752) Hunspell Stemmer: reduce parameter count

2021-02-09 Thread Peter Gromov (Jira)
Peter Gromov created LUCENE-9752: Summary: Hunspell Stemmer: reduce parameter count Key: LUCENE-9752 URL: https://issues.apache.org/jira/browse/LUCENE-9752 Project: Lucene - Core Issue Type:

[jira] [Created] (LUCENE-9750) Hunspell: improve suggestions for mixed-case misspelled words

2021-02-09 Thread Peter Gromov (Jira)
Peter Gromov created LUCENE-9750: Summary: Hunspell: improve suggestions for mixed-case misspelled words Key: LUCENE-9750 URL: https://issues.apache.org/jira/browse/LUCENE-9750 Project: Lucene - Core

[jira] [Created] (LUCENE-9749) Hunspell: apply output conversion (OCONV) to the suggestions

2021-02-09 Thread Peter Gromov (Jira)
Peter Gromov created LUCENE-9749: Summary: Hunspell: apply output conversion (OCONV) to the suggestions Key: LUCENE-9749 URL: https://issues.apache.org/jira/browse/LUCENE-9749 Project: Lucene - Core

[jira] [Commented] (LUCENE-9740) Avoid buffering and double-scan of flags in *.aff file

2021-02-09 Thread Peter Gromov (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-9740?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17281609#comment-17281609 ] Peter Gromov commented on LUCENE-9740: -- Very nice, thanks! I think this can be merged, and

[jira] [Created] (LUCENE-9748) Hunspell: suggest inflected dictionary entries similar to the misspelled word

2021-02-09 Thread Peter Gromov (Jira)
Peter Gromov created LUCENE-9748: Summary: Hunspell: suggest inflected dictionary entries similar to the misspelled word Key: LUCENE-9748 URL: https://issues.apache.org/jira/browse/LUCENE-9748

[jira] [Updated] (LUCENE-9746) Hunspell: unify case variation logic in Stemmer and SpellChecker

2021-02-08 Thread Peter Gromov (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-9746?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Gromov updated LUCENE-9746: - Summary: Hunspell: unify case variation logic in Stemmer and SpellChecker (was: Hunspell:

[jira] [Updated] (LUCENE-9746) Hunspell: unify case variation logic between Stemmer and SpellChecker

2021-02-08 Thread Peter Gromov (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-9746?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Gromov updated LUCENE-9746: - Summary: Hunspell: unify case variation logic between Stemmer and SpellChecker (was: Hunspell:

[jira] [Created] (LUCENE-9746) Hunspell: unify case variation code between Stemmer and SpellChecker

2021-02-08 Thread Peter Gromov (Jira)
Peter Gromov created LUCENE-9746: Summary: Hunspell: unify case variation code between Stemmer and SpellChecker Key: LUCENE-9746 URL: https://issues.apache.org/jira/browse/LUCENE-9746 Project: Lucene

[jira] [Created] (LUCENE-9745) Hunspell: tolerate more aff/dic file typos

2021-02-08 Thread Peter Gromov (Jira)
Peter Gromov created LUCENE-9745: Summary: Hunspell: tolerate more aff/dic file typos Key: LUCENE-9745 URL: https://issues.apache.org/jira/browse/LUCENE-9745 Project: Lucene - Core Issue

[jira] [Created] (LUCENE-9743) Hunspell: ignore original tests which are out of scope for now

2021-02-08 Thread Peter Gromov (Jira)
Peter Gromov created LUCENE-9743: Summary: Hunspell: ignore original tests which are out of scope for now Key: LUCENE-9743 URL: https://issues.apache.org/jira/browse/LUCENE-9743 Project: Lucene -

[jira] [Created] (LUCENE-9742) Hunspell: suggest dictionary entries similar to the misspelled word

2021-02-07 Thread Peter Gromov (Jira)
Peter Gromov created LUCENE-9742: Summary: Hunspell: suggest dictionary entries similar to the misspelled word Key: LUCENE-9742 URL: https://issues.apache.org/jira/browse/LUCENE-9742 Project: Lucene

[jira] [Created] (LUCENE-9739) Hunspell: speed up numeric flag parsing

2021-02-07 Thread Peter Gromov (Jira)
Peter Gromov created LUCENE-9739: Summary: Hunspell: speed up numeric flag parsing Key: LUCENE-9739 URL: https://issues.apache.org/jira/browse/LUCENE-9739 Project: Lucene - Core Issue Type:

[jira] [Updated] (LUCENE-9736) Hunspell: support MAP-based suggestions for groups of similar letters

2021-02-06 Thread Peter Gromov (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-9736?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Gromov updated LUCENE-9736: - Summary: Hunspell: support MAP-based suggestions for groups of similar letters (was: Hunspell:

[jira] [Created] (LUCENE-9736) Hunspell: support MAP-based suggestions

2021-02-06 Thread Peter Gromov (Jira)
Peter Gromov created LUCENE-9736: Summary: Hunspell: support MAP-based suggestions Key: LUCENE-9736 URL: https://issues.apache.org/jira/browse/LUCENE-9736 Project: Lucene - Core Issue Type:

[jira] [Created] (LUCENE-9735) Hunspell: speed up flag checks by avoiding allocations

2021-02-05 Thread Peter Gromov (Jira)
Peter Gromov created LUCENE-9735: Summary: Hunspell: speed up flag checks by avoiding allocations Key: LUCENE-9735 URL: https://issues.apache.org/jira/browse/LUCENE-9735 Project: Lucene - Core

[jira] [Created] (LUCENE-9734) Hunspell: support suggestions based on "ph" morphological data

2021-02-05 Thread Peter Gromov (Jira)
Peter Gromov created LUCENE-9734: Summary: Hunspell: support suggestions based on "ph" morphological data Key: LUCENE-9734 URL: https://issues.apache.org/jira/browse/LUCENE-9734 Project: Lucene -

[jira] [Updated] (LUCENE-9733) Hunspell: exception when loading dictionaries with mixed-case words and aliased flags

2021-02-04 Thread Peter Gromov (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-9733?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Gromov updated LUCENE-9733: - Description: java.lang.NumberFormatException: For input string: "ÿç397" at

[jira] [Updated] (LUCENE-9733) Hunspell: exception when loading dictionaries with mixed-case words and aliased flags

2021-02-04 Thread Peter Gromov (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-9733?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Gromov updated LUCENE-9733: - Summary: Hunspell: exception when loading dictionaries with mixed-case words and aliased flags

[jira] [Created] (LUCENE-9733) Hunspell: fix exception when loading dictionaries with mixed-case words

2021-02-04 Thread Peter Gromov (Jira)
Peter Gromov created LUCENE-9733: Summary: Hunspell: fix exception when loading dictionaries with mixed-case words Key: LUCENE-9733 URL: https://issues.apache.org/jira/browse/LUCENE-9733 Project:

[jira] [Created] (LUCENE-9732) Hunspell: support dictionary entries starting with slash

2021-02-04 Thread Peter Gromov (Jira)
Peter Gromov created LUCENE-9732: Summary: Hunspell: support dictionary entries starting with slash Key: LUCENE-9732 URL: https://issues.apache.org/jira/browse/LUCENE-9732 Project: Lucene - Core

[jira] [Created] (LUCENE-9729) Hunspell: support CHECKCOMPOUNDREP flags

2021-02-03 Thread Peter Gromov (Jira)
Peter Gromov created LUCENE-9729: Summary: Hunspell: support CHECKCOMPOUNDREP flags Key: LUCENE-9729 URL: https://issues.apache.org/jira/browse/LUCENE-9729 Project: Lucene - Core Issue Type:

[jira] [Created] (LUCENE-9728) Hunspell: add a performance test

2021-02-03 Thread Peter Gromov (Jira)
Peter Gromov created LUCENE-9728: Summary: Hunspell: add a performance test Key: LUCENE-9728 URL: https://issues.apache.org/jira/browse/LUCENE-9728 Project: Lucene - Core Issue Type:

[jira] [Created] (LUCENE-9726) Hunspell: speed up spellchecking by stopping at a single found stem

2021-02-03 Thread Peter Gromov (Jira)
Peter Gromov created LUCENE-9726: Summary: Hunspell: speed up spellchecking by stopping at a single found stem Key: LUCENE-9726 URL: https://issues.apache.org/jira/browse/LUCENE-9726 Project: Lucene

  1   2   >