Andriy Rysin created SOLR-11835:
---
Summary: Adjust instructions for Ukrainian on LanguageAnalysis page
Key: SOLR-11835
URL: https://issues.apache.org/jira/browse/SOLR-11835
Project: Solr
Issue
[
https://issues.apache.org/jira/browse/LUCENE-7973?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16195876#comment-16195876
]
Andriy Rysin commented on LUCENE-7973:
--
It looks like I need to remove my 3 pull requests from above
Andriy Rysin created LUCENE-7973:
Summary: Update dictionary version for Ukrainian analyzer
Key: LUCENE-7973
URL: https://issues.apache.org/jira/browse/LUCENE-7973
Project: Lucene - Core
[
https://issues.apache.org/jira/browse/LUCENE-7841?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16021165#comment-16021165
]
Andriy Rysin commented on LUCENE-7841:
--
Thanks Dawid, I've pushed the checksum and change file
Andriy Rysin created LUCENE-7841:
Summary: Normalize ґ to г in Ukrainian analyzer
Key: LUCENE-7841
URL: https://issues.apache.org/jira/browse/LUCENE-7841
Project: Lucene - Core
Issue Type:
[
https://issues.apache.org/jira/browse/LUCENE-7785?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15974717#comment-15974717
]
Andriy Rysin commented on LUCENE-7785:
--
Thanks Dawid! Thanks everybody for your help and feedback!
[
https://issues.apache.org/jira/browse/LUCENE-7785?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15969418#comment-15969418
]
Andriy Rysin commented on LUCENE-7785:
--
`ant precommit` is happy now
> Move dictionary for
[
https://issues.apache.org/jira/browse/LUCENE-7785?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15969405#comment-15969405
]
Andriy Rysin commented on LUCENE-7785:
--
Ahh, I see what you mean, I'll push the fix for the order
[
https://issues.apache.org/jira/browse/LUCENE-7785?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15969389#comment-15969389
]
Andriy Rysin commented on LUCENE-7785:
--
Here's what I get:
check-lib-versions:
[echo] Lib
[
https://issues.apache.org/jira/browse/LUCENE-7785?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15969367#comment-15969367
]
Andriy Rysin commented on LUCENE-7785:
--
Ok, thanks for the suggestions, I was able to run `ant
Andriy Rysin created LUCENE-7785:
Summary: Move dictionary for Ukrainian analyzer to external dependency
Key: LUCENE-7785
URL: https://issues.apache.org/jira/browse/LUCENE-7785
Project: Lucene - Core
[
https://issues.apache.org/jira/browse/LUCENE-7287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15628837#comment-15628837
]
Andriy Rysin commented on LUCENE-7287:
--
Cassandra, it looks like 6.2 is out; could you please add
[
https://issues.apache.org/jira/browse/LUCENE-7287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15365063#comment-15365063
]
Andriy Rysin commented on LUCENE-7287:
--
Thanks Michael, much appreciated!
> New lemma-tizer plugin
[
https://issues.apache.org/jira/browse/LUCENE-7287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15358188#comment-15358188
]
Andriy Rysin commented on LUCENE-7287:
--
Hey [~mikemccand], can we please merge the pull request
[
https://issues.apache.org/jira/browse/LUCENE-7287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15348927#comment-15348927
]
Andriy Rysin commented on LUCENE-7287:
--
Ok, I was able to run solr with Ukrainian analyzer and I can
[
https://issues.apache.org/jira/browse/LUCENE-7287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15348585#comment-15348585
]
Andriy Rysin commented on LUCENE-7287:
--
I've created the dictionary that collapses token+lemma in
[
https://issues.apache.org/jira/browse/LUCENE-7287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15347153#comment-15347153
]
Andriy Rysin commented on LUCENE-7287:
--
Ok, then I'll prepare the changes as part of this ticket.
[
https://issues.apache.org/jira/browse/LUCENE-7287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15346878#comment-15346878
]
Andriy Rysin commented on LUCENE-7287:
--
Hmm, that does not look right. Yes we can either use
[
https://issues.apache.org/jira/browse/LUCENE-7287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15346862#comment-15346862
]
Andriy Rysin commented on LUCENE-7287:
--
Thanks Ahmet!
Shall I create mappings_uk.txt so we can use
[
https://issues.apache.org/jira/browse/LUCENE-7287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15346413#comment-15346413
]
Andriy Rysin commented on LUCENE-7287:
--
Sure, I can add a comment, but I guess I need to test the
[
https://issues.apache.org/jira/browse/LUCENE-7287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15344396#comment-15344396
]
Andriy Rysin commented on LUCENE-7287:
--
I've logged in to cwiki but I don't seem to have rights to
[
https://issues.apache.org/jira/browse/LUCENE-7287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15344252#comment-15344252
]
Andriy Rysin commented on LUCENE-7287:
--
Thanks Ahmet, that looks good! Would you add/push those
[
https://issues.apache.org/jira/browse/LUCENE-7287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15343258#comment-15343258
]
Andriy Rysin edited comment on LUCENE-7287 at 6/22/16 3:07 AM:
---
I don't
[
https://issues.apache.org/jira/browse/LUCENE-7287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15343258#comment-15343258
]
Andriy Rysin commented on LUCENE-7287:
--
I don't know much about solr, but I think
[
https://issues.apache.org/jira/browse/LUCENE-7348?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15341014#comment-15341014
]
Andriy Rysin commented on LUCENE-7348:
--
[~mikemccand] Hey Michael,
I've analyzed the inflection
Andriy Rysin created LUCENE-7348:
Summary: Add dynamic stemmer for Ukrainian
Key: LUCENE-7348
URL: https://issues.apache.org/jira/browse/LUCENE-7348
Project: Lucene - Core
Issue Type: New
[
https://issues.apache.org/jira/browse/LUCENE-7287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15340996#comment-15340996
]
Andriy Rysin commented on LUCENE-7287:
--
Looks cool, thanks a lot Michael!
I wonder if we should add
[
https://issues.apache.org/jira/browse/LUCENE-7287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15337982#comment-15337982
]
Andriy Rysin commented on LUCENE-7287:
--
I guess it does not fit under analysis/common as it depends
[
https://issues.apache.org/jira/browse/LUCENE-7287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15333888#comment-15333888
]
Andriy Rysin commented on LUCENE-7287:
--
[~mikemccand], [~iorixxx] does this implementation look good
[
https://issues.apache.org/jira/browse/LUCENE-7287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15326753#comment-15326753
]
Andriy Rysin commented on LUCENE-7287:
--
Thanks for the hint, I've changed the code to use
[
https://issues.apache.org/jira/browse/LUCENE-7287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15326624#comment-15326624
]
Andriy Rysin commented on LUCENE-7287:
--
I've added a token filter for unicode apostrophes and stress
[
https://issues.apache.org/jira/browse/LUCENE-7287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15326066#comment-15326066
]
Andriy Rysin commented on LUCENE-7287:
--
Ok, guys, I've created a little project with Ukrainian
[
https://issues.apache.org/jira/browse/LUCENE-7287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15323800#comment-15323800
]
Andriy Rysin commented on LUCENE-7287:
--
Ok, I've imported lucene-solr and the Ukrainian analyzer
[
https://issues.apache.org/jira/browse/LUCENE-7287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15317807#comment-15317807
]
Andriy Rysin commented on LUCENE-7287:
--
I just realized that Lucene includes morfologik analyzer
[
https://issues.apache.org/jira/browse/LUCENE-7287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15306947#comment-15306947
]
Andriy Rysin commented on LUCENE-7287:
--
From my point of view we can use dict_uk as a source for
[
https://issues.apache.org/jira/browse/LUCENE-7287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15304009#comment-15304009
]
Andriy Rysin commented on LUCENE-7287:
--
BTW how does hunspell stemming work for "exceptions"? There
[
https://issues.apache.org/jira/browse/LUCENE-7287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15304000#comment-15304000
]
Andriy Rysin commented on LUCENE-7287:
--
So do we need to build a hunspell dictionary (this may take me
[
https://issues.apache.org/jira/browse/LUCENE-7287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15298680#comment-15298680
]
Andriy Rysin commented on LUCENE-7287:
--
There's no alternative open dictionary for Ukrainian with
[
https://issues.apache.org/jira/browse/LUCENE-7287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15298470#comment-15298470
]
Andriy Rysin commented on LUCENE-7287:
--
Quick check via jvisualvm shows ~400MB used by the
39 matches