[
https://issues.apache.org/jira/browse/LUCENE-2341?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13052376#comment-13052376
]
Dawid Weiss commented on LUCENE-2341:
-------------------------------------
Thanks for the contribution, Michał.
Robert: the dictionary is licensed under MPL or CC-SA (the user selects
whichever suits their needs). Do you know which of the two is preferable?
Michał: there is also another (much larger) dictionary that was released
recently and comes from the Morfeusz project:
http://sgjp.pl/morfeusz/dopobrania.html
That dictionary is licensed under the BSD license, so no legal worries at all.
The two dictionaries are nearly identical (they differ slightly in their
convention of morphosyntactic annotations), and Morfeusz's dictionary could be
compiled into an automaton for use with Morfologik.
Which way should we go? What do you think?
> explore morfologik integration
> ------------------------------
>
> Key: LUCENE-2341
> URL: https://issues.apache.org/jira/browse/LUCENE-2341
> Project: Lucene - Java
> Issue Type: New Feature
> Components: modules/analysis
> Reporter: Robert Muir
> Assignee: Dawid Weiss
> Attachments: LUCENE-2341.diff, morfologik-stemming-1.5.0.jar
>
>
> Dawid Weiss mentioned on LUCENE-2298 that there is another Polish stemmer
> available:
> http://sourceforge.net/projects/morfologik/
> This works differently than LUCENE-2298, and ideally would be another option
> for users.