[ 
https://issues.apache.org/jira/browse/DATAFU-88?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14317110#comment-14317110
 ] 

Jakob Homan commented on DATAFU-88:
-----------------------------------

GPL v3 is annoying because GPL v3 projects can use ASL2 code, but we can't use 
theirs (https://www.apache.org/licenses/GPL-compatibility.html). 

If this feature is treated as optional, we can provide instructions on how to 
obtain the library and use it, but we can't include it ourselves: 
https://www.apache.org/legal/resolved.html#optional
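
As a rough illustration of what "optional" could look like in code, here is a
minimal sketch that checks at runtime whether the user has supplied the CoreNLP
jar and only then builds a pipeline via reflection, so nothing GPL-licensed is
needed at compile time. The class OptionalCoreNlp and its methods are
hypothetical names, not existing DataFu code; the annotators string comes from
the issue description, and edu.stanford.nlp.pipeline.StanfordCoreNLP with a
Properties constructor is assumed to be the entry point the user's jar provides.

```java
import java.util.Properties;

/** Hypothetical helper: treats Stanford CoreNLP as an optional, user-supplied dependency. */
public class OptionalCoreNlp {
    // Assumed entry point of the user-supplied CoreNLP jar (not bundled here).
    static final String PIPELINE_CLASS = "edu.stanford.nlp.pipeline.StanfordCoreNLP";

    /** Returns true if the named class can be located on the classpath (without initializing it). */
    static boolean isAvailable(String className) {
        try {
            Class.forName(className, false, OptionalCoreNlp.class.getClassLoader());
            return true;
        } catch (ClassNotFoundException | LinkageError e) {
            return false;
        }
    }

    /** Builds the pipeline configuration from a comma-separated annotator list. */
    static Properties annotatorProps(String annotators) {
        Properties props = new Properties();
        props.setProperty("annotators", annotators);
        return props;
    }

    /** Instantiates the pipeline reflectively, or fails with a pointer to the install steps. */
    static Object newPipeline(Properties props) throws ReflectiveOperationException {
        if (!isAvailable(PIPELINE_CLASS)) {
            throw new IllegalStateException(
                "Stanford CoreNLP not found on the classpath; it must be obtained separately.");
        }
        return Class.forName(PIPELINE_CLASS)
                    .getConstructor(Properties.class)
                    .newInstance(props);
    }
}
```

A caller would pass something like annotatorProps("tokenize, ssplit, pos, lemma")
to newPipeline and surface the IllegalStateException message to the user when
the jar is missing.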

> Port Stanford Core NLP Functionality to DataFu
> ----------------------------------------------
>
>                 Key: DATAFU-88
>                 URL: https://issues.apache.org/jira/browse/DATAFU-88
>             Project: DataFu
>          Issue Type: New Feature
>    Affects Versions: 1.3.0
>            Reporter: Russell Jurney
>            Assignee: Russell Jurney
>              Labels: lemmatizer, nlp, pig, pig_udf, stanford, stemmer
>   Original Estimate: 168h
>  Remaining Estimate: 168h
>
> For starters I need the Stanford Core NLP stemmer and lemmatizer. 
> It looks like maybe I can add something generic and feed arguments to code 
> like: props.put("annotators", "tokenize, ssplit, pos, lemma");
> Helpful example of lemmatizing at 
> http://stackoverflow.com/questions/1578062/lemmatization-java



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
