[ https://issues.apache.org/jira/browse/PIG-3143?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Daniel Dai updated PIG-3143: ---------------------------- Fix Version/s: (was: 0.14.0) 0.15.0 > Enable TOKENIZE to use any configurable Lucene Tokenizer, if a config > parameter is set and the JARs included > ------------------------------------------------------------------------------------------------------------ > > Key: PIG-3143 > URL: https://issues.apache.org/jira/browse/PIG-3143 > Project: Pig > Issue Type: Improvement > Components: impl, internal-udfs > Affects Versions: 0.11 > Reporter: Russell Jurney > Fix For: 0.15.0 > > > I'll do this in time for 12. TOKENIZE is literally useless as is. See: > http://thedatachef.blogspot.com/2011/04/lucene-text-tokenization-udf-for-apache.html > https://github.com/Ganglion/varaha/blob/master/src/main/java/varaha/text/TokenizeText.java -- This message was sent by Atlassian JIRA (v6.3.4#6332)