[ 
https://issues.apache.org/jira/browse/PIG-2443?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13174378#comment-13174378
 ] 

Jonathan Coveney commented on PIG-2443:
---------------------------------------

Oh, I totally know what you mean; I'm just saying it would be cool. Given that 
you're calling, say, Integer.parseInt() and then throwing away the result, it 
seems silly that someone would do the split and then have to recast the int 
fields in the relation built from the rows for which IsInt is true.

There is currently no way for a UDF to produce a variable output schema (nor 
should there be, really). This would be something specific to this use of split.
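
To make the point about the discarded parse concrete, here is a minimal sketch 
in plain Java. It deliberately does not use the Pig UDF API, and the class and 
method names (IntCheckSketch, isInt, tryParseInt) are hypothetical; this is not 
the code in the attached patch. The first method parses and throws the value 
away, the second keeps it so a caller would not have to parse a second time.

```java
import java.util.OptionalInt;

// Hypothetical sketch for illustration; not taken from isIntNumeric.patch.
final class IntCheckSketch {

    // Boolean check: parses the string, then discards the parsed value.
    static boolean isInt(String s) {
        if (s == null) {
            return false;
        }
        try {
            Integer.parseInt(s);
            return true;
        } catch (NumberFormatException e) {
            return false;
        }
    }

    // Alternative: keep the parsed value so the caller need not parse again.
    static OptionalInt tryParseInt(String s) {
        if (s == null) {
            return OptionalInt.empty();
        }
        try {
            return OptionalInt.of(Integer.parseInt(s));
        } catch (NumberFormatException e) {
            return OptionalInt.empty();
        }
    }
}
```

With only the boolean form available, whoever consumes the "is an int" branch 
of the split ends up casting the same field again, which is the duplicated 
work described above.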
                
> [Piggybank] Add UDFs to check if a String is an Integer And if a String is 
> Numeric
> ----------------------------------------------------------------------------------
>
>                 Key: PIG-2443
>                 URL: https://issues.apache.org/jira/browse/PIG-2443
>             Project: Pig
>          Issue Type: New Feature
>          Components: piggybank
>            Reporter: Prashant Kommireddi
>            Assignee: Prashant Kommireddi
>         Attachments: isIntNumeric.patch, isIntNumeric.patch
>
>
> UDF that could be used to check if a String is numeric (or an Integer). 
> Several tools such as Splunk, AbInitio have this UDF built-in and companies 
> making an effort to move to Hadoop/Pig could use this.
> Use Case:
> In raw logs, certain filters/conditions are applied based on whether a 
> particular field/value is numeric or not. For example: SPLIT A INTO CATEGORY1 
> IF IsInt($0), CATEGORY2 IF !IsInt($0);

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        
