[ https://issues.apache.org/jira/browse/HIVE-642?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12733747#action_12733747 ]
Emil Ibrishimov commented on HIVE-642: -------------------------------------- There are some easy (compromise) ways to optimize split: 1. Check if the regex argument actually contains some "regex specific characters" and if it doesn't, do a straightforward split without converting to strings. 2. Assume some default value for the second argument (for example - split(str) to be equivalent to split(str, ' ') and optimize for this value 3. Have two separate split functions - one that does regex and one that splits around plain text. I think that 1 is a good choice and can be done rather quickly. > udf equivalent to string split > ------------------------------ > > Key: HIVE-642 > URL: https://issues.apache.org/jira/browse/HIVE-642 > Project: Hadoop Hive > Issue Type: New Feature > Components: Query Processor > Reporter: Namit Jain > Assignee: Emil Ibrishimov > Fix For: 0.4.0 > > Attachments: HIVE-642.1.patch, HIVE-642.2.patch > > > It would be very useful to have a function equivalent to string split in java -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.