[
https://issues.apache.org/jira/browse/HIVE-4070?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13667121#comment-13667121
]
Edward Capriolo commented on HIVE-4070:
---------------------------------------
The risk, as I see it, is that there are already a large number of people
depending on the current behaviour. If we change the default that would change
the results current users are getting. Better that the new users learn how hive
works, since they are learning anyway, then break assumptions of current users.
Most users do not want to have to heavily test before upgrade, they want
consistent behaviour between versions.
You suggestion to have a global or session level property is a good one. There
are some cases where I have thought about doing this. In general, it is not
ideal because no other component in hive works this way. Thus having a one-off
configuration for handling how like statements work is odd. Also the query is
no longer self documenting. Based on how some parameter outside the query is
set, the system functions differently. Imagine if we had 10 such parameters
could the same query produce 100 different results based on permutations of
properties?
For the most part, we model functionality in hive based on what mysql does. You
will find a lot of compatibility in how UDFs work and other language features.
There are many ways this can be dealt with, hive has 'like' and 'rlike'. If
there is an sql standard on how like must work that might be ammo for the
argument of changing the default, but basing a change solely on how mysql does
something just for new users is not attractive. MySQL has made its own bad
choices over the years (non standard things like enum) (non standard date/time
types) (non standard ways to specify indexes).
I am guessing that hive's like is the way it is because hive initially only
supported java's UTf8 strings and that comparison is by default case sensitive.
> Like operator in Hive is case sensitive while in MySQL (and most likely other
> DBs) it's case insensitive
> --------------------------------------------------------------------------------------------------------
>
> Key: HIVE-4070
> URL: https://issues.apache.org/jira/browse/HIVE-4070
> Project: Hive
> Issue Type: Bug
> Components: UDF
> Affects Versions: 0.10.0
> Reporter: Mark Grover
> Assignee: Mark Grover
> Priority: Trivial
>
> Hive's like operator seems to be case sensitive.
> See
> https://github.com/apache/hive/blob/trunk/ql/src/java/org/apache/hadoop/hive/ql/udf/UDFLike.java#L164
> However, MySQL's like operator is case insensitive. I don't have other DB's
> (like PostgreSQL) installed and handy but I am guessing their LIKE is case
> insensitive as well.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira