Github user mgaido91 commented on the issue: https://github.com/apache/spark/pull/19635 Hive is interesting. In older versions, it behaves like current Spark. But in its current master branch the behavior is like after the patch: ``` 0: jdbc:hive2://localhost:10000> select 'a' where 1 in ('01'); INFO : Compiling command(queryId=root_20171106045740_254a2d60-ae1f-4851-b304-dfa18551fff2): select 'a' where 1 in ('01') INFO : Concurrency mode is disabled, not creating a lock manager INFO : Semantic Analysis Completed INFO : Returning Hive schema: Schema(fieldSchemas:[FieldSchema(name:_c0, type:string, comment:null)], properties:null) INFO : Completed compiling command(queryId=root_20171106045740_254a2d60-ae1f-4851-b304-dfa18551fff2); Time taken: 5.794 seconds INFO : Concurrency mode is disabled, not creating a lock manager INFO : Executing command(queryId=root_20171106045740_254a2d60-ae1f-4851-b304-dfa18551fff2): select 'a' where 1 in ('01') INFO : Completed executing command(queryId=root_20171106045740_254a2d60-ae1f-4851-b304-dfa18551fff2); Time taken: 0.008 seconds INFO : OK +------+ | _c0 | +------+ | a | +------+ 1 row selected (6.321 seconds) 0: jdbc:hive2://localhost:10000> select 'a' where 1 in (select '01' from (select 1) dual); INFO : Compiling command(queryId=root_20171106045757_48e04001-bfbd-4557-9dd5-4e97674708ff): select 'a' where 1 in (select '01' from (select 1) dual) INFO : Concurrency mode is disabled, not creating a lock manager INFO : Semantic Analysis Completed INFO : Returning Hive schema: Schema(fieldSchemas:[FieldSchema(name:_c0, type:string, comment:null)], properties:null) INFO : Completed compiling command(queryId=root_20171106045757_48e04001-bfbd-4557-9dd5-4e97674708ff); Time taken: 0.869 seconds INFO : Concurrency mode is disabled, not creating a lock manager INFO : Executing command(queryId=root_20171106045757_48e04001-bfbd-4557-9dd5-4e97674708ff): select 'a' where 1 in (select '01' from (select 1) dual) WARN : Hive-on-MR is deprecated in Hive 2 and may not be available in the future versions. Consider using a different execution engine (i.e. spark, tez) or using Hive 1.X releases. INFO : Query ID = root_20171106045757_48e04001-bfbd-4557-9dd5-4e97674708ff INFO : Total jobs = 1 INFO : Starting task [Stage-4:MAPREDLOCAL] in serial mode INFO : Execution completed successfully INFO : MapredLocal task succeeded INFO : Launching Job 1 out of 1 INFO : Starting task [Stage-3:MAPRED] in serial mode INFO : Number of reduce tasks is set to 0 since there's no reduce operator WARN : Hadoop command-line option parsing not performed. Implement the Tool interface and execute your application with ToolRunner to remedy this. INFO : number of splits:1 INFO : Submitting tokens for job: job_1509962180830_0001 INFO : The url to track the job: http://6edb04432864:8088/proxy/application_1509962180830_0001/ INFO : Starting Job = job_1509962180830_0001, Tracking URL = http://6edb04432864:8088/proxy/application_1509962180830_0001/ INFO : Kill Command = /usr/local/hadoop/bin/hadoop job -kill job_1509962180830_0001 INFO : Hadoop job information for Stage-3: number of mappers: 0; number of reducers: 0 INFO : 2017-11-06 04:58:27,891 Stage-3 map = 0%, reduce = 0% INFO : 2017-11-06 04:58:35,774 Stage-3 map = 100%, reduce = 0% INFO : Ended Job = job_1509962180830_0001 INFO : MapReduce Jobs Launched: INFO : Stage-Stage-3: HDFS Read: 0 HDFS Write: 0 SUCCESS INFO : Total MapReduce CPU Time Spent: 0 msec INFO : Completed executing command(queryId=root_20171106045757_48e04001-bfbd-4557-9dd5-4e97674708ff); Time taken: 37.367 seconds INFO : OK +------+ | _c0 | +------+ | a | +------+ 1 row selected (38.501 seconds) ``` It looks like it has been fixed but I have not been able to find the relevant JIRA ticket so far. @gatorsmile should I check other databases?
--- --------------------------------------------------------------------- To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org