[
https://issues.apache.org/jira/browse/HIVE-5888?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13963454#comment-13963454
]
Muthu commented on HIVE-5888:
-----------------------------
[~navis] After applying the patch from HIVE-6041 to hive 0.12, queries with
auto MAPJOIN fails with the following error: Any workarounds?
set hive.optimize.skewjoin=true; set hive.auto.convert.join=true; SELECT
ru.userid, SUM(ru.total_count) FROM BIGTABLE ru JOIN SMALLTABLE c on
c.creative_id = ru.creative_id JOIN placement_dapi p ON p.placement_id =
c.placement_id WHERE ru.realdate = '2014-01-02' AND ru.userid > 0 GROUP BY
ru.userid;
Stage-1 is selected by condition resolver.
java.io.FileNotFoundException: java.io.FileNotFoundException: File does not
exist:
/tmp/hive-muthu.nivas/tmp/hive-muthu.nivas/hive_2014-02-26_18-17-04_075_3879899075227148508-1/-mr-10002
at sun.reflect.NativeConstructorAccessorImpl.newInstance0(Native Method)
at
sun.reflect.NativeConstructorAccessorImpl.newInstance(NativeConstructorAccessorImpl.java:57)
at
sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:45)
at java.lang.reflect.Constructor.newInstance(Constructor.java:526)
at
org.apache.hadoop.ipc.RemoteException.instantiateException(RemoteException.java:96)
at
org.apache.hadoop.ipc.RemoteException.unwrapRemoteException(RemoteException.java:58)
at org.apache.hadoop.hdfs.DFSClient.getContentSummary(DFSClient.java:917)
at
org.apache.hadoop.hdfs.DistributedFileSystem.getContentSummary(DistributedFileSystem.java:232)
at
org.apache.hadoop.hive.ql.plan.ConditionalResolverCommonJoin.resolveMapJoinTask(ConditionalResolverCommonJoin.java:185)
at
org.apache.hadoop.hive.ql.plan.ConditionalResolverCommonJoin.getTasks(ConditionalResolverCommonJoin.java:117)
at
org.apache.hadoop.hive.ql.exec.ConditionalTask.execute(ConditionalTask.java:81)
at org.apache.hadoop.hive.ql.exec.Task.executeTask(Task.java:151)
at org.apache.hadoop.hive.ql.exec.TaskRunner.runSequential(TaskRunner.java:65)
at org.apache.hadoop.hive.ql.exec.TaskRunner.run(TaskRunner.java:55)
> group by after join operation product no result when hive.optimize.skewjoin
> = true
> ------------------------------------------------------------------------------------
>
> Key: HIVE-5888
> URL: https://issues.apache.org/jira/browse/HIVE-5888
> Project: Hive
> Issue Type: Bug
> Affects Versions: 0.11.0, 0.12.0
> Reporter: cyril liao
> Priority: Critical
>
--
This message was sent by Atlassian JIRA
(v6.2#6252)