[ 
https://issues.apache.org/jira/browse/PIG-5163?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15947177#comment-15947177
 ] 

Nandor Kollar commented on PIG-5163:
------------------------------------

I did some more investigation, and it seems that the problem is related to 
PIG-4675. We maintain the operator keys between Spark operators, and in this 
case, after multiquery optimization and before join group optimization, we have 
a mapping in SparkOperator[scope-30] multiQueryOptimizeConnectionMap: scope-22 
-> scope-8. But after join group optimization, the local rearrange in scope-22 
is deleted and replaced with a POJoinGroupSpark that has the same operator key 
as the global rearrange, so the scope-22 entry in the map appears to be stale. 
I think replacing the mapping in multiQueryOptimizeConnectionMap would fix the 
problem (see attached patch). However, I'm not sure why we need this map.
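
To illustrate the idea of replacing the mapping, here is a minimal sketch. It 
is not the attached patch and does not use Pig's classes: the map is modelled 
as a plain Map<String, String>, and the class name ConnectionMapSketch, the 
helper replaceKey, and the key "scope-23" (standing in for the key of the new 
POJoinGroupSpark) are all hypothetical.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical stand-in for the connection map kept on a Spark operator.
// Operator keys are modelled as plain strings for illustration only.
class ConnectionMapSketch {

    // Moves the entry stored under staleKey so that it is stored under newKey.
    // Returns true if an entry was rewritten, false if staleKey was absent.
    static boolean replaceKey(Map<String, String> connectionMap,
                              String staleKey, String newKey) {
        if (!connectionMap.containsKey(staleKey)) {
            return false;
        }
        String target = connectionMap.remove(staleKey);
        connectionMap.put(newKey, target);
        return true;
    }

    public static void main(String[] args) {
        Map<String, String> connectionMap = new HashMap<>();
        // State after multiquery optimization: scope-22 -> scope-8.
        connectionMap.put("scope-22", "scope-8");

        // Join group optimization removes the local rearrange (scope-22).
        // "scope-23" is an assumed key for the operator that replaces it.
        replaceKey(connectionMap, "scope-22", "scope-23");

        System.out.println(connectionMap);
    }
}
```

After the call, the map no longer contains scope-22, and the target scope-8 
is reachable from the new key instead.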

> MultiQuery_Streaming_1 is failing with spark exec type
> ------------------------------------------------------
>
>                 Key: PIG-5163
>                 URL: https://issues.apache.org/jira/browse/PIG-5163
>             Project: Pig
>          Issue Type: Sub-task
>          Components: spark
>            Reporter: Nandor Kollar
>            Assignee: liyunzhang_intel
>             Fix For: spark-branch
>
>         Attachments: PIG-5163_1.patch
>
>
> The 2nd output was empty; it looks like Pig on Spark didn't generate any data.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)
