[
https://issues.apache.org/jira/browse/PIG-4421?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14520963#comment-14520963
]
Mohit Sabharwal commented on PIG-4421:
--------------------------------------
Thanks, [~kellyzly]. Could you please create a review board for this?
Could you rebase this patch so that it compatible with the "single spark
operator" change in PIG-4518 ?
I know Skew Join only allows 2 tables. But would {{joinPlans.keySet()}} always
be of size 2 (and correspond to the 2 POForeach operators in the physical plan)
? If you could add some comments about this, it'd be great.
{code}
-----------------------------------------------
# Physical Plan:
#-----------------------------------------------
X: Store
|
|---X: SkewedJoin[tuple] - scope-21
| |
|
|---A: New For Each(false,false,false)[bag] - scope-10
| | |
| |---A:
Load(/Users/mohit/pig/left.txt:org.apache.pig.builtin.PigStorage) - scope-0
|
|---B: New For Each(false,false)[bag] - scope-18
| |
|
|---B: Load(/Users/mohit/pig/right.txt
{code}
> implement visitSkewedJoin in SparkCompiler
> ------------------------------------------
>
> Key: PIG-4421
> URL: https://issues.apache.org/jira/browse/PIG-4421
> Project: Pig
> Issue Type: Sub-task
> Components: spark
> Reporter: liyunzhang_intel
> Assignee: liyunzhang_intel
> Fix For: spark-branch
>
> Attachments: PIG-4421.patch, PIG-4421_2.patch
>
>
> If visitSkewedJoin is not implemented, following unittests will fail.
> org.apache.pig.test.TestSkewedJoin.testSkewedJoinWithGroup
> org.apache.pig.test.TestSkewedJoin.testSkewedJoinMapKey
> org.apache.pig.test.TestSkewedJoin.testSkewedJoinManyReducers
> org.apache.pig.test.TestSkewedJoin.testNonExistingInputPathInSkewJoin
> org.apache.pig.test.TestSkewedJoin.testSkewedJoinOneValue
> org.apache.pig.test.TestSkewedJoin.testSkewedJoinWithNoProperties
> org.apache.pig.test.TestSkewedJoin.testSkewedJoinEmptyInput
> org.apache.pig.test.TestSkewedJoin.testSkewedJoinNullKeys
> org.apache.pig.test.TestSkewedJoin.testSkewedJoinOuter
> org.apache.pig.test.TestSkewedJoin.testRecursiveFileListing
> org.apache.pig.test.TestSkewedJoin.testSkewedJoinReducers
> org.apache.pig.test.TestJoinSmoke.testSkewedJoinWithGroup
> org.apache.pig.test.TestJoinSmoke.testSkewedJoinOuter
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)