[
https://issues.apache.org/jira/browse/PIG-792?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12727096#action_12727096
]
Hadoop QA commented on PIG-792:
-------------------------------
-1 overall. Here are the results of testing the latest attachment
http://issues.apache.org/jira/secure/attachment/12412434/skewedjoin.patch
against trunk revision 790735.
+1 @author. The patch does not contain any @author tags.
+1 tests included. The patch appears to include 4 new or modified tests.
-1 javadoc. The javadoc tool appears to have generated 1 warning messages.
-1 javac. The applied patch generated 263 javac compiler warnings (more
than the trunk's current 250 warnings).
-1 findbugs. The patch appears to introduce 14 new Findbugs warnings.
-1 release audit. The applied patch generated 162 release audit warnings
(more than the trunk's current 161 warnings).
-1 core tests. The patch failed core unit tests.
+1 contrib tests. The patch passed contrib unit tests.
Test results:
http://hudson.zones.apache.org/hudson/job/Pig-Patch-minerva.apache.org/111/testReport/
Release audit warnings:
http://hudson.zones.apache.org/hudson/job/Pig-Patch-minerva.apache.org/111/artifact/trunk/patchprocess/releaseAuditDiffWarnings.txt
Findbugs warnings:
http://hudson.zones.apache.org/hudson/job/Pig-Patch-minerva.apache.org/111/artifact/trunk/build/test/findbugs/newPatchFindbugsWarnings.html
Console output:
http://hudson.zones.apache.org/hudson/job/Pig-Patch-minerva.apache.org/111/console
This message is automatically generated.
> PERFORMANCE: Support skewed join in pig
> ---------------------------------------
>
> Key: PIG-792
> URL: https://issues.apache.org/jira/browse/PIG-792
> Project: Pig
> Issue Type: Improvement
> Reporter: Sriranjan Manjunath
> Attachments: skewedjoin.patch
>
>
> Fragmented replicated join has a few limitations:
> - One of the tables needs to be loaded into memory
> - Join is limited to two tables
> Skewed join partitions the table and joins the records in the reduce phase.
> It computes a histogram of the key space to account for skewing in the input
> records. Further, it adjusts the number of reducers depending on the key
> distribution.
> We need to implement the skewed join in pig.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.