[
https://issues.apache.org/jira/browse/PIG-3747?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Cheolsoo Park updated PIG-3747:
-------------------------------
Attachment: PIG-3747-2.patch
Thanks Daniel. I changed the table names in the example query-
{code}
A = LOAD 'skewed_data' AS (a1,a2,a3);
B = LOAD 'data' AS (b1,b2,b3);
C = JOIN A BY a1, B BY b1 USING 'skewed';
{code}
> Update skewed join documentation
> --------------------------------
>
> Key: PIG-3747
> URL: https://issues.apache.org/jira/browse/PIG-3747
> Project: Pig
> Issue Type: Bug
> Components: documentation
> Reporter: Cheolsoo Park
> Assignee: Cheolsoo Park
> Fix For: 0.13.0
>
> Attachments: PIG-3747-1.patch, PIG-3747-2.patch
>
>
> While working on Tez skewed join, I found two problems with the skewed join
> documentation-
> # The following is misleading because skewed join also works with left,
> right, and full outer joins.
> {quote}
> Skewed join works with two-table inner join.
> {quote}
> # In two-way join, the 1st table must be the skewed one because Pig samples
> on that. But this is not mentioned anywhere.
--
This message was sent by Atlassian JIRA
(v6.1.5#6160)