[ 
https://issues.apache.org/jira/browse/PIG-3747?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Cheolsoo Park updated PIG-3747:
-------------------------------

    Attachment: PIG-3747-2.patch

Thanks Daniel. I changed the table names in the example query-
{code}
A = LOAD 'skewed_data' AS (a1,a2,a3);
B = LOAD 'data' AS (b1,b2,b3);
C = JOIN A BY a1, B BY b1 USING 'skewed';
{code}

> Update skewed join documentation
> --------------------------------
>
>                 Key: PIG-3747
>                 URL: https://issues.apache.org/jira/browse/PIG-3747
>             Project: Pig
>          Issue Type: Bug
>          Components: documentation
>            Reporter: Cheolsoo Park
>            Assignee: Cheolsoo Park
>             Fix For: 0.13.0
>
>         Attachments: PIG-3747-1.patch, PIG-3747-2.patch
>
>
> While working on Tez skewed join, I found two problems with the skewed join 
> documentation-
> # The following is misleading because skewed join also works with left, 
> right, and full outer joins.
> {quote}
> Skewed join works with two-table inner join. 
> {quote}
> # In two-way join, the 1st table must be the skewed one because Pig samples 
> on that. But this is not mentioned anywhere.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

Reply via email to