Rohini Palaniswamy created PIG-4057:
---------------------------------------

             Summary: Group All followed by CROSS with default parallelism 
produces wrong results
                 Key: PIG-4057
                 URL: https://issues.apache.org/jira/browse/PIG-4057
             Project: Pig
          Issue Type: Bug
            Reporter: Rohini Palaniswamy
             Fix For: 0.14.0


SET default_parallel 199;
......
by_size = ...
uniq_vals = .....
grpd = group uniq_vals all;
all_vals = FOREACH grpd GENERATE uniq_vals;
cross_result = CROSS by_size, all_vals;
store cross_result into '/tmp/roh/cross/out/recipient_asns';

Job1: grpd, all_vals, cross_result (The plan does GFCross function here for
all_vals assuming cross parallelism to be 1 taking that of the current job even
though it should consider default parallelism 199 of Job 2. Parallelism of Job1
is 1 because of group all)
Job2: cross_result (Actual CROSS of by_size and all_vals)



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Reply via email to