Rodrigo, I see you're using pig 0.9? The latest code (pig 0.13) is better about preserving order when building the execution plan. See PIG-3902 (https://issues.apache.org/jira/browse/PIG-3902) You might try it without an exec on pig 0.13 if you can. Are you able to share (at least a skeleton of) your script?
--jacob @thedatachef On Jul 20, 2014, at 6:43 AM, Rodrigo Ferreira <[email protected]> wrote: > Hi everyone, > > I found the answer here: > http://pig.apache.org/docs/r0.9.1/perf.html#Implicit-Dependencies > > It seems that when you have implicit dependencies you have to use the EXEC > command in order to help Pig execute your jobs in the right order. > > Rodrigo. > > > 2014-07-20 14:40 GMT+02:00 Rodrigo Ferreira <[email protected]>: > >> I have a Pig script that was divided by the Pig framework in two MapReduce >> jobs. So far so good. >> >> One of these jobs was always failing. When I checked the logs I realized >> that Pig is executing the "2nd" job before the "1st". >> >> Well, I think this is happening because the second part of my script >> doesn't depend explicitly on the first part. But I'd like it to be executed >> before the other part. Is it possible? >> >> I know Pig tries to optimize several things, but changing the order of the >> MR jobs is not something nice. Are pigs "domestic animals" are all? >> >> By the way, how much control do we really have over Pig's internal DAG? >> >> Thanks, >> Rodrigo Ferreira. >>
