I have a Pig script that was divided by the Pig framework in two MapReduce jobs. So far so good.
One of these jobs was always failing. When I checked the logs I realized that Pig is executing the "2nd" job before the "1st". Well, I think this is happening because the second part of my script doesn't depend explicitly on the first part. But I'd like it to be executed before the other part. Is it possible? I know Pig tries to optimize several things, but changing the order of the MR jobs is not something nice. Are pigs "domestic animals" are all? By the way, how much control do we really have over Pig's internal DAG? Thanks, Rodrigo Ferreira.
