Hi,
With OOZIE-1311 and its subtasks, the idea seems to be to move all the
launcher classes, such as PigMain and HiveMain, to their respective sharelibs.
This makes the shared lib a mandatory deployment step. Previously, the shared
lib was optional, since users could bundle jars with their workflow application.
Always requiring the shared lib seems to introduce two problems:
1. Current deployments that don't use the action shared lib will fail. So
we should probably deprecate the current behavior.
2. The Hadoop distributed cache (DC) mechanism fails a job if the files in
the DC are updated on HDFS while the Hadoop job is running. So, when Oozie is
restarted and the shared lib is uploaded to HDFS again as part of
deployment, Hadoop will fail any existing job for which the timestamp of a
file on HDFS no longer matches the timestamp of its copy in the job's DC.
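To make the second problem concrete, here is a minimal sketch of the kind of
timestamp check I mean. It is a simplified model and not Hadoop's actual code:
the class name, method names, jar path, and timestamps are all hypothetical and
chosen only for illustration.

```java
import java.io.IOException;

// Hypothetical, simplified model of the distributed cache timestamp check.
// This is NOT Hadoop's implementation; all names here are assumptions.
class DistCacheTimestampSketch {

    // True when the file's current timestamp on HDFS differs from the one
    // recorded for the job at submission time.
    static boolean isStale(long recordedTimestamp, long currentTimestamp) {
        return recordedTimestamp != currentTimestamp;
    }

    // Mimics localization: refuse a cache file whose timestamp has changed.
    static void localize(String path, long recordedTimestamp, long currentTimestamp)
            throws IOException {
        if (isStale(recordedTimestamp, currentTimestamp)) {
            throw new IOException(path + " changed on HDFS since job submission"
                    + " (expected " + recordedTimestamp
                    + ", was " + currentTimestamp + ")");
        }
    }

    public static void main(String[] args) {
        String jar = "/user/oozie/share/lib/pig/example.jar"; // hypothetical path
        long atSubmission = 1000L;  // timestamp recorded when the job was submitted
        long afterRedeploy = 2000L; // timestamp after the shared lib was re-uploaded
        try {
            localize(jar, atSubmission, afterRedeploy);
            System.out.println("localized");
        } catch (IOException e) {
            System.out.println("job would fail: " + e.getMessage());
        }
    }
}
```

In this model, re-uploading a jar changes its timestamp even if its contents
are identical, which is why a routine redeploy could break running jobs.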
Thanks,
Virag