Hi, I have browsed through the docs, but didn't find any satisfactory answers to the following questions.
a) What happens if input data is delayed? More concretely, say my job is triggered by input being available at certain time. Subsequently, the job executes successfully. At some later time, delayed data comes in; consequently, the job should be kicked off. Does oozie store metadata for each data set that was successfully processed? b) Is there any documentation on how the scheduler (coordinator) work? Many thanks, stan
