Hey Brian,

On Jun 6, 2012, at 5:40 PM, Brian Foster wrote:

> Okay... totally got behind on this thread... the purpose of this is that 
> regardless of whether we are talking about CAS-PGE running in the resource 
> manager or just some other generic resource manager job, typically any job 
> requires some set of files to exist before it runs, a temp directory to work 
> in, and temp directory cleanup... currently CAS-PGE (or any other job) has to 
> implement this logic (this really should be controlled at a higher level -- 
> this will also avoid directory collisions across jobs as well)... now if 
> CAS-PGE needs a file from the filemgr that is something CAS-PGE should be 
> responsible for.  So in relation to the emails below, pge-config.xml is the 
> file that needs to exist on or be visible to the machine before CAS-PGE is 
> run (CAS-PGE really shouldn't have to stage that file -- it makes for a hacky 
> implementation in CAS-PGE anyway).

Sure, gotcha.

> 
> I invision such a change to the resource manager would include being able to 
> specify a XML file with a list of need files for the job to run, and at 
> runtime the resource manager would stage those files to the temp working 
> directory it created for the job and then clean them up after job execution.  
> Something like:
> 
> <reqInput class="file.staging.class">
>   <file src="/path/to/pge-config.xml" 
> dest="path/relative/to/temp/working/dir/pge-config.xml"/>
> </reqInput>

What about making a specialized extension or orthogonal implementation of the 
Job interface in Resource manager. In other words,
a "WorkflowTaskJob" and "WorkflowTaskJobInput" 2.0-type of thing?

> 
> you could imagine that later you could even extend it to support zip packages 
> which it could stage and unzip:
> <reqInput class="file.staging.class">
>   <file src="/path/to/package.zip" 
> dest="path/relative/to/temp/working/dir/package" 
> postCopyHandler="unzip.logic.class"/>
> </reqInput>
> 

+1.

> This would be ideal for cloud computing since you could then package up your 
> JDK, binaries, etc and the resource manager would make sure they were 
> installed on the machine before executing its job.

Yap, agreed. Let me know if you think WorkflowTaskJob and WorkflowTaskJobInput 
seem like a good place to put stuff like this.

Cheers,
Chris

++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Chris Mattmann, Ph.D.
Senior Computer Scientist
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 171-266B, Mailstop: 171-246
Email: chris.a.mattm...@nasa.gov
WWW:   http://sunset.usc.edu/~mattmann/
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Adjunct Assistant Professor, Computer Science Department
University of Southern California, Los Angeles, CA 90089 USA
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++

Reply via email to