On 01/28/2015 02:16 PM, Gavin W. Burris wrote:
Didn't mean to upset you there, Ellis.  I'm talking about every other
discipline that isn't CSE.  I encourage researchers to NOT be their own
IT department, so that their time is freed up to do research.  Obviously
if your research IS the system, that is the exception.

I've only been on one side of the fence, but here is my perspective on the computational sciences and sysadmin relationship:

There's effectively a bell curve of users. With perfectly average sysadmins, the bell curve looks pretty normal. On the far left tail you've got your users whose programs and research operate PERFECTLY under the current regime...err...toolchain provided by said sysadmin.

The next quartile up, on the left side of the perfect average, holds a good bulk of researchers who truly don't want to be sysadmins and who are willing to change their programs to fit the available toolchain. The cost here falls on the researcher, who has to change her programs and spend hours all over the interwebs figuring out why such-and-such a compilation failed.

Just over the line, in the third quartile, we have another bulk of researchers who are just savvy enough to work around the sysadmins' toolchains, whether via homedir path and lib manipulation, chroots, or downright bribing/stealing root somehow and installing into public paths. The cost generally lands on the IT budget, which pays sysadmins to clean up after this "just savvy enough to be dangerous" user, and on other researchers whose code no longer compiles or runs because the toolchain has been mucked with.
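For anyone who hasn't seen it, the homedir workaround usually amounts to shadowing the system toolchain with a private prefix. A minimal sketch (a temp dir stands in for a real $HOME, and the "compiler" is a fake, purely so the demo is self-contained; a real user would configure with --prefix="$HOME/local"):

```shell
#!/bin/sh
# Illustrative only: shadow the system toolchain from a private prefix.
prefix=$(mktemp -d)
mkdir -p "$prefix/bin" "$prefix/lib"

# Stand-in for a privately built compiler.
printf '#!/bin/sh\necho "private gcc"\n' > "$prefix/bin/gcc"
chmod +x "$prefix/bin/gcc"

# Prepending the private prefix makes it win over the system paths.
export PATH="$prefix/bin:$PATH"
export LD_LIBRARY_PATH="$prefix/lib:${LD_LIBRARY_PATH:-}"

gcc   # now resolves to the private copy, not the system one
```

Nothing in this requires root, which is exactly why it's invisible until someone else's build starts failing.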

In the last, far-right-most tiny quartile, we have the researchers who actually enjoy some amount of sysadmin work and are roughly as capable as the departmentally paid ones. It's faster for them to just handle things themselves. They WILL get around you, no matter what you do, they'll enjoy doing so, and they'll have the wherewithal to know that if nobody notices, all the better. If you resist, they'll just make things painful for everyone, and no amount of stick-wielding will dissuade them.

On the two far tails the aggregate costs are generally low; in the middle they tend to be high. Offering multiple toolchains on a single machine is non-trivial, and dealing with those who force multiple toolchains/drivers/kernels/whatever into such a setup is expensive to correct.

So, the obvious answer here is: provide your "standard operating environments" as containerized/VM/whatever images that quartiles 1 and 2 can use, and allow quartiles 3 and 4 to spin up their own. With multiple environments available, quartile 2 can probably just try their program A on environments X, Y, and Z and find one that "just works." That cuts down their time futzing with compilers or fixing other researchers' crappy code that breaks on GCC > 4.x. Quartile 3 can spin up their own absolutely crap environment, think they're l33t, and not screw over their fellow researchers. Quartiles 1 and 4 are basically untouched, since they were fine before and will be now.
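A quartile-2 workflow against such images could look like the sketch below. The environment names and the success condition are invented for illustration; in a real setup try_env might wrap something like singularity exec or docker run:

```shell
#!/bin/sh
# Sketch: try program A under each site-provided environment until one
# "just works". Hypothetical names; a real try_env might be e.g.
#   singularity exec "$1.sif" ./program_a
try_env() {
    [ "$1" = "env-z" ]   # simulate: only env-z has a compatible toolchain
}

for env in env-x env-y env-z; do
    if try_env "$env"; then
        echo "program A runs under $env"
        break
    fi
done
```

The point is that the researcher shops among prebuilt environments instead of filing a ticket or mucking with the shared toolchain.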

Everybody wins, probably most of all the IT department.

Best,

ellis
_______________________________________________
Beowulf mailing list, [email protected] sponsored by Penguin Computing
To change your subscription (digest mode or unsubscribe) visit 
http://www.beowulf.org/mailman/listinfo/beowulf