Hi Reuti,
On 27/06/14 12:05, Reuti wrote:
Hi,
Am 27.06.2014 um 12:37 schrieb Tina Friedrich:
maybe someone here has an idea where to look for this...
We have some software - I think its a bash script that calls a python script.
Up until very recently, it ran just fine. And then it started randomly failing.
As in, it always starts, but only sometimes finish (or rather, or only
sometimes produce output). It seems to run; simply doesn't manage to
print/write anything. So far, it seems that it always works if run via qrsh or
qlogin (i.e. only fails when using qsub). We've straced it - both working and
non-working - and the only thing that told us is that after the python script
starts (and should print something), in the runs where it works a 'write' is
called and if it fails it doesn't.
So it's not a problem with buffered output like in Perl (I'm not a Python pro), but the kernel call
to "write" is missing. Are the environment variables exactly the same between the two
runs (maybe one will trigger a "supress output" when set)? Any LD_PRELOAD in effect?
-- Reuti
Hm. Did test forcing unbuffered, and it *seemed* to work better.
The problem with this is that - as we now found - it's intermittent, or
so it seems. Sometimes, they work fine - so when we tested buffered vs
non-buffered, none of the jobs failed.
Makes it hard to test things, really.
Tina
Anyone has any idea?
Tina
--
Tina Friedrich, Computer Systems Administrator, Diamond Light Source Ltd
Diamond House, Harwell Science and Innovation Campus - 01235 77 8442
--
This e-mail and any attachments may contain confidential, copyright and or
privileged material, and are for the use of the intended addressee only. If you
are not the intended addressee or an authorised recipient of the addressee
please notify us of receipt by returning the e-mail and do not use, copy,
retain, distribute or disclose the information in or attached to the e-mail.
Any opinions expressed within this e-mail are those of the individual and not
necessarily of Diamond Light Source Ltd. Diamond Light Source Ltd. cannot
guarantee that this e-mail or any attachments are free from viruses and we
cannot accept liability for any damage which you may sustain as a result of
software viruses which may be transmitted in or with the message.
Diamond Light Source Limited (company no. 4375679). Registered in England and
Wales with its registered office at Diamond House, Harwell Science and
Innovation Campus, Didcot, Oxfordshire, OX11 0DE, United Kingdom
_______________________________________________
users mailing list
[email protected]
https://gridengine.org/mailman/listinfo/users
--
Tina Friedrich, Computer Systems Administrator, Diamond Light Source Ltd
Diamond House, Harwell Science and Innovation Campus - 01235 77 8442
--
This e-mail and any attachments may contain confidential, copyright and or
privileged material, and are for the use of the intended addressee only. If you
are not the intended addressee or an authorised recipient of the addressee
please notify us of receipt by returning the e-mail and do not use, copy,
retain, distribute or disclose the information in or attached to the e-mail.
Any opinions expressed within this e-mail are those of the individual and not necessarily of Diamond Light Source Ltd.
Diamond Light Source Ltd. cannot guarantee that this e-mail or any attachments are free from viruses and we cannot accept liability for any damage which you may sustain as a result of software viruses which may be transmitted in or with the message.
Diamond Light Source Limited (company no. 4375679). Registered in England and
Wales with its registered office at Diamond House, Harwell Science and
Innovation Campus, Didcot, Oxfordshire, OX11 0DE, United Kingdom
_______________________________________________
users mailing list
[email protected]
https://gridengine.org/mailman/listinfo/users