Hi,

We are seeing a strange permissions issue when submitting jobs to new hosts 
added to our clusters. When jobs are submitted to user directories with normal 
permissions, they invariably go into the error state.


The logs show the following:

12/03/2018 14:16:55|worker|<master>|W|job 210165.1 failed on host <hostname> 
general opening input/output file because: 12/03/2018 15:16:54 [899:26827]: 
error: can't open output file "<folder>/test.sh.o210165": Permission denied
12/03/2018 14:16:55|worker|<master>|W|rescheduling job 210165.1

The strange thing is that when we give a directory full permissions (777) and 
run the job in it, the job runs fine, but the output and error files are owned 
by the "sgeadmin" user and have 744 permissions.

The user directories, job directories and SGE_ROOT folder are all NFS volumes 
mounted in the same way on all hosts. Hosts that were already in the cluster 
show no such issue, and jobs run fine on them.
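
For reference, below is a minimal sketch of a check that could be run as the 
submitting user on one old host and one new host, to compare how the directory 
owner and the user resolve on each. It assumes that a UID or user-name mismatch 
between hosts is a possible cause, which we have not confirmed. The function 
name describe_access and the example path are hypothetical, not part of SGE.

```python
import os
import pwd
import stat


def describe_access(path):
    """Summarise who owns `path` and whether this process may write to it."""
    st = os.stat(path)
    try:
        # Resolve the numeric owner UID to a name using this host's passwd data.
        owner_name = pwd.getpwuid(st.st_uid).pw_name
    except KeyError:
        # The UID stored on the NFS volume has no matching account on this host.
        owner_name = None
    return {
        "path": path,
        "owner_uid": st.st_uid,
        "owner_name": owner_name,
        "mode": oct(stat.S_IMODE(st.st_mode)),  # permission bits only
        "process_uid": os.getuid(),
        # os.access checks against the real UID/GID of the calling process.
        "writable": os.access(path, os.W_OK),
    }


# Hypothetical usage, with the job's output directory as the argument:
# print(describe_access("/path/to/job/directory"))
```

If the owner UID, owner name or "writable" value differs between an old host 
and a new host for the same directory, that would point at how users are 
resolved on the new hosts rather than at the directory permissions themselves.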

Can anyone please suggest what might be wrong here?

Thanks and regards,
Srinivas.


