Hello,
nice to hear.
I had a look from all sides but it seemed the SGE master thought the queues on the hosts were full.
This morning when I looked I saw only willow doing some jobs - ortelius still
having this strange state.
I waited for Merl to advise me probably reconfiguring but it was too early this
morning and I simply deleted some (5 i think)
jobs from the queues that were issued on 9th Oct when user-store failed.
I felt the users might probably not wait for this job until now anyway and
hoped the queues would regenerate as they were modified.
This seems to have solved the problem to my luck ;) as in the logs it seems the
jobs were running fine then.
Cheers
nosy
On Sun, 25 Nov 2012, Dr. Trigon wrote:
Date: Sun, 25 Nov 2012 11:33:19
From: Dr. Trigon <dr.tri...@surfeu.ch>
Reply-To: Wikimedia Toolserver <toolserver-l@lists.wikimedia.org>
To: toolserver-l@lists.wikimedia.org
Subject: Re: [Toolserver-l] SGE queue waiting forever?
-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1
Today it seems to be working and fully functional again... Nice job! ;)
Thanks to all involved here!
Greetings and have a nice weekend
DrTrigon
On 24.11.2012 22:10, Platonides wrote:
On 24/11/12 21:38, Dr. Trigon wrote:
@All: If you are working on big files please copy them to local
temp first (on sge $TMP contains an individual temp dir for
the job). E.g. piping big files to other slow programs causes
much nfs load because data must be read in small packages which
cause high load on servers. That's why sge cannot schedule new
jobs on nightshade since days.
What is a big file? Is it ok if the file is in user-home?
Thanks and greetings DrTrigon
/home is also mounted with nfs
Although it's strange that reading from big files overloads the
servers. stdio or the equivalent functionality in the language
they are made should be making it work in blocks.
Looking at willow mounts, /shared and /home are mounted with nfsv3
over udp. But /mnt/user-store and /install don't show it, so they
are probably using nfsv4 over tcp. Is that intended?
_______________________________________________ Toolserver-l
mailing list (Toolserver-l@lists.wikimedia.org)
https://lists.wikimedia.org/mailman/listinfo/toolserver-l Posting
guidelines for this list:
https://wiki.toolserver.org/view/Mailing_list_etiquette
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.12 (GNU/Linux)
Comment: Using GnuPG with Mozilla - http://www.enigmail.net/
iEYEARECAAYFAlCx8+8ACgkQAXWvBxzBrDBSyQCfc7mOdoj45Phyx0p+9Be5sm99
tdcAn0m3hTWswEuvfBGAIBlsmMW9uhNO
=+rBS
-----END PGP SIGNATURE-----
_______________________________________________
Toolserver-l mailing list (Toolserver-l@lists.wikimedia.org)
https://lists.wikimedia.org/mailman/listinfo/toolserver-l
Posting guidelines for this list:
https://wiki.toolserver.org/view/Mailing_list_etiquette
_______________________________________________
Toolserver-l mailing list (Toolserver-l@lists.wikimedia.org)
https://lists.wikimedia.org/mailman/listinfo/toolserver-l
Posting guidelines for this list:
https://wiki.toolserver.org/view/Mailing_list_etiquette