On Fri, Jan 13, 2012 at 12:02 AM, Simon Matthews <[email protected]> wrote:
> I have an installation of SGE 6.2U4 that I downloaded some years ago that I
> have installed on a couple of qmaster hosts.

Are you using the same version of SGE (SGE 6.2u4) on both the qmaster
& the node? You can run "qconf -help | head -1" on both the master &
the node to show the version.

Most common GDI errors are due to mismatching versions of Grid Engine
- and if you are running the same version of SGE, then let us know, I
will dig into the code to see what can possibly go wrong.

And don't worry about still using the Sun binaries, I work with sites
that have even older versions of Grid Engine. Sun contributed the code
to open source, and without Sun we wouldn't have this community.

Rayson

> I hope that I do not offend the users of this list by asking for help using
> a binary installation, using binaries built by Sun.
>
> I hope that someone can shed some light on the problem.
>
> I have built some new virtualized clients using KVM on a Centos 6 host. The
> Centos 5 client seems to work properly, but the Centos 4 client does not. I
> need a Centos 4 execd for testing purposes.
>
> I cannot install sge_execd, because of the qconf problems.
>
> qconf -sh results in:
>
> qconf -sh
> ERROR: failed receiving gdi request response for mid=1 (got no message).
>
> I get this message if I try this client against the new cluster and a
> cluster that has been running for several years. Other Centos 4 clients can
> run "qconf -ch" against both clusters without problem.
>
> qping works from the problematic client:
> qping -info sgemaster 6444 qmaster 1
> 01/12/2012 20:59:38:
> SIRM version: 0.1
> SIRM message id: 1
> start time: 01/12/2012 16:31:57 (1326414717)
> run time [s]: 16052
> messages in read buffer: 0
> messages in write buffer: 0
> nr. of connected clients: 2
> status: 2
> info: MAIN: E (16052.50) | signaler000: E (16052.05) |
> event_master000: E (0.58) | timer000: E (1.58) | worker000: W (41.59) |
> worker001: W (101.61) | listener000: W (5.58) | listener001: W (5.58) |
> scheduler000: W (5.57) | ERROR
> malloc: arena(0) |ordblks(1) | smblks(0) | hblksr(0) |
> hblhkd(0) usmblks(0) | fsmblks(0) | uordblks(0) | fordblks(0) | keepcost(0)
> Monitor: disabled
>
> Simon
>
> _______________________________________________
> users mailing list
> [email protected]
> https://gridengine.org/mailman/listinfo/users
