Hi Reuti,

Thank you very much for explaining and pointing to the valuable learning
resource.

I ran into a very weird issue.

My running env:

one 64bit QMASTER and a few of 64bit QEXECD running good. One 32bit Qexecd
on 32bit SLES10SP4.

32bit Qexecd works fine if it join all 32bit SGE environment. I use the
same binary to start, once it point to 64bit Qmaster, the segment fault
happens. Even, when I use qping to collect Qmaster info, it works when
collecting 32bit Qmaster, but it failed when connecting 64Qmaster for some
reason.

I am so confused. :( Any idea?

Thanks,
Edgy


On Thu, Dec 17, 2015 at 7:46 AM, Reuti <re...@staff.uni-marburg.de> wrote:

>
> > Am 17.12.2015 um 13:19 schrieb Steven Du <edgef...@gmail.com>:
> >
> > Hi Reuti/Joshua,
> >
> > I tried but failed. Here I wonder ( I am using 2011.11p1 version.):
> >
> > 1. If all GRID members have to be NFS shared spooling dir, such as
> $SGE_CELL/common. Otherwise, the client cannot be started. I doubt, but
> once I used the different name, the sge_execd cannot be started.
>
> You would at least need a copy of it on all nodes. But this is not related
> to different architectures.
>
> https://arc.liv.ac.uk/SGE/howto/nfsreduce.html
>
>
> > 2. This is related to the 1st question. In bootstrap config, it need to
> config qmaster spooling dir, if there is no NFS share, the sge_execd cannot
> access qmaster spooling dir. Then why do we need config qmaster spooling
> dir?
>
> Not all information in the files in $SGE_ROOT/default/common is read by
> all daemons. The sgeexecd will never access the spooling dir of the
> qmaster. It's to be set up there only for sgemaster to know where to look
> at. OTOH the binary_path setting targets both daemons.
>
>
> > 3. I can start sge_execd daemon and it can listen on one port, but it
> cannot communicate to QMASTER, even if I configered QMASTER port and
> act_qmaster server name.
> >
> > 4. When I started sge_execd, and run any qstat command, I got "Segment
> fault" for some reason. I will follow this one later.
>
> You complied this version on your own? On a platform with the same
> architecture which the final nodes will use? Otherwise maybe the compiler
> needs settings not to include the latest CPU features.
>
> -- Reuti
>
>
> > Thanks,
> > Edgy
> >
> > On Tue, Dec 15, 2015 at 11:38 PM, Steven Du <edgef...@gmail.com> wrote:
> > Hi Reuti, Joshua,
> >
> > Thank you very much GURU!
> >
> > That is a really good news!!! Actually, that is exactly situation we are
> facing right now. We need have SGE Master on em64t, and submission host on
> x86, and execution hosts are x86 and x64(em64t).
> >
> > We used to use all NFS share copy for SGE master and exec client, except
> for exec client spooling dir. Then, as all hosts are x86, so all hosts,
> including master and client, share one bootstrap file. But, now, we are
> going to put some x64 boxes to GRID. That's why I need to find the way to
> work out.
> >
> > Just very curious, I could not find any documents or web page for
> building hybrid environment. So I suspect my solution even if I thought it
> should work. :)
> >
> > Thank you again! You save us a lot time and effort!
> >
> > I will work on this in the next few days and update what I get.
> >
> > Edgy
> >
> > On Tue, Dec 15, 2015 at 6:24 AM, Reuti <re...@staff.uni-marburg.de>
> wrote:
> >
> > > Am 15.12.2015 um 05:45 schrieb Steven Du <edgef...@gmail.com>:
> > >
> > > Thank you very much!
> > >
> > > Does it mean there is no any issue on SGE master to manage x64 and x32
> client? Is it right?
> >
> > Yep, you can even mix different operating systems and throw in some AIX
> or FreeBSD clients. To ease the creation of uniform job scripts which run
> independent from the machine they are execute on, one can use the
> environment variable $ARC and organize the binaries and/or their containing
> directories accordingly:
> >
> > #!/bin/sh
> > # Set some stuff
> > FOO=BAZ
> > # Execute the binary
> > /opt/software/$ARC/foobar
> >
> > Having directories /opt/software/lx-amd64 and /opt/software/lx-x86 you
> will always get the correct binary. I even use this to distinguish between
> amd64 and em64t to get the correct binaries for each type of CPU although
> the jobscript stays the same.
> >
> > -- Reuti
> >
> >
> > > I will try to setup experimental environment.
> > >
> > > Will update later once I prove.
> > >
> > > Thanks,
> > > Edgy
> > >
> > > On Mon, Dec 14, 2015 at 11:28 PM, Joshua Baker-LePain <j...@salilab.org>
> wrote:
> > > On Mon, 14 Dec 2015 at 4:53pm, Steven Du wrote
> > >
> > > I wonder if I am able to build hybrid SGE computing environment.
> > >
> > > It is about running one SGE master on Intel x64 host with RHEL, and SGE
> > > execd on 64bit and 32bit RHEL. And then, I am able to submit my jobs
> to any
> > > exec hosts I like. Such as, some jobs go to 32bit only, others go to
> 64 bit
> > > only.
> > >
> > > Based on my understanding, it should work.
> > >
> > > I did a Google search, but I could not find any article about it. Do
> you
> > > know? Or if anyone has the similar running environment. Please share
> you
> > > thought! I very appreciate your help.
> > >
> > > Yep, this is not a problem.  I run such an environment.  Be sure that
> users submit jobs with the proper "-l arch=" request so that they go to the
> right architecture.
> > >
> > > --
> > > Joshua Baker-LePain
> > > QB3 Shared Cluster Sysadmin
> > > UCSF
> > >
> > > _______________________________________________
> > > users mailing list
> > > users@gridengine.org
> > > https://gridengine.org/mailman/listinfo/users
> >
> >
> >
>
>
_______________________________________________
users mailing list
users@gridengine.org
https://gridengine.org/mailman/listinfo/users

Reply via email to