Hello all,
I'm trying to integrate SGE (GE 6.1u3) with Globus (globus-4.0.7),
but I still have the same old problem that I had in previous attempts.
I'm using the Globus + SGE integration provided by the LESC,
http://www.lesc.ic.ac.uk/projects/SGE-GT4.html
I can submit jobs with Fork correctly and I can submit jobs with *qsub*
correctly too. I have also enabled *reporting_params reporting=true*,
and the reporting output is accessible to globus.
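For anyone who wants to repeat the accessibility check, here is a
minimal sketch of how it can be done. The reporting file path is an
assumption: it is the usual $SGE_ROOT/$SGE_CELL/common/reporting
location, derived from the settings.sh path shown further down, and
reporting_status is only a helper name made up for this example.

```shell
#!/bin/sh
# Sketch: report whether the current account can read the SGE reporting file.
# ASSUMPTION: default location, derived from the settings.sh path in this mail.
REPORTING_FILE=${REPORTING_FILE:-/usr/local/sge/pro/default/common/reporting}

# Print "missing", "readable" or "unreadable" for the file given as $1.
# (reporting_status is a hypothetical helper, not part of SGE or Globus.)
reporting_status() {
    if [ ! -e "$1" ]; then
        echo "missing"
    elif [ -r "$1" ]; then
        echo "readable"
    else
        echo "unreadable"
    fi
}

echo "reporting file $REPORTING_FILE: $(reporting_status "$REPORTING_FILE")"
```

It only tells something useful when run as the account that runs the
Globus container. The setting itself can be checked with
qconf -sconf | grep reporting_params.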
I attach to this e-mail the outputs that I consider most important. I
submit the job to SGE with the *globusrun-ws* command; the job starts
running under SGE correctly and finishes successfully (according to
SGE). The *.stdout and *.stderr files are generated correctly in the
user's home directory, and the *.stdout file contains the correct output
for the job, but for some reason the job manager doesn't see that the
job has finished, and it stays at *Current job state: Unsubmitted*
without ever finishing until I press [ctrl + c].
I have been looking at
*/usr/local/globus-4.0.7/lib/perl/Globus/GRAM/JobManager/sge.pm*, in
particular at the *sub poll* function, which checks with qstat -j
whether the job has finished. From debugging, the qstat is never run:
the qsub is executed correctly and returns the job ID, but at some step
that I can't identify it stops, and the poll function is never executed.
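To make clear what kind of check I mean, here is a rough sketch of a
manual poll. It is NOT the code from sge.pm, only the same idea done by
hand: ask qstat -j about a job ID and look at the exit status. It
assumes that qstat -j exits non-zero for a job the scheduler no longer
knows about; job_state is a helper name invented for this example, and
the qstat path is the one reported by find-sge-tools below.

```shell
#!/bin/sh
# Sketch of a manual poll, NOT the code from sge.pm.
# QSTAT can be overridden; the default is the path found by find-sge-tools.
QSTAT=${QSTAT:-/usr/local/sge/pro/bin/lx26-x86/qstat}

# Print "active" if "$QSTAT -j <id>" succeeds, "gone" otherwise.
# ASSUMPTION: qstat -j exits non-zero for a job it no longer knows about.
# Note that a missing or broken qstat binary is also reported as "gone".
job_state() {
    if "$QSTAT" -j "$1" >/dev/null 2>&1; then
        echo "active"
    else
        echo "gone"
    fi
}

# Check the job ID given as first argument, if there is one.
if [ -n "${1:-}" ]; then
    echo "job $1 is $(job_state "$1")"
fi
```

Called with the job ID returned by qsub, it should print "active"
while the job is queued or running and "gone" once it has left the
scheduler.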
On the other hand, we have configured 'sge_qstat' so that it isn't
necessary to run qstat -u '*' to see all running/queued jobs;
therefore the difference from previous versions of SGE is minimal.
[EMAIL PROTECTED] ~]$ cat /usr/local/sge/pro/default/common/sge_qstat
-u *
I would appreciate any help, and comments are welcome.
Thanks in advance,
Esteban
[EMAIL PROTECTED] ~]$ globusrun-ws -submit -pft -T 10000 -s -S -F
https://test01.egee.cesga.es:8443/wsrf/services/ManagedJobFactoryService -Ft
SGE -c /bin/date
Delegating user credentials...Done.
Submitting job...Done.
Job ID: uuid:ab28dbd2-799d-11dd-9826-00163e1c20ad
Termination time: 09/06/2008 09:49 GMT
Current job state: Unsubmitted
*/ It remains in that state until I execute [ctrl + c] */
[EMAIL PROTECTED] sge_globus_packages]$ gpt-postinstall -force
running /usr/local/globus-4.0.7/setup/globus/setup-globus-common..[ Changing to
/usr/local/globus-4.0.7/setup/globus ]
creating globus-sh-tools-vars.sh
creating globus-script-initializer
creating Globus::Core::Paths
checking globus-hostname
Done
..Done
running /usr/local/globus-4.0.7/setup/globus/setup-globus-rls-server..[
Changing to /usr/local/globus-4.0.7/setup/globus ]
creating SXXrls
creating globus-rls-server.conf
creating rls-ldif.conf
creating odbc.ini
/usr/local/globus-4.0.7/etc/globus-rls-server.conf already exists, not
replacing with /usr/local/globus-4.0.7/setup/globus/globus-rls-server.conf
/usr/local/globus-4.0.7/var/odbc.ini already exists, not replacing with
/usr/local/globus-4.0.7/setup/globus/odbc.ini
/usr/local/globus-4.0.7/var/lrc1000.db already exists, not recreating lrc1000
database.
/usr/local/globus-4.0.7/var/rli1000.db already exists, not recreating rli1000
database.
Done
..Done
running
/usr/local/globus-4.0.7/setup/globus/setup-globus-scheduler-provider-fork..[
Changing to /usr/local/globus-4.0.7/setup/globus ]
checking for touch... /bin/touch
find-fork-provider-tools: creating ./config.status
config.status: creating
/usr/local/globus-4.0.7/libexec/globus-scheduler-provider-fork
..Done
running /usr/local/globus-4.0.7/setup/globus/setup-gram-service-common..[
Changing to /usr/local/globus-4.0.7/setup/globus ]
Running /usr/local/globus-4.0.7/setup/globus/setup-gram-service-common
Determining system information...
Exception in thread "main" java.lang.NoClassDefFoundError:
org/apache/tools/ant/launch/Launcher
Error creating GRAM configuration.
..Done
running /usr/local/globus-4.0.7/setup/globus/setup-gram-service-fork..[
Changing to /usr/local/globus-4.0.7/setup/globus ]
Running /usr/local/globus-4.0.7/setup/globus/setup-gram-service-fork
..Done
running /usr/local/globus-4.0.7/setup/globus/setup-gram-service-multi..[
Changing to /usr/local/globus-4.0.7/setup/globus ]
Running /usr/local/globus-4.0.7/setup/globus/setup-gram-service-multi
..Done
running /usr/local/globus-4.0.7/setup/globus/setup-gram-service-sge..[ Changing
to /usr/local/globus-4.0.7/setup/globus ]
Running /usr/local/globus-4.0.7/setup/globus/setup-gram-service-sge
..Done
running /usr/local/globus-4.0.7/setup/globus/setup-rft..[ Changing to
/usr/local/globus-4.0.7/setup/globus ]
Running /usr/local/globus-4.0.7/setup/globus/setup-rft
Exception in thread "main" java.lang.NoClassDefFoundError:
org/apache/tools/ant/launch/Launcher
Error creating RFT configuration.
..Done
running /usr/local/globus-4.0.7/setup/gsi_openssh_setup/setup-openssh..[
Changing to /usr/local/globus-4.0.7/setup/gsi_openssh_setup ]
Configuring gsi_openssh
------------------------------------------------------------
Executing...
Notes:
o Privilege separation is off.
o GSI-OpenSSH website is <http://grid.ncsa.uiuc.edu/ssh/>.
------------------------------------------------------------
Finished configuring gsi_openssh.
..Done
running
/usr/local/globus-4.0.7/setup/globus/setup-globus-gaa-authz-callout-message..[
Changing to /usr/local/globus-4.0.7/setup/globus ]
If you wish to configure the optional GAA-based Globus Authorization
callouts, run the setup-globus-gaa-authz-callout setup script.
..Done
running /usr/local/globus-4.0.7/setup/globus/setup-globus-gatekeeper..[
Changing to /usr/local/globus-4.0.7/setup/globus ]
Creating gatekeeper configuration file...
Done
..Done
running /usr/local/globus-4.0.7/setup/globus/setup-globus-gram-auditing..[
Changing to /usr/local/globus-4.0.7/setup/globus ]
..Done
running /usr/local/globus-4.0.7/setup/globus/setup-seg-fork.pl..[ Changing to
/usr/local/globus-4.0.7/setup/globus ]
..Done
running /usr/local/globus-4.0.7/setup/globus/setup-seg-sge.pl..[ Changing to
/usr/local/globus-4.0.7/setup/globus ]
..Done
running /usr/local/globus-4.0.7/setup/globus/setup-globus-gram-job-manager..[
Changing to /usr/local/globus-4.0.7/setup/globus ]
Creating state file directory.
Done.
Reading gatekeeper configuration file...
Determining system information...
Creating job manager configuration file...
Done
..Done
running /usr/local/globus-4.0.7/setup/globus/setup-globus-job-manager-fork..[
Changing to /usr/local/globus-4.0.7/setup/globus ]
find-fork-tools: WARNING: "Cannot locate mpiexec"
find-fork-tools: WARNING: "Cannot locate mpirun"
checking for mpiexec... no
checking for mpirun... no
find-fork-tools: creating ./config.status
config.status: creating fork.pm
..Done
running /usr/local/globus-4.0.7/setup/globus/setup-globus-job-manager-sge..[
Changing to /usr/local/globus-4.0.7/setup/globus ]
MPI_PE == NONE
find-sge-tools: WARNING: Cannot locate mpirun
find-sge-tools: WARNING: Sun MPI not available
find-sge-tools: WARNING: Could not determine Grid Engine's mode. Assuming SGE
checking for gawk... gawk
checking for mpirun... no
checking for mprun... no
checking for cat... /bin/cat
checking for qsub... /usr/local/sge/pro/bin/lx26-x86/qsub
checking for qstat... /usr/local/sge/pro/bin/lx26-x86/qstat
checking for qdel... /usr/local/sge/pro/bin/lx26-x86/qdel
checking for qselect... /usr/local/sge/pro/bin/lx26-x86/qselect
checking for qhost... /usr/local/sge/pro/bin/lx26-x86/qhost
checking for qconf... /usr/local/sge/pro/bin/lx26-x86/qconf
checking for qacct... /usr/local/sge/pro/bin/lx26-x86/qacct
find-sge-tools: creating ./config.status
config.status: creating sge.pm
..Done
[EMAIL PROTECTED] sge_globus_packages]$
#!/bin/sh
# Grid Engine batch job script built by Globus job manager
#$ -S /bin/sh
#$ -m n
#$ -o /home/cyteduser/773ef49a-799e-11dd-89b9-00163e1c20ad.0.stdout
#$ -e /home/cyteduser/773ef49a-799e-11dd-89b9-00163e1c20ad.0.stderr
GLOBUS_LOCATION=/usr/local/globus-4.0.7/; export GLOBUS_LOCATION
X509_CERT_DIR=/etc/grid-security/certificates; export X509_CERT_DIR
X509_USER_PROXY=; export X509_USER_PROXY
X509_USER_CERT=; export X509_USER_CERT
X509_USER_KEY=; export X509_USER_KEY
HOME=/home/cyteduser; export HOME
LOGNAME=cyteduser; export LOGNAME
SCRATCH_DIRECTORY=/home/cyteduser/.globus/scratch; export SCRATCH_DIRECTORY
JAVA_HOME=/usr/java/j2sdk1.4.2_18/jre; export JAVA_HOME
GLOBUS_GRAM_JOB_HANDLE=https://84.21.169.31:8443/wsrf/services/ManagedExecutableJobService?77868080-799e-11dd-8ac7-b8e2378e8bad;
export GLOBUS_GRAM_JOB_HANDLE
LD_LIBRARY_PATH=;
export LD_LIBRARY_PATH;
. /usr/local/sge/pro/default/common/settings.sh
# Change to directory requested by user
cd /home/cyteduser
/bin/date < /dev/null&
wait
[EMAIL PROTECTED] ~]# cat
/home/cyteduser/05fc787a-799e-11dd-9be2-00163e1c20ad.0.stdout
Wed Sep 5 10:52:42 CEST 2008