Hi Martin
Sorry for late response. While this error was resolved (condor was not
running on the gatekeeper node), I now see a different error.
I submit the following simple job:
It remains in Pending State forever. On the gatekeeper end, its status
is "Idle" forever
The GridManager Log indicate 2 types of errors:
9/5 13:37:02 [26586] Calling pipe Handler
<&GahpServer::err_pipe_ready> for Pipe end=65540 <m_gahp_errorfd>
9/5 13:37:02 [26586] Return from pipe Handler
9/5 13:37:02 [26586] Calling pipe Handler
<&GahpServer::err_pipe_ready> for Pipe end=65540 <m_gahp_errorfd>
9/5 13:37:02 [26586] Return from pipe Handler
followed by
9/5 13:37:02 [26586] refresh_credentials(https://tg-s037.ncsa.teragrid.org:8443/wsrf/services/DelegationService?517b6220-7ab6-11dd-b49c-9368617134a5
) failed!
Any idea what could be causing this? I should mention that couple of
weeks ago, everything was working just fine.
Thanks
-Neha
On Sep 2, 2008, at 1:34 PM, Martin Feller wrote:
Neha,
Please submit the job again with globusrun-ws running in debug mode
(globusrun-ws -dbg ... ) and send the output on the client-side.
Also: I you have a server-side container log available, please send
it too.
Do you know the server-side GT version?
Martin
Neha Sharma wrote:
Hi
I am now seeing a problem with globusrun-ws job submission against
a web services container which was working fine before.
If I run a job that requires streaming, I see that its says
"fileCleanUp" element is invalid
-bash-3.00$ globusrun-ws -submit -Jf neha.epr.tg -F
tg.opensciencegrid.org:9443 -Ft Cemon -streaming -se n.err -so
n.out -c /bin/date
Delegating user credentials...Done.
Submitting job...Done.
Job ID: uuid:5712d3e8-790e-11dd-bbd6-001422086c92
Termination time: 09/03/2008 16:43 GMT
Current job state: Failed
Destroying job...Done.
Cleaning up any delegated credentials...Done.
globusrun-ws: Job failed: Invalid fileCleanUp element.
No value found for RSL substitution variable GLOBUS_USER_HOME.
If I run a job with no streaming, it says "directory" element is
invalid
-bash-3.00$ globusrun-ws -submit -Jf neha.epr.tg -F
tg.opensciencegrid.org:9443 -Ft Cemon -c /bin/date
Submitting job...Done.
Job ID: uuid:4ff49830-790e-11dd-a3db-001422086c92
Termination time: 09/03/2008 16:43 GMT
Current job state: Failed
Destroying job...Done.
globusrun-ws: Job failed: Invalid directory element.
No value found for RSL substitution variable GLOBUS_USER_HOME.
-bash-3.00$
Apparently, in both cases, it looks like value of GLOBUS_USER_HOME
is not being set..I checked that the home area for the grid user is
available on gatekeeper node.
What else could be wrong here?
Thanks
-Neha