Re: journal errors - return code 121 \\.\pipe\jnl

2003-02-11 Thread Pete Tanenhaus
I understand the concern, the point I was trying to make was that in some
cases it's difficult to determine
if  particular return codes merit something being written to the errorlog.

In the case of a named pipe error (on either the journal daemon or the
backup client) it isn't easy to
determine if the error occurs because the other process has terminated (or
timed out) either normally or abnormally.

All that having been said, the error messages in question really provide
diagnostic information that
probably isn't all that useful to the end user and is really is only useful
to development or support
when attempting to track down problems in the field.

So I agree that the messages should either be more meaningful or shouldn't
be in the errorlog
at all (probably more appropriate for the client tracing facility).

By the way, I tracked down the specific message in this thread ( return
code 121 \\.\pipe\jnl).

There are two types of named pipes used to communicate between the two
processes (journal daemin and b/a client).

One named pipe (\\.\pipe\jnl) name is created by the journal daemon during
startup and is used to receive inbound requests from
the b/a client.

The b/a client determines if the journal daemon is running by attempting to
connect to this pipe.

Return code 121 indicates that the b/a client timed out attempting to
connect to the pipe, which most likely means
that the journal daemon isn't running.

I think a more accurate message in the errorlog would be something like
"Connection to Journal Daemon timed out"
(and the specific return code could be documented with tracing on).

The other type(s) of named pipes are created by the b/a client and used to
receive responses from the journal daemon.

There is at least one response pipe created by the b/a client for each
backup session (and with multi-threaded backup
possible more since there can be multiple backup sessions).

Any request sent to the journal daemon which requires a response includes
this response pipe name.

When the journal daemon processes the response, it connects to the supplied
response pipe and sends response
data on it as needed.

The b/a client backup session waits for data to arrive on the response pipe
and the reads it.

An example of this would be a request from the b/a client to query all of
the objects in the journal for a particular file system.

The b/a client creates a pipe for the journal daemon to send the list of
matching objects on, and supplies the name of this
pipe to the journal daemon in the query request.

The journal daemon processes the query request, connects to the supplied
pipe, performs the query, and sends the responses
on the pipe..

The b/a client expects to look  (peeks) at the request pipe to determine if
there is any response data to read and continues
to read until it is empty.

Usually data will be available on the response pipe as soon as the b/a
client requests it, but in some cases (as with very
long running queries) the journal daemon doesn't post the data on the
response pipe in time and the b/a client must continue
to look (peek) at the pipe until it arrives.

I believe the above condition is the source of the  "NpPeek: No data"
message in the errorlog.

Since the b/a client will eventually timeout the session if the data isn't
received on the response pipe in
a reasonable amount of time, I don't see any use of logging the above
message, so I will remove it in a future release/ptf.

Anyway, I hope this helps .









Pete Tanenhaus
Tivoli Storage Solutions Software Development
email: [EMAIL PROTECTED]
tieline: 320.8778, external: 607.754.4213

"Those who refuse to challenge authority are condemned to conform to it"

-- Forwarded by Pete Tanenhaus/San Jose/IBM on
02/11/2003 11:58 AM ---

"Magura, Curtis" <[EMAIL PROTECTED]>@VM.MARIST.EDU> on 02/11/2003
11:32:26 AM

Please respond to "ADSM: Dist Stor Manager" <[EMAIL PROTECTED]>

Sent by:"ADSM: Dist Stor Manager" <[EMAIL PROTECTED]>


To:[EMAIL PROTECTED]
cc:
Subject:Re: journal errors - return code 121 \\.\pipe\jnl



Pete, think you said it all with the statement below! Very confusing in the
current state to decide if there is a problem or not.

"That being said, I think development (myself) needs to look at  the np
error logging on both sides and try to eliminate logging messages which
aren't really errors, but in some situations it's difficult to determine if
an error condition is legitimate and should be logged or if it is innocuous
and can be ignored."

Curt Magura
Lockheed Martin EIS
Orlando, Fla.
321-235-1203


-Original Message-
From: Pete Tanenhaus [mailto:[EMAIL PROTECTED]]
Sent: Tuesday, February 11, 2003 10:27 AM
To: [EMAIL PROTECTED]
Subject: Re: journal errors - return code 121 \\.\pipe\jnl


Np errors in the back

Re: journal errors - return code 121 \\.\pipe\jnl

2003-02-11 Thread Magura, Curtis
Pete, think you said it all with the statement below! Very confusing in the
current state to decide if there is a problem or not.

"That being said, I think development (myself) needs to look at  the np
error logging on both sides and try to eliminate logging messages which
aren't really errors, but in some situations it's difficult to determine if
an error condition is legitimate and should be logged or if it is innocuous
and can be ignored."

Curt Magura
Lockheed Martin EIS
Orlando, Fla.
321-235-1203


-Original Message-
From: Pete Tanenhaus [mailto:[EMAIL PROTECTED]]
Sent: Tuesday, February 11, 2003 10:27 AM
To: [EMAIL PROTECTED]
Subject: Re: journal errors - return code 121 \\.\pipe\jnl


Np errors in the backup client errorlog indicate the opposite condition,
that is the backup client
is trying to read a response sent from the journal daemon which isn't
available at the moment
the read is being done.

This error can happen if the journal daemon ends (obviously a problem) or
(I believe) if the
response the backup client is looking for from the journal daemon is still
in progress, meaning
that the journal daemon hasn't finished processing/sending it.

In most cases the response is ready when the backup client goes to read it,
but if it isn't
the backup client will keep trying to read the response until it either
arrives or a timeout occurs
(don't know the exact wait time of the top of my head).

That being said, I think development (myself) needs to look at  the np
error logging on both sides
and try to eliminate logging messages which aren't really errors, but in
some situations it's difficult
to determine if an error condition is legitimate and should be logged or if
it is innocuous and can be ignored.

Hope this helps ..

Pete Tanenhaus
Tivoli Storage Solutions Software Development
email: [EMAIL PROTECTED]
tieline: 320.8778, external: 607.754.4213



Re: journal errors - return code 121 \\.\pipe\jnl

2003-02-11 Thread Pete Tanenhaus
Np errors in the backup client errorlog indicate the opposite condition,
that is the backup client
is trying to read a response sent from the journal daemon which isn't
available at the moment
the read is being done.

This error can happen if the journal daemon ends (obviously a problem) or
(I believe) if the
response the backup client is looking for from the journal daemon is still
in progress, meaning
that the journal daemon hasn't finished processing/sending it.

In most cases the response is ready when the backup client goes to read it,
but if it isn't
the backup client will keep trying to read the response until it either
arrives or a timeout occurs
(don't know the exact wait time of the top of my head).

That being said, I think development (myself) needs to look at  the np
error logging on both sides
and try to eliminate logging messages which aren't really errors, but in
some situations it's difficult
to determine if an error condition is legitimate and should be logged or if
it is innocuous and can be ignored.

Hope this helps ..

Pete Tanenhaus
Tivoli Storage Solutions Software Development
email: [EMAIL PROTECTED]
tieline: 320.8778, external: 607.754.4213

"Those who refuse to challenge authority are condemned to conform to it"

-- Forwarded by Pete Tanenhaus/San Jose/IBM on
02/11/2003 10:13 AM ---

"Magura, Curtis" <[EMAIL PROTECTED]>@VM.MARIST.EDU> on 02/11/2003
09:27:26 AM

Please respond to "ADSM: Dist Stor Manager" <[EMAIL PROTECTED]>

Sent by:"ADSM: Dist Stor Manager" <[EMAIL PROTECTED]>


To:[EMAIL PROTECTED]
cc:
Subject:Re: journal errors - return code 121 \\.\pipe\jnl



Pete,

What about the message - 02/11/2003 04:02:14 NpPeek: No data. ?

We are also seeing this daily in dsmserror on the same clients. Is there a
connection?

Curt Magura
Lockheed Martin EIS
Orlando, Fla.
321-235-1203


-Original Message-
From: Pete Tanenhaus [mailto:[EMAIL PROTECTED]]
Sent: Tuesday, February 11, 2003 9:18 AM
To: [EMAIL PROTECTED]
Subject: Re: journal errors - return code 121 \\.\pipe\jnl


This error occurs when the journal daemon process attempts to send a
response on a named pipe provided by
a backup client process and the pipe no longer exists or isn't valid.

This can happen if the backup process/session ends or closes the pipe
before the journal daemon
sends or is finished sending the response or in some cases when the journal
daemon is shutting down and
cleaning up resources.

By itself the error is innocuous but I understand it is annoying to see
error messages in the log which can't
be explained.

Hope this helps 


Pete Tanenhaus
Tivoli Storage Solutions Software Development
email: [EMAIL PROTECTED]
tieline: 320.8778, external: 607.754.4213

"Those who refuse to challenge authority are condemned to conform to it"

-- Forwarded by Pete Tanenhaus/San Jose/IBM on
02/11/2003 09:14 AM ---

"Magura, Curtis" <[EMAIL PROTECTED]>@VM.MARIST.EDU> on 02/11/2003
06:49:34 AM

Please respond to "ADSM: Dist Stor Manager" <[EMAIL PROTECTED]>

Sent by:"ADSM: Dist Stor Manager" <[EMAIL PROTECTED]>


To:[EMAIL PROTECTED]
cc:
Subject:Re: journal errors - return code 121 \\.\pipe\jnl



Don't have good news other to say that I still see the message at 5.1.5.9
of
the client talking to 5.1.5.4 server. Haven't had time to call the support
center yet. Working a different call with the support center related to
cleanup backupgroups on one of the servers that just went to 5.1.5.4!

Curt Magura
Lockheed Martin EIS
Orlando, Fla.
321-235-1203


-----Original Message-
From: chris rees [mailto:[EMAIL PROTECTED]]
Sent: Tuesday, February 11, 2003 3:34 AM
To: [EMAIL PROTECTED]
Subject: journal errors - return code 121 \\.\pipe\jnl


Hi All

I'm getting the following from a new client that is running journaling.
TSM Client is 5.1.5.2, server is 5.1.5.4.

10-02-2002 09:20:05 NpOpen : Named pipe error connecting to server
WaitOnPipe failed.
NpOpen: call failed with return code:121 pipe name \\.\pipe\jnl


I see there was a similar thread last week ( G.Gill and C.Magura )and
wondered if you guys got any further with it.  We have a few clients at the
above levels using journaling but only 1 seems to be giving the pipe
problem.

TIA

Chris










_
Tired of spam? Get advanced junk mail protection with MSN 8.
  http://join.msn.com/?page=features/junkmail



Re: journal errors - return code 121 \\.\pipe\jnl

2003-02-11 Thread Magura, Curtis
Pete,

What about the message - 02/11/2003 04:02:14 NpPeek: No data. ?

We are also seeing this daily in dsmserror on the same clients. Is there a
connection?

Curt Magura
Lockheed Martin EIS
Orlando, Fla.
321-235-1203


-Original Message-
From: Pete Tanenhaus [mailto:[EMAIL PROTECTED]]
Sent: Tuesday, February 11, 2003 9:18 AM
To: [EMAIL PROTECTED]
Subject: Re: journal errors - return code 121 \\.\pipe\jnl


This error occurs when the journal daemon process attempts to send a
response on a named pipe provided by
a backup client process and the pipe no longer exists or isn't valid.

This can happen if the backup process/session ends or closes the pipe
before the journal daemon
sends or is finished sending the response or in some cases when the journal
daemon is shutting down and
cleaning up resources.

By itself the error is innocuous but I understand it is annoying to see
error messages in the log which can't
be explained.

Hope this helps 


Pete Tanenhaus
Tivoli Storage Solutions Software Development
email: [EMAIL PROTECTED]
tieline: 320.8778, external: 607.754.4213

"Those who refuse to challenge authority are condemned to conform to it"

-- Forwarded by Pete Tanenhaus/San Jose/IBM on
02/11/2003 09:14 AM ---

"Magura, Curtis" <[EMAIL PROTECTED]>@VM.MARIST.EDU> on 02/11/2003
06:49:34 AM

Please respond to "ADSM: Dist Stor Manager" <[EMAIL PROTECTED]>

Sent by:"ADSM: Dist Stor Manager" <[EMAIL PROTECTED]>


To:    [EMAIL PROTECTED]
cc:
Subject:Re: journal errors - return code 121 \\.\pipe\jnl



Don't have good news other to say that I still see the message at 5.1.5.9
of
the client talking to 5.1.5.4 server. Haven't had time to call the support
center yet. Working a different call with the support center related to
cleanup backupgroups on one of the servers that just went to 5.1.5.4!

Curt Magura
Lockheed Martin EIS
Orlando, Fla.
321-235-1203


-Original Message-
From: chris rees [mailto:[EMAIL PROTECTED]]
Sent: Tuesday, February 11, 2003 3:34 AM
To: [EMAIL PROTECTED]
Subject: journal errors - return code 121 \\.\pipe\jnl


Hi All

I'm getting the following from a new client that is running journaling.
TSM Client is 5.1.5.2, server is 5.1.5.4.

10-02-2002 09:20:05 NpOpen : Named pipe error connecting to server
WaitOnPipe failed.
NpOpen: call failed with return code:121 pipe name \\.\pipe\jnl


I see there was a similar thread last week ( G.Gill and C.Magura )and
wondered if you guys got any further with it.  We have a few clients at the
above levels using journaling but only 1 seems to be giving the pipe
problem.

TIA

Chris










_
Tired of spam? Get advanced junk mail protection with MSN 8.
 http://join.msn.com/?page=features/junkmail



Re: journal errors - return code 121 \\.\pipe\jnl

2003-02-11 Thread Pete Tanenhaus
This error occurs when the journal daemon process attempts to send a
response on a named pipe provided by
a backup client process and the pipe no longer exists or isn't valid.

This can happen if the backup process/session ends or closes the pipe
before the journal daemon
sends or is finished sending the response or in some cases when the journal
daemon is shutting down and
cleaning up resources.

By itself the error is innocuous but I understand it is annoying to see
error messages in the log which can't
be explained.

Hope this helps 


Pete Tanenhaus
Tivoli Storage Solutions Software Development
email: [EMAIL PROTECTED]
tieline: 320.8778, external: 607.754.4213

"Those who refuse to challenge authority are condemned to conform to it"

-- Forwarded by Pete Tanenhaus/San Jose/IBM on
02/11/2003 09:14 AM ---

"Magura, Curtis" <[EMAIL PROTECTED]>@VM.MARIST.EDU> on 02/11/2003
06:49:34 AM

Please respond to "ADSM: Dist Stor Manager" <[EMAIL PROTECTED]>

Sent by:"ADSM: Dist Stor Manager" <[EMAIL PROTECTED]>


To:    [EMAIL PROTECTED]
cc:
Subject:Re: journal errors - return code 121 \\.\pipe\jnl



Don't have good news other to say that I still see the message at 5.1.5.9
of
the client talking to 5.1.5.4 server. Haven't had time to call the support
center yet. Working a different call with the support center related to
cleanup backupgroups on one of the servers that just went to 5.1.5.4!

Curt Magura
Lockheed Martin EIS
Orlando, Fla.
321-235-1203


-Original Message-
From: chris rees [mailto:[EMAIL PROTECTED]]
Sent: Tuesday, February 11, 2003 3:34 AM
To: [EMAIL PROTECTED]
Subject: journal errors - return code 121 \\.\pipe\jnl


Hi All

I'm getting the following from a new client that is running journaling.
TSM Client is 5.1.5.2, server is 5.1.5.4.

10-02-2002 09:20:05 NpOpen : Named pipe error connecting to server
WaitOnPipe failed.
NpOpen: call failed with return code:121 pipe name \\.\pipe\jnl


I see there was a similar thread last week ( G.Gill and C.Magura )and
wondered if you guys got any further with it.  We have a few clients at the
above levels using journaling but only 1 seems to be giving the pipe
problem.

TIA

Chris










_
Tired of spam? Get advanced junk mail protection with MSN 8.
 http://join.msn.com/?page=features/junkmail



Re: journal errors - return code 121 \\.\pipe\jnl

2003-02-11 Thread Magura, Curtis
Don't have good news other to say that I still see the message at 5.1.5.9 of
the client talking to 5.1.5.4 server. Haven't had time to call the support
center yet. Working a different call with the support center related to
cleanup backupgroups on one of the servers that just went to 5.1.5.4!

Curt Magura
Lockheed Martin EIS
Orlando, Fla.
321-235-1203


-Original Message-
From: chris rees [mailto:[EMAIL PROTECTED]]
Sent: Tuesday, February 11, 2003 3:34 AM
To: [EMAIL PROTECTED]
Subject: journal errors - return code 121 \\.\pipe\jnl


Hi All

I'm getting the following from a new client that is running journaling.
TSM Client is 5.1.5.2, server is 5.1.5.4.

10-02-2002 09:20:05 NpOpen : Named pipe error connecting to server
WaitOnPipe failed.
NpOpen: call failed with return code:121 pipe name \\.\pipe\jnl


I see there was a similar thread last week ( G.Gill and C.Magura )and
wondered if you guys got any further with it.  We have a few clients at the
above levels using journaling but only 1 seems to be giving the pipe
problem.

TIA

Chris










_
Tired of spam? Get advanced junk mail protection with MSN 8.
http://join.msn.com/?page=features/junkmail



journal errors - return code 121 \\.\pipe\jnl

2003-02-11 Thread chris rees
Hi All

I'm getting the following from a new client that is running journaling.
TSM Client is 5.1.5.2, server is 5.1.5.4.

10-02-2002 09:20:05 NpOpen : Named pipe error connecting to server
WaitOnPipe failed.
NpOpen: call failed with return code:121 pipe name \\.\pipe\jnl


I see there was a similar thread last week ( G.Gill and C.Magura )and
wondered if you guys got any further with it.  We have a few clients at the
above levels using journaling but only 1 seems to be giving the pipe
problem.

TIA

Chris










_
Tired of spam? Get advanced junk mail protection with MSN 8.
http://join.msn.com/?page=features/junkmail