Two simple TCPIP / FTPSERVE questions.

2011-07-06 Thread Colin Allinson
This is very basic and simple but I can't seem to see the answer anywhere 
(although it is probably staring me in face).

Once TCPIP has decided that it has tried to restart a server (FTPSERVE) 
the maximum number of time and abandons further attempts - how can I reset 
this once the problem has been resolved?

We rely very heavily on incoming FTP but we have found that FTPSERVE does 
not like receiving files onto a corrupt disk and sulks heavily (Sorting 
the cause of the corruption is a separate problem and not as simple as it 
first appears). Is there any easy way (or a tool) to monitor that FTPSERVE 
is currently functional so that remedial action can be taken automatically 
(in the middle of the night) if it is not?



Colin Allinson
VM Systems Support
Amadeus Data Processing GmbH




IMPORTANT  -  CONFIDENTIALITY  NOTICE  - This e-mail is intended only for 
the use of the individual or entity shown above as addressees. It may 
contain information which is privileged, confidential or otherwise 
protected from disclosure under applicable laws.  If the reader of this 
transmission is not the intended recipient, you are hereby notified that 
any dissemination, printing, distribution, copying, disclosure or the 
taking of any action in reliance on the contents of this information is 
strictly prohibited.  If you have received this transmission in error, 
please immediately notify us by reply e-mail or using the address below 
and delete the message and any attachments from your system. 

Amadeus Data Processing GmbH 
Geschäftsführer: Eberhard Haag 
Sitz der Gesellschaft: Erding 
HR München 48 199 
Berghamer Strasse 6 
85435 Erding 
Germany

Re: Two simple TCPIP / FTPSERVE questions.

2011-07-06 Thread Alan Altmark
On Wednesday, 07/06/2011 at 04:25 EDT, Colin Allinson 
cgallin...@amadeus.com wrote:
 This is very basic and simple but I can't seem to see the answer 
anywhere 
 (although it is probably staring me in face). 
 
 Once TCPIP has decided that it has tried to restart a server (FTPSERVE) 
the 
 maximum number of time and abandons further attempts - how can I reset 
this 
 once the problem has been resolved? 

From Usage Note #1 for MaxRestart:
 If a user is restarted MAXRESTART times, the TCP/IP server will send a
 message to the INFORM list and stop restarting the user. After that, you 
can tell
 the TCP/IP server to attempt to restart the user another MAXRESTART times 
by
 dynamically specifying the user's PORT statement without the NOAUTOLOG
 operand.

So you can use OBEYFILE or NETSTAT OBEY.

 We rely very heavily on incoming FTP but we have found that FTPSERVE 
does not 
 like receiving files onto a corrupt disk and sulks heavily (Sorting the 
cause 
 of the corruption is a separate problem and not as simple as it first 
appears). 
 Is there any easy way (or a tool) to monitor that FTPSERVE is currently 
 functional so that remedial action can be taken automatically (in the 
middle of 
 the night) if it is not?

You could watch FTPSERVE's console for unexpected ending conditions.   You 
may also wish to use FTP 127.0.0.1 (loopback) with the program stack and 
the EXIT parameter to see if FTP is up.  Or just establish a TCP 
connection to port 21, look for the welcome banner, then issue QUIT.

Alan Altmark

Senior Managing z/VM and Linux Consultant
IBM System Lab Services and Training 
ibm.com/systems/services/labservices 
office: 607.429.3323
mobile; 607.321.7556
alan_altm...@us.ibm.com
IBM Endicott


Re: Two simple TCPIP / FTPSERVE questions.

2011-07-06 Thread David Boyes
Very simple way: set up a small Linux instance and install Nagios on it. 
Configure a FTP probe in Nagios, and configure a notification to a user on the 
VM system. The Nagios system will test the FTP server by connecting and 
attempting to transfer a small file periodically. If it fails, it sends a 
notification email.  Handle the email with PROP.

That approach tests not only whether the server is logged in, but whether it's 
actually functioning. Works well for lots of things, and is low-cost (no cost 
if you run Debian or Fedora for Z).



Re: Two simple TCPIP / FTPSERVE questions.

2011-07-06 Thread Colin Allinson
Alan,

Thanks - that is great. I can work from that.


Colin Allinson
VM Systems Support

Amadeus Data Processing GmbH



Re: Two simple TCPIP / FTPSERVE questions.

2011-07-06 Thread Colin Allinson
David Boyes dbo...@sinenomine.net wrote:-

 Very simple way: set up a small Linux instance and install Nagios on it. 
Configure a FTP probe in Nagios, and configure a notification to a user on 
the VM
 system. The Nagios system will test the FTP server by connecting and 
attempting to transfer a small file periodically. If it fails, it sends a 
notification 
 email.  Handle the email with PROP.
 
 That approach tests not only whether the server is logged in, but 
whether it?s actually functioning. Works well for lots of things, and is 
low-cost 
 (no cost if you run Debian or Fedora for Z). 


David, I am afraid we are in lock down here (essential maintenance only) 
so no chance of installing a new LINUX server. However, I could use your 
idea from an existing hartbeat server between VM systems.



Colin Allinson
VM Systems Support
Amadeus Data Processing GmbH



Re: Two simple TCPIP / FTPSERVE questions.

2011-07-06 Thread Rob van der Heij
On Wed, Jul 6, 2011 at 4:35 PM, David Boyes dbo...@sinenomine.net wrote:

 That approach tests not only whether the server is logged in, but whether 
 it’s actually functioning. Works well for lots of things, and is low-cost (no 
 cost if you run Debian or Fedora for Z).

Depending on how the FTP server fails, you might also see it in your
performance monitor...

With anything that monitors a service by probing, the issue is only
noticed after some time. Once you increase the polling to detect it
quick enough that you still have time to fix it within SLA, the
process uses a lot of resources (in a former life we tested e-mail
delivery like that until 50% of our traffic was probes and 90% of the
outages were caused by them...)

Using SCIF on the FTPSERVE userid, PROP can watch the console and
notice the outage immediately. CMS gives you easy tools to arrange the
restart of the server.

Running a Linux virtual machine with agents just to poll the FTP
server is only no cost when you have too much resources and spare
time.

| Rob


Re: Two simple TCPIP / FTPSERVE questions.

2011-07-06 Thread David Boyes
 Depending on how the FTP server fails, you might also see it in your
 performance monitor...

Also true. OTOH, there are failure modes (such as the one Colin mentioned about 
getting unhappy with a minidisk) that won't show up in the console log or will 
show misleading symptoms (large buffer exhaustion, for example). 

 Running a Linux virtual machine with agents just to poll the FTP
 server is only no cost when you have too much resources and spare
 time.

So run it on an external system, where the cost on Z is periodically another 
client session. Either way, you get a end-user perspective of the behavior you 
want to test rather than inferring it from secondary sources.

There are also likely to be other services that need similar testing, which 
could benefit from a similar approach, and there is substantial existing code 
that can do the job without writing a bunch of stuff from scratch. It takes a 
lot less time to configure an existing tool than to gather the information to 
second-guess what is happening from the console. 


Re: Two simple TCPIP / FTPSERVE questions.

2011-07-06 Thread David Boyes

David, I am afraid we are in lock down here (essential maintenance only) so no 
chance of installing a new LINUX server. However, I could use your idea from an 
existing hartbeat server between VM systems.

World work, I'd think. That'd also catch the socket timeout delay problem 
if/when you have more than a few thousand connections waiting for the TCP close 
processing to complete.


Re: Two simple TCPIP / FTPSERVE questions.

2011-07-06 Thread Michael Harding
You've been wearing your Linux appliance hat too long.  Much lighter-weight
would be a few lines of Rexx with or without Romney's FTP package, running
periodically as a task in your automation solution or a standalone (CMS)
VSM.

--
Mike Harding
z/VM System Support

mhard...@us.ibm.com
mike.b.hard...@kp.org
mikehard...@mindless.com
(925) 926-3179 (w)
(925) 323-2070 (c)
IM: VMBearDad (AIM),  mbhcpcvt (Y!)


The IBM z/VM Operating System IBMVM@LISTSERV.UARK.EDU wrote on 07/06/2011
07:35:19 AM:

 From: David Boyes dbo...@sinenomine.net
 To: IBMVM@LISTSERV.UARK.EDU
 Date: 07/06/2011 07:35 AM
 Subject: Re: Two simple TCPIP / FTPSERVE questions.
 Sent by: The IBM z/VM Operating System IBMVM@LISTSERV.UARK.EDU

 Very simple way: set up a small Linux instance and install Nagios on
 it. Configure a FTP probe in Nagios, and configure a notification to
 a user on the VM system. The Nagios system will test the FTP server
 by connecting and attempting to transfer a small file periodically.
 If it fails, it sends a notification email.  Handle the email with PROP.

 That approach tests not only whether the server is logged in, but
 whether it’s actually functioning. Works well for lots of things,
 and is low-cost (no cost if you run Debian or Fedora for Z).


Re: Two simple TCPIP / FTPSERVE questions.

2011-07-06 Thread David Boyes


been wearing your Linux appliance hat too long. Much lighter-weight would be a 
few lines of Rexx with or without Romney's FTP package, running periodically as 
a task in your automation solution or a standalone (CMS) VSM.

Perhaps. OTOH, up and usefully running in less than 10 minutes with no 
development time has it’s appeal. But, I guess I’m getting old enough that 
rolling my own isn’t as much fun any more. I suggested A solution, not THE 
solution. Lots of ways to boil this lobster.

-- db