Re: VARY too many devices offline

2007-10-28 Thread Rick Fochtman


Q:  What would you do with dishonest consultants?

A:  The same thing you do with dishonest interns, trainees, employees, 
managers, CEOs, Chairmen of the Board, etc. And it depends on who "you" 
is. A "you" with sufficient authority must be found to take the needed 
action and who can take into consideration all the repercussions of his 
action, such as the effect on morale if he does or does not do what many 
think he should. Consultants do not have a monopoly on dishonesty or 
making mistakes. Nor do employees have a monopoly on company loyalty. 
One size still does not fit all.

---
100% agreement, as far as it goes. Complete honesty and integrity have 
to be the top priority, but not the only one. I maintain that 
discrimination on the basis of ability is still an acceptable practice, 
provided methods of improvement are available. An operator who makes a 
mistake because of lack of education should have an oppurtunity to 
learn, and should be better supervised. An operator who willingly 
diverts funds, or other resources, to his private use and profit, to the 
company's detriment, should be terminated and, if possible and feasable, 
prosecuted. Every employee has an obligation to safeguard the 
stockholders from unethical and/or illegal activities that cause losses, 
either financial or in the public image of the company (resulting in 
indirect losses).


As systems programmers, regardless of the actual title, we hold 
positions of high trust and, as such, we need to set high examples. Like 
Caesar's wife, Calpurnia, we have to not only BE pure, but also be 
PERCEIVED as pure. And we need to set an example for the PFCSK's that 
will follow us!


--
For IBM-MAIN subscribe / signoff / archive access instructions,
send email to [EMAIL PROTECTED] with the message: GET IBM-MAIN INFO
Search the archives at http://bama.ua.edu/archives/ibm-main.html


Re: VARY too many devices offline

2007-10-28 Thread Ed Gould

On Oct 28, 2007, at 8:07 AM, (IBM Mainframe Discussion List) wrote:


In a message dated 10/25/2007 1:15:00 PM Central Standard Time,
[EMAIL PROTECTED] writes:


What would you do with dishonest consultants?


The same thing you do with dishonest interns, trainees, employees,  
managers,
CEOs, Chairmen of the Board, etc.  And it depends on who "you" is.   
A "you"
with sufficient authority must be found to take the needed action  
and who can
take into consideration all the repercussions of his action, such  
as the effect
on morale if he does or does not do what many think he should.   
Consultants do
not have a monopoly on dishonesty or making mistakes.  Nor do  
employees have

a monopoly on company loyalty.  One size still does not fit all.

Bill Fairchild
Franklin, TN



Bill,

Point taken. Since the company I worked for was non-confrontational  
the whole incident  was more or less swept under the carpet. Except  
for the additional person that was hired to keep any future  
trespassing from happening nothing really happened. It was a strange  
company as employees were extremely loyal and honest. There was no  
money handling (that I ever heard of) (except for petty cash) that  
was done at the site so honesty was really never an issue. They were  
strict when it came to expense accounts as one time I got called up  
and about a dinner I had at GUIDE and I had to redo parts of the  
expense report. I just reshuffled the distribution around. They  
grumbled a little bit. But it wasn't like I was extravagant I think I  
marked down I had a 26 dollar dinner (I was allowed $20). (this was  
in the 70's) . The computer operations was the hot spot about firing  
people. The manager there was the "man". He ran his "ship" like a  
captain in the 1600's, he let his managers get away with murder but  
the peons (operators) were regularly  whipped. Despite that the  
operators were extremely loyal and really did work.


The company (division) was pretty much run on a almost family type  
basis. The operations being the exception to the rule. I was not  
aware of the severe politics of the corporate headquarters until I  
was temporarily assigned there a few years later. I was asked to stay  
on but said no because of the politics. I am sure of this had  
occurred out of the corporate HQ that they would have been fired on  
the spot. We had a few political people that worked in the DC that  
made major mistakes that almost cost the company millions of dollars  
and they skated through without being fired (although in truth they  
were put in positions of less importance).


The point to this was that people that were not employed by the  
company (consultants) were held to a different level than employees,  
it was a much more restrictive level. Just by that level alone they  
should have been let go.


Ed

--
For IBM-MAIN subscribe / signoff / archive access instructions,
send email to [EMAIL PROTECTED] with the message: GET IBM-MAIN INFO
Search the archives at http://bama.ua.edu/archives/ibm-main.html


Re: VARY too many devices offline

2007-10-28 Thread (IBM Mainframe Discussion List)
In a message dated 10/25/2007 1:15:00 PM Central Standard Time, 
[EMAIL PROTECTED] writes:

> What would you do with dishonest consultants?

The same thing you do with dishonest interns, trainees, employees, managers, 
CEOs, Chairmen of the Board, etc.  And it depends on who "you" is.  A "you" 
with sufficient authority must be found to take the needed action and who can 
take into consideration all the repercussions of his action, such as the effect 
on morale if he does or does not do what many think he should.  Consultants do 
not have a monopoly on dishonesty or making mistakes.  Nor do employees have 
a monopoly on company loyalty.  One size still does not fit all.

Bill Fairchild
Franklin, TN


**
 See what's new at 
http://www.aol.com

--
For IBM-MAIN subscribe / signoff / archive access instructions,
send email to [EMAIL PROTECTED] with the message: GET IBM-MAIN INFO
Search the archives at http://bama.ua.edu/archives/ibm-main.html


Re: VARY too many devices offline

2007-10-26 Thread R.S.

1. Everyone can make a mistake, including those who claim otherwise.

2. Firing someone for an error is not good solution, at least as a rule. 
Education is better than terror. 

IMHO it is much better to avoid mistakes. How to do it:
2.1 Education. Skilled operator *understands* command, syntax, which 
reduces risk of some mistakes. Not applicable to Hal's example.
2.2 Procedures. Simply *avoid* risky commands. Do you want o vary some 
devices offline ? Why ? Can it be *scripted* ? Maybe this is well known, 
fixed range of devices.


My operators do make mistakes. However they have never blew up any 
system. Never. I didn't give them an opportunity (chance) to do it.
Everytime I hear about operator who blew up something, he did it while 
performing dangerous things. In Poland we say "ape with razor" or "you 
shouldn't give gun to a madman". If the action is dangerous, then you 
shouldn't rely on less educated staff.


My $0.02 (after 40% taxes and 23% social sec).
--
Radoslaw Skorupka
Lodz, Poland



Hal Merritt wrote:

Personally, I'd view this as a management issue, not a technical issue.
The root problem in my opinion was the need for an operator to issue the
command in the first place. When you require human intervention, errors
are to be expected and therefore tolerated. 


There are a number of such commands where a single keystroke error can
and will bring you down. For example, try the command "$P PRINTER1", but
hit enter rather than the space bar. (That is recoverable, but it is
unlikely anyone will discover that until after the IPL.)

So, yes, reassign the operators (all of them) and don't replace them. 


My $0.02 (before taxes)

-Original Message-
From: IBM Mainframe Discussion List [mailto:[EMAIL PROTECTED] On
Behalf Of Jan MOEYERSONS
Sent: Monday, October 22, 2007 7:05 AM
To: IBM-MAIN@BAMA.UA.EDU
Subject: Re: VARY too many devices offline


Not being to tongue in cheek here.. but we used to fire the operator.

Ed



And what good does that do to the integrity of your systems???

Does that prevent anyone else from making a mistake? 


Did you never make a typo in any of the commands you ever entered?  (If
you
never did, then that means you never entered any...)

Jantje.




--
BRE Bank SA
ul. Senatorska 18
00-950 Warszawa
www.brebank.pl

Sd Rejonowy dla m. st. Warszawy 
XII Wydzia Gospodarczy Krajowego Rejestru Sdowego, 
nr rejestru przedsibiorców KRS 025237

NIP: 526-021-50-88
Wedug stanu na dzie 01.01.2007 r. kapita zakadowy BRE Banku SA (w caoci 
opacony) wynosi 118.064.140 z. W zwizku z realizacj warunkowego 
podwyszenia kapitau zakadowego, na podstawie uchwa XVI WZ z dnia 21.05.2003 
r., kapita zakadowy BRE Banku SA moe ulec podwyszeniu do kwoty 118.760.528 
z. Akcje w podwyszonym kapitale zakadowym bd w caoci opacone.

--
For IBM-MAIN subscribe / signoff / archive access instructions,
send email to [EMAIL PROTECTED] with the message: GET IBM-MAIN INFO
Search the archives at http://bama.ua.edu/archives/ibm-main.html


Re: VARY too many devices offline

2007-10-25 Thread Ed Gould

On Oct 25, 2007, at 7:13 PM, Tony wrote:


Lots of people had there 2cents worth so I might as well give mine.

A max units offline in SYSPARMS would kill this forever and surely  
take

10/20 minutes to code.

Does anyone ever ask IBM to do anything these days or do we just  
sit back

and take what we are given?


Tony,

Way back when 3.8  (?) there was a limit (IIRC) and people asked for  
it to be lifted. Its been AGES so I could be wrong but I am pretty  
sure there was a limit at one time.


Ed

--
For IBM-MAIN subscribe / signoff / archive access instructions,
send email to [EMAIL PROTECTED] with the message: GET IBM-MAIN INFO
Search the archives at http://bama.ua.edu/archives/ibm-main.html


Re: VARY too many devices offline

2007-10-25 Thread Ed Gould

On Oct 25, 2007, at 6:29 PM, Gerhard Adam wrote:

Where do these scenarios get dreamed up?  If you have a services  
issue, then
you can always fire someone if they aren't living up to the terms  
(or your
perception) of an agreement.  If you're wrong, then a lawsuit by  
the other

party may help clear things up.

If you suspect something criminal, by all means you can see if a  
District
Attorney is willing to go for it, otherwise you can pursue a civil  
suit.


The rest is simply nonsense.  Firing someone for a mistake is silly.
Despite the melodramatic points raised, even when someone is  
KILLED, the
individual responsible may be liable, but unless negligence or  
criminal
intent can be proven its highly unlikely that they will be fired.   
In fact,
from the legal perspective the individual making the mistake may  
not even be

considered responsible if they can prove that they weren't adequately
trained for the responsibility given to them.

Adam



Adam:

I am not sure I agree with you (all the time). Most companies that I  
have seen have fired individuals for mistakes. It comes down, IMO the  
seriousness of the mistake. In a typical data center it is unlikely  
that it would cause a death but in a Hospital setting it is possible  
for that to happen. In the other extreme I have seen people save  
their jobs because of stupidity . A manager deleted dd statements out  
of a proc because not enough tape drives weren't available, if that  
wasn't bad enough it was a permanent change so backups of system  
volumes stopped happening for over 2 years before it was discovered  
that backups weren't being taken. Causing all sorts of chaos when the  
drive(s) went south. He blamed it on operations for not noticing it  
and got away with it.


As to liability of an operator I have never heard of an operator  
being sued, maybe his/her boss but not an operator. After all  
corporate pockets are a lot deeper than a typical operators. What is  
the saying, you can't get blood from a turn-up. It may have happened  
but it would have to be rare. Besides I would think that management  
would rather have an operator as a scapegoat after all operators are  
pretty much a dime a dozen. I would think that a person testing blood  
would not have personal liability insurance but the company he/she  
works for would have liability insurance. After all you go after the  
entity with the deep pockets.


Ed

--
For IBM-MAIN subscribe / signoff / archive access instructions,
send email to [EMAIL PROTECTED] with the message: GET IBM-MAIN INFO
Search the archives at http://bama.ua.edu/archives/ibm-main.html


Re: VARY too many devices offline

2007-10-25 Thread Tony
Lots of people had there 2cents worth so I might as well give mine.

A max units offline in SYSPARMS would kill this forever and surely take
10/20 minutes to code.

Does anyone ever ask IBM to do anything these days or do we just sit back
and take what we are given?

-Original Message-
From: IBM Mainframe Discussion List [mailto:[EMAIL PROTECTED] Behalf
Of Field, Alan C.
Sent: 19 October 2007 15:14
To: IBM-MAIN@BAMA.UA.EDU
Subject: VARY too many devices offline


Yesterday someone issued an RO *ALL,V  708-7014,OFFLINE which led to a
number of unplanned IPLs. 

 

Now mgmt wants to implement a fingerchecker. 

 

I searched CBT for a command exit that might ask something like "you
really want to vary 26,000 devices offline?" 

 

How do others shops handle this, have you got an exit you'd be willing
to share before we re-invent the wheel.

 

Thanks,

 

Alan 


--
For IBM-MAIN subscribe / signoff / archive access instructions,
send email to [EMAIL PROTECTED] with the message: GET IBM-MAIN INFO
Search the archives at http://bama.ua.edu/archives/ibm-main.html
No virus found in this incoming message.
Checked by AVG Free Edition. 
Version: 7.5.488 / Virus Database: 269.15.0/1076 - Release Date: 17/10/2007
19:53

No virus found in this outgoing message.
Checked by AVG Free Edition. 
Version: 7.5.503 / Virus Database: 269.15.10/1091 - Release Date: 24/10/2007
14:31
 

--
For IBM-MAIN subscribe / signoff / archive access instructions,
send email to [EMAIL PROTECTED] with the message: GET IBM-MAIN INFO
Search the archives at http://bama.ua.edu/archives/ibm-main.html


Re: VARY too many devices offline

2007-10-25 Thread Ed Gould

On Oct 25, 2007, at 4:29 PM, Ted MacNEIL wrote:


I was getting fed up with Ed Gould's constant talk of firing people.


Ed has been retired for so long that any opinion he has is not  
worth listening to.


From STAR TREK ("Friday's Child"):

His words are unimportant, and we do not hear them!



Ted:

If you too busy to stop for gas you are going to run out of oil:)

Ed

--
For IBM-MAIN subscribe / signoff / archive access instructions,
send email to [EMAIL PROTECTED] with the message: GET IBM-MAIN INFO
Search the archives at http://bama.ua.edu/archives/ibm-main.html


Re: VARY too many devices offline

2007-10-25 Thread Ed Gould

On Oct 25, 2007, at 4:21 PM, Eric Bielefeld wrote:

I guess I saw the part about turning SMF recording off, but it  
didn't register.  Yes, if they were stealing services or committing  
some act of fraud, maybe they should get fired.  I used the quote  
below more because I was getting fed up with Ed Gould's constant  
talk of firing people.  I know I have caused a few IPLs in my  
career.  I don't think people should be fired for one or two honest  
mistakes.  If someone makes mistakes over and over, thats one  
thing, but making one mistake that causes an outage shouldn't be  
grounds for firing.



Stealing *IS* grounds for firing or do you want to have dishonest  
people working at you bank robbing you blind all the time .. Once is OK?


Ed

--
For IBM-MAIN subscribe / signoff / archive access instructions,
send email to [EMAIL PROTECTED] with the message: GET IBM-MAIN INFO
Search the archives at http://bama.ua.edu/archives/ibm-main.html


Re: VARY too many devices offline

2007-10-25 Thread Gerhard Adam
Where do these scenarios get dreamed up?  If you have a services issue, then
you can always fire someone if they aren't living up to the terms (or your
perception) of an agreement.  If you're wrong, then a lawsuit by the other
party may help clear things up.

If you suspect something criminal, by all means you can see if a District
Attorney is willing to go for it, otherwise you can pursue a civil suit.

The rest is simply nonsense.  Firing someone for a mistake is silly.
Despite the melodramatic points raised, even when someone is KILLED, the
individual responsible may be liable, but unless negligence or criminal
intent can be proven its highly unlikely that they will be fired.  In fact,
from the legal perspective the individual making the mistake may not even be
considered responsible if they can prove that they weren't adequately
trained for the responsibility given to them.

Adam


  Yes, if they were stealing services or committing some act of 
fraud, maybe they should get fired.  

> A clue was given when it was said that SMF had been turned off. Now, if
> charge-back accounting was being done AND these consultants were in some
> way being charged for CPU time (and/or other resources) that they used
> then this was a form of theft or embezzlement/fraud.
>
> That being the case: depending on what corporate counsel advised, I
> would present them with a bill for 1.5 times their average use on a
> weekend, point out specific points in any contract that shows or states
> this is a material breach.

--
For IBM-MAIN subscribe / signoff / archive access instructions,
send email to [EMAIL PROTECTED] with the message: GET IBM-MAIN INFO
Search the archives at http://bama.ua.edu/archives/ibm-main.html


Re: VARY too many devices offline

2007-10-25 Thread Ed Gould

On Oct 25, 2007, at 5:52 PM, Rick Fochtman wrote:


What would you do with dishonest consultants?

I'd be taking one really HARD LOOK at the selection process! Let's  
treat the whole problem, not just the most obvious  symptom!


Rick,

The consultants came out of the departments pocket. Their budget was  
entirely outside of the DC budget. We had little to say (read no say)  
about how they spent their $$.


Ed


--
For IBM-MAIN subscribe / signoff / archive access instructions,
send email to [EMAIL PROTECTED] with the message: GET IBM-MAIN INFO
Search the archives at http://bama.ua.edu/archives/ibm-main.html


--
For IBM-MAIN subscribe / signoff / archive access instructions,
send email to [EMAIL PROTECTED] with the message: GET IBM-MAIN INFO
Search the archives at http://bama.ua.edu/archives/ibm-main.html


Re: VARY too many devices offline

2007-10-25 Thread Campbell Jay
A rogue oprerative may sneak in...
I think they have a few times...
But we will definitely find that person.

And... As an added bonus...
Whatever they thought they may have destroyed...
It's all recoverable.

SUPER DISCLAIMER - This is my own perception
Not to be affiliated in any way with my company. 

-Original Message-
From: IBM Mainframe Discussion List [mailto:[EMAIL PROTECTED] On
Behalf Of Rick Fochtman
Sent: Thursday, October 25, 2007 6:53 PM
To: IBM-MAIN@BAMA.UA.EDU
Subject: Re: VARY too many devices offline

What would you do with dishonest consultants?

I'd be taking one really HARD LOOK at the selection process! Let's treat
the whole problem, not just the most obvious  symptom!

--
For IBM-MAIN subscribe / signoff / archive access instructions, send
email to [EMAIL PROTECTED] with the message: GET IBM-MAIN INFO Search
the archives at http://bama.ua.edu/archives/ibm-main.html

--
For IBM-MAIN subscribe / signoff / archive access instructions,
send email to [EMAIL PROTECTED] with the message: GET IBM-MAIN INFO
Search the archives at http://bama.ua.edu/archives/ibm-main.html


Re: VARY too many devices offline

2007-10-25 Thread Rick Fochtman

What would you do with dishonest consultants?

I'd be taking one really HARD LOOK at the selection process! Let's treat 
the whole problem, not just the most obvious  symptom!


--
For IBM-MAIN subscribe / signoff / archive access instructions,
send email to [EMAIL PROTECTED] with the message: GET IBM-MAIN INFO
Search the archives at http://bama.ua.edu/archives/ibm-main.html


Re: VARY too many devices offline

2007-10-25 Thread Ted MacNEIL
>I was getting fed up with Ed Gould's constant talk of firing people.

Ed has been retired for so long that any opinion he has is not worth listening 
to.

>From STAR TREK ("Friday's Child"):

His words are unimportant, and we do not hear them!

-
Too busy driving to stop for gas!

--
For IBM-MAIN subscribe / signoff / archive access instructions,
send email to [EMAIL PROTECTED] with the message: GET IBM-MAIN INFO
Search the archives at http://bama.ua.edu/archives/ibm-main.html


Re: VARY too many devices offline

2007-10-25 Thread Eric Bielefeld
I guess I saw the part about turning SMF recording off, but it didn't 
register.  Yes, if they were stealing services or committing some act of 
fraud, maybe they should get fired.  I used the quote below more because I 
was getting fed up with Ed Gould's constant talk of firing people.  I know I 
have caused a few IPLs in my career.  I don't think people should be fired 
for one or two honest mistakes.  If someone makes mistakes over and over, 
thats one thing, but making one mistake that causes an outage shouldn't be 
grounds for firing.


Eric Bielefeld
Sr. z/OS Systems Programmer
Milwaukee, Wisconsin
414-475-7434

- Original Message - 
From: "Thompson, Steve" <[EMAIL PROTECTED]>

Eric:

What would you do with dishonest consultants?



A clue was given when it was said that SMF had been turned off. Now, if
charge-back accounting was being done AND these consultants were in some
way being charged for CPU time (and/or other resources) that they used
then this was a form of theft or embezzlement/fraud.

That being the case: depending on what corporate counsel advised, I
would present them with a bill for 1.5 times their average use on a
weekend, point out specific points in any contract that shows or states
this is a material breach. It would also be pointed out, if the contract
does not preclude it, that this behavior crossed the line of GRAND
THEFT. And then tell them we would entertain a payment schedule to heal
any breach that this behavior caused.

But the situation would be well documented and would definitely be part
of any future negotiations.

Regards,
Steve Thompson


--
For IBM-MAIN subscribe / signoff / archive access instructions,
send email to [EMAIL PROTECTED] with the message: GET IBM-MAIN INFO
Search the archives at http://bama.ua.edu/archives/ibm-main.html


Re: VARY too many devices offline

2007-10-25 Thread Thompson, Steve
-Original Message-
From: IBM Mainframe Discussion List [mailto:[EMAIL PROTECTED] On
Behalf Of Ed Gould
Sent: Thursday, October 25, 2007 2:13 PM
To: IBM-MAIN@BAMA.UA.EDU
Subject: Re: VARY too many devices offline



Eric:

What would you do with dishonest consultants?



A clue was given when it was said that SMF had been turned off. Now, if
charge-back accounting was being done AND these consultants were in some
way being charged for CPU time (and/or other resources) that they used
then this was a form of theft or embezzlement/fraud. 

That being the case: depending on what corporate counsel advised, I
would present them with a bill for 1.5 times their average use on a
weekend, point out specific points in any contract that shows or states
this is a material breach. It would also be pointed out, if the contract
does not preclude it, that this behavior crossed the line of GRAND
THEFT. And then tell them we would entertain a payment schedule to heal
any breach that this behavior caused.

But the situation would be well documented and would definitely be part
of any future negotiations.

Regards,
Steve Thompson

-- Opinions expressed are strictly my own --

--
For IBM-MAIN subscribe / signoff / archive access instructions,
send email to [EMAIL PROTECTED] with the message: GET IBM-MAIN INFO
Search the archives at http://bama.ua.edu/archives/ibm-main.html


Re: VARY too many devices offline

2007-10-25 Thread Ed Gould

On Oct 25, 2007, at 1:50 PM, Eric Bielefeld wrote:

Ed - You sure have a thing about firing people.  Its a good thing  
you're not employed now, as you can't fire anyone.  Lets look at  
some of the costs of firing a bunch of consultants.  Lets just say  
there were 5 consultants who did what you described.  Say the  
average time they have been working was 6 months.  By now they know  
the projects they are working on, and how your company does  
things.  You would have your management throw out all the training  
and knowledge in your systems these people had, and more than  
likely waste most of the work they already did.  Firing people  or  
consultants for mistakes can be very expensive.


Eric Bielefeld
Sr. z/OS Systems Programmer
Milwaukee, Wisconsin
414-475-7434


Eric:

What would you do with dishonest consultants?

Ed

--
For IBM-MAIN subscribe / signoff / archive access instructions,
send email to [EMAIL PROTECTED] with the message: GET IBM-MAIN INFO
Search the archives at http://bama.ua.edu/archives/ibm-main.html


Re: VARY too many devices offline

2007-10-25 Thread Eric Bielefeld
Ed - You sure have a thing about firing people.  Its a good thing you're not 
employed now, as you can't fire anyone.  Lets look at some of the costs of 
firing a bunch of consultants.  Lets just say there were 5 consultants who 
did what you described.  Say the average time they have been working was 6 
months.  By now they know the projects they are working on, and how your 
company does things.  You would have your management throw out all the 
training and knowledge in your systems these people had, and more than 
likely waste most of the work they already did.  Firing people  or 
consultants for mistakes can be very expensive.


Eric Bielefeld
Sr. z/OS Systems Programmer
Milwaukee, Wisconsin
414-475-7434

- Original Message - 
From: "Ed Gould" <[EMAIL PROTECTED]>

They were not happy at all. I just wish the consultants had been fired.

Ed


--
For IBM-MAIN subscribe / signoff / archive access instructions,
send email to [EMAIL PROTECTED] with the message: GET IBM-MAIN INFO
Search the archives at http://bama.ua.edu/archives/ibm-main.html


Re: VARY too many devices offline

2007-10-25 Thread Edward Jaffe

David Long wrote:
In 1989 I was working in a shop where the operator accidentally entered the 
ipl date as yy/mm/98 instead of yy/mm/89.  This was not noticed until all the 
jobs that read tapes started failing because the tape datasets had expired.


I ended up writing a little program to make the operator verify that the date 
was correct before the ipl could procede.
  


But, the operator that made the mistake was fired on the spot. Right? ;-)

--
Edward E Jaffe
Phoenix Software International, Inc
5200 W Century Blvd, Suite 800
Los Angeles, CA 90045
310-338-0400 x318
[EMAIL PROTECTED]
http://www.phoenixsoftware.com/

--
For IBM-MAIN subscribe / signoff / archive access instructions,
send email to [EMAIL PROTECTED] with the message: GET IBM-MAIN INFO
Search the archives at http://bama.ua.edu/archives/ibm-main.html


Re: VARY too many devices offline

2007-10-25 Thread Hal Merritt
Personally, I'd view this as a management issue, not a technical issue.
The root problem in my opinion was the need for an operator to issue the
command in the first place. When you require human intervention, errors
are to be expected and therefore tolerated. 

There are a number of such commands where a single keystroke error can
and will bring you down. For example, try the command "$P PRINTER1", but
hit enter rather than the space bar. (That is recoverable, but it is
unlikely anyone will discover that until after the IPL.)

So, yes, reassign the operators (all of them) and don't replace them. 

My $0.02 (before taxes)

-Original Message-
From: IBM Mainframe Discussion List [mailto:[EMAIL PROTECTED] On
Behalf Of Jan MOEYERSONS
Sent: Monday, October 22, 2007 7:05 AM
To: IBM-MAIN@BAMA.UA.EDU
Subject: Re: VARY too many devices offline

>
>Not being to tongue in cheek here.. but we used to fire the operator.
>
>Ed
>

And what good does that do to the integrity of your systems???

Does that prevent anyone else from making a mistake? 

Did you never make a typo in any of the commands you ever entered?  (If
you
never did, then that means you never entered any...)

Jantje.

 
NOTICE: This electronic mail message and any files transmitted with it are 
intended
exclusively for the individual or entity to which it is addressed. The message, 
together with any attachment, may contain confidential and/or privileged 
information.
Any unauthorized review, use, printing, saving, copying, disclosure or 
distribution 
is strictly prohibited. If you have received this message in error, please 
immediately advise the sender by reply email and delete all copies.

--
For IBM-MAIN subscribe / signoff / archive access instructions,
send email to [EMAIL PROTECTED] with the message: GET IBM-MAIN INFO
Search the archives at http://bama.ua.edu/archives/ibm-main.html


Re: VARY too many devices offline

2007-10-25 Thread Ed Gould

On Oct 25, 2007, at 9:19 AM, David Long wrote:


On Tue, 23 Oct 2007 18:29:02 -0500, Ed Gould
<[EMAIL PROTECTED]> wrote:


I wish the person this had happened to would pipe up, but to set the
record more precisely because of a bad date RACF (This is hear say)
did something to the (RACF) database that essentially rendered the
system not operatrional. Just by IPLing (again) with the correct date
was too late as the RACF database unusable. I do not know if they had
backups or any specifics. I heard they were down for a day or so.
Luckily this was a weekend.

Ed


Ed,
In 1989 I was working in a shop where the operator accidentally  
entered the
ipl date as yy/mm/98 instead of yy/mm/89.  This was not noticed  
until all the
jobs that read tapes started failing because the tape datasets had  
expired.


I ended up writing a little program to make the operator verify  
that the date

was correct before the ipl could procede.

Dave Long




Dave,

Congratulations.  At one place where I worked. We had to have two  
operators sign off on the date/time at IPL time. However one Sunday  
when I was testing they sill managed to flub it. It didn't matter as  
it was a IPL we were doing to test out an IPO.


Speaking of "operator error", this was back 30 years or so.  We had a  
data center that worked most weekends but not all. We had a  
consulting company come in and IPL the system (on their own with no  
operations people around). They IPLed with a wrong date (on purpose)  
and turned off SMF and a few other things. That Monday morning I was  
doing some research and ran across the IPL on a console that was out  
of view  of the system console. I verified with the operations people  
that no one had worked and also within our sysprog group about IPL's  
and it was all denied. I took the console printout up to the VP of  
the data center and gave it to him and told him of my suspicions .  
The jobs that were run when the system was supposed to be down were  
of a division that had lots of contract programmers. The VP called up  
the division's manager and let him have it. After that the data  
center always had a baby sitter when no one was scheduled to work. He  
charged the other 3 divisions with the labor cost of the baby sitter.  
They were not happy at all. I just wish the consultants had been fired.


Ed

--
For IBM-MAIN subscribe / signoff / archive access instructions,
send email to [EMAIL PROTECTED] with the message: GET IBM-MAIN INFO
Search the archives at http://bama.ua.edu/archives/ibm-main.html


Re: VARY too many devices offline

2007-10-25 Thread David Long
On Tue, 23 Oct 2007 18:29:02 -0500, Ed Gould 
<[EMAIL PROTECTED]> wrote:

>I wish the person this had happened to would pipe up, but to set the
>record more precisely because of a bad date RACF (This is hear say)
>did something to the (RACF) database that essentially rendered the
>system not operatrional. Just by IPLing (again) with the correct date
>was too late as the RACF database unusable. I do not know if they had
>backups or any specifics. I heard they were down for a day or so.
>Luckily this was a weekend.
>
>Ed

Ed,
In 1989 I was working in a shop where the operator accidentally entered the 
ipl date as yy/mm/98 instead of yy/mm/89.  This was not noticed until all the 
jobs that read tapes started failing because the tape datasets had expired.

I ended up writing a little program to make the operator verify that the date 
was correct before the ipl could procede.

Dave Long

--
For IBM-MAIN subscribe / signoff / archive access instructions,
send email to [EMAIL PROTECTED] with the message: GET IBM-MAIN INFO
Search the archives at http://bama.ua.edu/archives/ibm-main.html


Re: VARY too many devices offline

2007-10-24 Thread Rick Fochtman



I had a problem with IPLing I think a year behind the current date.  
This was running under VM.  VM was IPL'd with the wrong date, passing 
the date along to each guest.  We had 5 DOS guests, one MVS guest, and 
one VS1 guest at the time.  This was our first conversion of one of 
the DOS guests to MVS. I don't think there were any problems while 
running a year back, but something led me to check the date, and I 
discovered we had the wrong one. We shut everything down, and reIPL'd 
VM with the correct date, and brought everything back up.  The only 
problem after coming back up was that anyone who had logged on TSO 
while the date was 1 year behind couldn't log on. That was me, and a 
few of the programmers and the operator.


This led to my getting my 2nd TSO id, which I wanted in the beginning, 
but a girl who worked there for a while didn't want anyone to have 2 
TSO ids.  It so happened that she wasn't home, so I kind of sat there 
and couldn't even log on for about an hour, until she finally got home 
and called me.  Then I logged on with her password and ID and reset my 
own ID.  Needless to say, it didn't take much convincing to get a 2nd 
TSO id after that.


--
RACF has the capability of disallowing a user to logon if he's been 
inactive for a certain number of days. We got stung by this at Clearing 
long ago, when even STC wasn't allowed to logon. We installed an exit to 
bypass that difficulty. I believe now that RACF has been altered such 
that started tasks aren't limited in that fashion. We also had a proc 
that could be started from the console to restore certain critical ID's 
so we could recover from a similar disaster in the future. When we first 
got stung, we were down for an entire CBOT trading session and the 
repercussions were VERY MESSY, to say the least.


--
For IBM-MAIN subscribe / signoff / archive access instructions,
send email to [EMAIL PROTECTED] with the message: GET IBM-MAIN INFO
Search the archives at http://bama.ua.edu/archives/ibm-main.html


Re: VARY too many devices offline

2007-10-24 Thread Thompson, Steve
-Original Message-
From: IBM Mainframe Discussion List [mailto:[EMAIL PROTECTED] On
Behalf Of Ed Gould
Sent: Tuesday, October 23, 2007 6:29 PM
To: IBM-MAIN@BAMA.UA.EDU
Subject: Re: VARY too many devices offline

On Oct 23, 2007, at 3:17 AM, Zaromil Tisler wrote:

> On Tue, 23 Oct 2007 08:54:31 +0100, Kenny Fogarty wrote:
>
>> I agree, but, if the wrong date, or IPL parm, or whatever is entered,

>> then the chances are you're going to have to re-IPL to rectify the 
>> situation. As you said above, if RACF doesn't start, you can go back 
>> to see why, and take steps to fix the issue.

I wish the person this had happened to would pipe up, but to set the
record more precisely because of a bad date RACF (This is hear say) did
something to the (RACF) database that essentially rendered the system
not operatrional. Just by IPLing (again) with the correct date was too
late as the RACF database unusable. I do not know if they had backups or
any specifics. I heard they were down for a day or so.  
Luckily this was a weekend.


During Y2K testing, a future date was picked. Since all RACF IDs were
beyond the revoke dates...

So IPL with the correct date left them with having to do an OPERATOR
override and allow a SPECIAL/AUDITOR ID (similar to IBMUSER, or it was
IBMUSER) to logon even though it was revoked. Then it is a long
laborious process to re-activate every revoked account.

It was my understanding that this happened during a required Y2K test at
a Nuclear plant. I'm told it got real interesting when all the access
badges went dead as they hadn't been used for over 2 years as far as the
security system was concerned.

Regards,
Steve Thompson

-- Opinions expressed are my own. --

--
For IBM-MAIN subscribe / signoff / archive access instructions,
send email to [EMAIL PROTECTED] with the message: GET IBM-MAIN INFO
Search the archives at http://bama.ua.edu/archives/ibm-main.html


Re: VARY too many devices offline

2007-10-23 Thread Eric Bielefeld
Tom,

No one died, and no one got fired either.

Eric Bielefeld
Sr. z/OS Systems Programmer
Milwaukee, Wisconsin
414-475-7434

- Original Message -
From: Tom Schmidt <[EMAIL PROTECTED]>

> Eric,
> 
> Nobody died as a result of the operator error, right?  ;)  
> 
> --
> Tom Schmidt 
> Madison, WI> Search the archives at http://bama.ua.edu/archives/ibm-
main.html
>

--
For IBM-MAIN subscribe / signoff / archive access instructions,
send email to [EMAIL PROTECTED] with the message: GET IBM-MAIN INFO
Search the archives at http://bama.ua.edu/archives/ibm-main.html


Re: VARY too many devices offline

2007-10-23 Thread Tom Schmidt
On Tue, 23 Oct 2007 19:48:11 -0500, Eric Bielefeld wrote:

>I had a problem with IPLing I think a year behind the current date.  This
>was running under VM.  VM was IPL'd with the wrong date, passing the date
>along to each guest.  
...snip...
>The only problem after coming back up was that anyone
>who had logged on TSO while the date was 1 year behind couldn't log on.
>That was me, and a few of the programmers and the operator.
 
Eric,
 
Nobody died as a result of the operator error, right?  ;)  
 
--
Tom Schmidt 
Madison, WI

--
For IBM-MAIN subscribe / signoff / archive access instructions,
send email to [EMAIL PROTECTED] with the message: GET IBM-MAIN INFO
Search the archives at http://bama.ua.edu/archives/ibm-main.html


Re: VARY too many devices offline

2007-10-23 Thread Eric Bielefeld
I had a problem with IPLing I think a year behind the current date.  This 
was running under VM.  VM was IPL'd with the wrong date, passing the date 
along to each guest.  We had 5 DOS guests, one MVS guest, and one VS1 guest 
at the time.  This was our first conversion of one of the DOS guests to MVS. 
I don't think there were any problems while running a year back, but 
something led me to check the date, and I discovered we had the wrong one. 
We shut everything down, and reIPL'd VM with the correct date, and brought 
everything back up.  The only problem after coming back up was that anyone 
who had logged on TSO while the date was 1 year behind couldn't log on. 
That was me, and a few of the programmers and the operator.


This led to my getting my 2nd TSO id, which I wanted in the beginning, but a 
girl who worked there for a while didn't want anyone to have 2 TSO ids.  It 
so happened that she wasn't home, so I kind of sat there and couldn't even 
log on for about an hour, until she finally got home and called me.  Then I 
logged on with her password and ID and reset my own ID.  Needless to say, it 
didn't take much convincing to get a 2nd TSO id after that.


Eric Bielefeld
Sr. z/OS Systems Programmer
Milwaukee, Wisconsin
414-475-7434

- Original Message - 
From: "Ed Gould" <[EMAIL PROTECTED]>
I wish the person this had happened to would pipe up, but to set the 
record more precisely because of a bad date RACF (This is hear say)  did 
something to the (RACF) database that essentially rendered the  system not 
operatrional. Just by IPLing (again) with the correct date  was too late 
as the RACF database unusable. I do not know if they had  backups or any 
specifics. I heard they were down for a day or so.  Luckily this was a 
weekend.


Ed


--
For IBM-MAIN subscribe / signoff / archive access instructions,
send email to [EMAIL PROTECTED] with the message: GET IBM-MAIN INFO
Search the archives at http://bama.ua.edu/archives/ibm-main.html


Re: VARY too many devices offline

2007-10-23 Thread Ed Gould

On Oct 23, 2007, at 2:54 AM, Kenny Fogarty wrote:



How would you handle a person that scrutinizes blood for a living and
mistakes a diagnosis ?
In some case an operator is just as "guilty" as the blood analyzer.
If you say thats not the same, I would agree but not in all


I wouldn't even begin to make the analogy. But, mistakes happen.
That's a fact of life, and, putting an unbearable amount of strain on
someone, as in - make a mistake and you're fired, will not, under any
circumstances, help that person to not make mistakes. In fact, I'd go
as far as to say it would only make things worse.


Well, I am not sure I agree with you about "worse". The operators  
usually had a great time (especially off day shift) The day shift  
only had one occurrence after that bad IPL. We (sysprogs) were  
consulted before any first time commands were entered so we were by  
the operators side as it was keyed in. Issues? not really the  
operators enjoyed their jobs and everyone was a little more  
productive. We did have an issue with delays of tape mounts we ended  
up buying hardware to monitor those events . The finger pointing was  
put squarely on the operators. But other than that the operators  
really liked their job (of course there were one or two exceptions)  
and everyone went about their own jobs happily. I think that  
department had the lowest turnover of any.



If an operator put in a wrong date at IPL and (because
of that) RACF refuses to come up and there is no backout or even
worse datasets gets scratched because of the operator error which
leads to a fine from say the SEC (or take you pick of agency).


See all of those issues? All perfectly valid. But, if I were having to
unravel the mess that came about from the wrong input at the console,
the operator would not be the person who should be blamed. There
should be contingency in place so that if RACF refuses to come up, we
get alerted very early on as to why, and have steps in place to remedy
the situation. Perhaps by re-IPL'ing. After all, that's what you're
going to do in 99% of cases if a wrong parameter is passed at IPL
time.


See above reply

If datasets get scratched, where's the back up? What's the contingency
in place to restore the data. If there isn't one, that's not the guy
who entered 'U' on the console instead of 'N''s fault.


Backup in our case was  24 hours ago, I can't speak for the actual  
company that it happened to.




 There are degrees of error of course some are who cares to a  
possible

company going bankrupt there are in the last case MANY people being
out of work (possibly 1000's or more) would you not fire the person?


If the company went bankrupt, it wouldn't be because someone varied
off the wrong device.


Hmmm well how about this scenario. System A is writing the master  
file to tape drive  d system b varies online the same tape drive and  
it starts to write to that tape drive you would have a clobbered tape  
and not know it for some time . If it was discovered during the  
database load that the tape was no good. The database would not be  
loaded and the firm could not open the next day. Not far fetched at all.




I think you are comparing apples and oranges. An operator can by  
mistake put
the company out of business, a programmer can cause loss revenue  
and yes

possibly a fine.


I'd love to see how the wrong prompt on the console was traced back to
the one thing that put the company out of business. Seriously, if
anyone has any stories along those lines, I'd love to hear it. As
would any maker of automation software, because it would be the most
amazing sales pitch ever.


See above and wait to see if the sysprog it happened to will pipe up.



 BUT that should have been found in
QA before the program goes live. In other words their work is checked
by others.


QA can pick up a lot of things, but, for example, can QA pick up an
application program that performs ten million inserts and no commit
into a DB2 table, then, for whatever reason, abend, and have DB2
rollback all its work, thus rendering the objects unavailable for x
hours? I've seen it done. - Didn't make the company go bankrupt
though.


It depends on the company. If its a matter of opening (or not) for  
daily trading chances are good they will be out of business. If its  
for a small business I might agree but small business's probably  
don't have mainframes either.






 An operator does not have this luxury. Yes programmers can
make mistakes but (in most cases) its not a shut the front doors and
turn off the power whoever is the last one to leave. An operator can
do so with a small "oops". That is why an operator, IMO must go
through several years of training so they CAN'T make stupid mistakes.


I agree that console commands are free from any sort of QA, however,
there are ways and means to ensure that mistakes are minimised.
Automation products can help here, or, if they're not available, an
application program can write out WTO or

Re: VARY too many devices offline

2007-10-23 Thread Ed Gould

On Oct 23, 2007, at 3:17 AM, Zaromil Tisler wrote:


On Tue, 23 Oct 2007 08:54:31 +0100, Kenny Fogarty wrote:


I agree, but, if the wrong date, or IPL parm, or whatever is entered,
then the chances are you're going to have to re-IPL to rectify the
situation. As you said above, if RACF doesn't start, you can go back
to see why, and take steps to fix the issue.


I wish the person this had happened to would pipe up, but to set the  
record more precisely because of a bad date RACF (This is hear say)  
did something to the (RACF) database that essentially rendered the  
system not operatrional. Just by IPLing (again) with the correct date  
was too late as the RACF database unusable. I do not know if they had  
backups or any specifics. I heard they were down for a day or so.  
Luckily this was a weekend.


Ed

--
For IBM-MAIN subscribe / signoff / archive access instructions,
send email to [EMAIL PROTECTED] with the message: GET IBM-MAIN INFO
Search the archives at http://bama.ua.edu/archives/ibm-main.html


Re: VARY too many devices offline

2007-10-23 Thread Ed Gould

On Oct 23, 2007, at 9:51 AM, Jon Brock wrote:


I hate to even think about the pain that might have caused.

Jon




Jon,

A person here on the list had to go through the exercise maybe he  
will speak up and give details.


Ed

--
For IBM-MAIN subscribe / signoff / archive access instructions,
send email to [EMAIL PROTECTED] with the message: GET IBM-MAIN INFO
Search the archives at http://bama.ua.edu/archives/ibm-main.html


Re: VARY too many devices offline

2007-10-23 Thread Zahir Hemini
Yes, I quite agree with you. At our organization we do brainstorm about what
could potentially go wrong and we admit that humans are error prone for a
whole variety of reasons. So we do try to have contingencies planned as well
as procedures to catch gross errors. My management is very strict that we
should present the benefits and drawbacks of all of our procedures, and
consider what could go wrong. Sometimes this makes us slow to implement new
systems, but they do tend to be quite reliable. But they have to be, because
many people depend upon our services, including the dispatch of police and
ambulances, as well as payrolls for public services.

On 10/23/07, Howard Brazee <[EMAIL PROTECTED]> wrote:
>
> On 22 Oct 2007 14:12:07 -0700, [EMAIL PROTECTED] (Zahir Hemini)
> wrote:
>
> >This is exactly why there are products like CA OPS/MVS and Automan and
> >probably a few others. People sometimes are new to a procedure, they do
> >accidentally make mistakes, and read and write instructions incorrectly.
>
> Which is the response to the message that compared operator errors
> with blood tester errors.
>
> People do make mistakes.   When the results of likely mistakes are too
> expensive, procedures and tools need to be created to minimize the
> impact of those mistakes.
>
> Lots of software design is like the design of sidewalks which have
> lines scored on them to encourage the breaks into a predictable
> direction.   Don't deny that errors happen- handle them.
>
> --
> For IBM-MAIN subscribe / signoff / archive access instructions,
> send email to [EMAIL PROTECTED] with the message: GET IBM-MAIN INFO
> Search the archives at http://bama.ua.edu/archives/ibm-main.html
>

--
For IBM-MAIN subscribe / signoff / archive access instructions,
send email to [EMAIL PROTECTED] with the message: GET IBM-MAIN INFO
Search the archives at http://bama.ua.edu/archives/ibm-main.html


Re: VARY too many devices offline

2007-10-23 Thread Howard Brazee
On 22 Oct 2007 18:43:18 -0700, [EMAIL PROTECTED] wrote:

> What 
>should happen to a computer operator in the  US Marine Corps whose typo causes 
>an infantry platoon to be destroyed by  friendly fire or a human error 
>resulting in a $1 loss?  One size does  not fit all.  Let the punishment fit 
>the 
>crime.

If that happens, punishment is too late.Management has screwed up
relying on a system that is that vulnerable.

--
For IBM-MAIN subscribe / signoff / archive access instructions,
send email to [EMAIL PROTECTED] with the message: GET IBM-MAIN INFO
Search the archives at http://bama.ua.edu/archives/ibm-main.html


Re: VARY too many devices offline

2007-10-23 Thread Howard Brazee
On 22 Oct 2007 14:12:07 -0700, [EMAIL PROTECTED] (Zahir Hemini)
wrote:

>This is exactly why there are products like CA OPS/MVS and Automan and
>probably a few others. People sometimes are new to a procedure, they do
>accidentally make mistakes, and read and write instructions incorrectly. 

Which is the response to the message that compared operator errors
with blood tester errors.

People do make mistakes.   When the results of likely mistakes are too
expensive, procedures and tools need to be created to minimize the
impact of those mistakes.

Lots of software design is like the design of sidewalks which have
lines scored on them to encourage the breaks into a predictable
direction.   Don't deny that errors happen- handle them.

--
For IBM-MAIN subscribe / signoff / archive access instructions,
send email to [EMAIL PROTECTED] with the message: GET IBM-MAIN INFO
Search the archives at http://bama.ua.edu/archives/ibm-main.html


Re: VARY too many devices offline

2007-10-23 Thread Jon Brock
I hate to even think about the pain that might have caused.

Jon



or the operator who put a future date at IPL time and really screwed  
up RACF. (story I heard from another company)


--
For IBM-MAIN subscribe / signoff / archive access instructions,
send email to [EMAIL PROTECTED] with the message: GET IBM-MAIN INFO
Search the archives at http://bama.ua.edu/archives/ibm-main.html


Re: VARY too many devices offline

2007-10-23 Thread Chase, John
> -Original Message-
> From: IBM Mainframe Discussion List On Behalf Of Bob Shannon
> 
> > Operators (especially console operators) are extremely 
> competent and 
> > if they screw up, IMO, they need to be fired.
> 
> As a systems programmer I'm glad I wasn't fired every time I 
> screwed something up.

Indeed.  In that kind of environment I'd have become a machinist a long
time ago (probably be a "former machinist" by now; I've certainly made
my share of scrap in the shop working part-time).

-jc-

--
For IBM-MAIN subscribe / signoff / archive access instructions,
send email to [EMAIL PROTECTED] with the message: GET IBM-MAIN INFO
Search the archives at http://bama.ua.edu/archives/ibm-main.html


Re: VARY too many devices offline

2007-10-23 Thread McKown, John
> -Original Message-
> From: IBM Mainframe Discussion List 
> [mailto:[EMAIL PROTECTED] On Behalf Of Bob Shannon
> Sent: Monday, October 22, 2007 4:34 PM
> To: IBM-MAIN@BAMA.UA.EDU
> Subject: Re: VARY too many devices offline
> 
> 
> > Operators (especially console operators) are extremely 
> competent and if
> > they screw up, IMO, they need to be fired.
> 
> As a systems programmer I'm glad I wasn't fired every time I 
> screwed something up.
> 
> Bob Shannon

Yea! Like the time under OS/VS1 that I compressed SYS1.LINKLIB using
IEBCOPY. So much for __that__ system! Luckily for me, this shop had two
370/145's running the same version of the OS, so that I could use the
2nd system to recover SYS1.LINKLIB of the 1st one. This was when I was
first in the business around 1979.

Or, more recently, so of the problems that we've been having going from
1.6 to 1.8 (messed up the APF list and CAS9 failed - not a good thing
since our IPL is automated).

--
John McKown
Senior Systems Programmer
HealthMarkets
Keeping the Promise of Affordable Coverage
Administrative Services Group
Information Technology

The information contained in this e-mail message may be privileged
and/or confidential.  It is for intended addressee(s) only.  If you are
not the intended recipient, you are hereby notified that any disclosure,
reproduction, distribution or other use of this communication is
strictly prohibited and could, in certain circumstances, be a criminal
offense.  If you have received this e-mail in error, please notify the
sender by reply and delete this message without copying or disclosing
it.

--
For IBM-MAIN subscribe / signoff / archive access instructions,
send email to [EMAIL PROTECTED] with the message: GET IBM-MAIN INFO
Search the archives at http://bama.ua.edu/archives/ibm-main.html


Re: VARY too many devices offline

2007-10-23 Thread Vernooy, C.P. - SPLXM
"Zaromil Tisler" <[EMAIL PROTECTED]> wrote in message
news:<[EMAIL PROTECTED]>...
> On Tue, 23 Oct 2007 08:54:31 +0100, Kenny Fogarty wrote:
> 
> >I agree, but, if the wrong date, or IPL parm, or whatever is entered,
> >then the chances are you're going to have to re-IPL to rectify the
> >situation. As you said above, if RACF doesn't start, you can go back
> >to see why, and take steps to fix the issue.
> 
> I absolutely agree. It is just ridiculous to make operators
responsible for 
> design shortages in an operating system. How comes system programmers 
> haven't seen the danger of losing system(s) and / or data through a
single 
> error (or a typo) in all these cases?
> 
> -- 
> Zaromil

Right, like the beautiful $PQ command in the early 80's, when it still
was accepted without parameters, guess what it purged? After we
discovered it, the command was corrected to require parameters.

Kees.
**
For information, services and offers, please visit our web site:
http://www.klm.com. This e-mail and any attachment may contain
confidential and privileged material intended for the addressee
only. If you are not the addressee, you are notified that no part
of the e-mail or any attachment may be disclosed, copied or
distributed, and that any other action related to this e-mail or
attachment is strictly prohibited, and may be unlawful. If you have
received this e-mail by error, please notify the sender immediately
by return e-mail, and delete this message. 

Koninklijke Luchtvaart Maatschappij NV (KLM), its subsidiaries
and/or its employees shall not be liable for the incorrect or
incomplete transmission of this e-mail or any attachments, nor
responsible for any delay in receipt.
Koninklijke Luchtvaart Maatschappij N.V. (also known as KLM Royal
Dutch Airlines) is registered in Amstelveen, The Netherlands, with
registered number 33014286 
**

--
For IBM-MAIN subscribe / signoff / archive access instructions,
send email to [EMAIL PROTECTED] with the message: GET IBM-MAIN INFO
Search the archives at http://bama.ua.edu/archives/ibm-main.html


Re: VARY too many devices offline

2007-10-23 Thread Zaromil Tisler
On Tue, 23 Oct 2007 08:54:31 +0100, Kenny Fogarty wrote:

>I agree, but, if the wrong date, or IPL parm, or whatever is entered,
>then the chances are you're going to have to re-IPL to rectify the
>situation. As you said above, if RACF doesn't start, you can go back
>to see why, and take steps to fix the issue.

I absolutely agree. It is just ridiculous to make operators responsible for 
design shortages in an operating system. How comes system programmers 
haven't seen the danger of losing system(s) and / or data through a single 
error (or a typo) in all these cases?

-- 
Zaromil

--
For IBM-MAIN subscribe / signoff / archive access instructions,
send email to [EMAIL PROTECTED] with the message: GET IBM-MAIN INFO
Search the archives at http://bama.ua.edu/archives/ibm-main.html


Re: VARY too many devices offline

2007-10-23 Thread Kenny Fogarty
>
> How would you handle a person that scrutinizes blood for a living and
> mistakes a diagnosis ?
> In some case an operator is just as "guilty" as the blood analyzer.
> If you say thats not the same, I would agree but not in all

I wouldn't even begin to make the analogy. But, mistakes happen.
That's a fact of life, and, putting an unbearable amount of strain on
someone, as in - make a mistake and you're fired, will not, under any
circumstances, help that person to not make mistakes. In fact, I'd go
as far as to say it would only make things worse.

> If an operator put in a wrong date at IPL and (because
> of that) RACF refuses to come up and there is no backout or even
> worse datasets gets scratched because of the operator error which
> leads to a fine from say the SEC (or take you pick of agency).

See all of those issues? All perfectly valid. But, if I were having to
unravel the mess that came about from the wrong input at the console,
the operator would not be the person who should be blamed. There
should be contingency in place so that if RACF refuses to come up, we
get alerted very early on as to why, and have steps in place to remedy
the situation. Perhaps by re-IPL'ing. After all, that's what you're
going to do in 99% of cases if a wrong parameter is passed at IPL
time.
If datasets get scratched, where's the back up? What's the contingency
in place to restore the data. If there isn't one, that's not the guy
who entered 'U' on the console instead of 'N''s fault.

>  There are degrees of error of course some are who cares to a possible
> company going bankrupt there are in the last case MANY people being
> out of work (possibly 1000's or more) would you not fire the person?

If the company went bankrupt, it wouldn't be because someone varied
off the wrong device.

> I think you are comparing apples and oranges. An operator can by mistake put
> the company out of business, a programmer can cause loss revenue and yes
> possibly a fine.

I'd love to see how the wrong prompt on the console was traced back to
the one thing that put the company out of business. Seriously, if
anyone has any stories along those lines, I'd love to hear it. As
would any maker of automation software, because it would be the most
amazing sales pitch ever.

>  BUT that should have been found in
> QA before the program goes live. In other words their work is checked
> by others.

QA can pick up a lot of things, but, for example, can QA pick up an
application program that performs ten million inserts and no commit
into a DB2 table, then, for whatever reason, abend, and have DB2
rollback all its work, thus rendering the objects unavailable for x
hours? I've seen it done. - Didn't make the company go bankrupt
though.

>  An operator does not have this luxury. Yes programmers can
> make mistakes but (in most cases) its not a shut the front doors and
> turn off the power whoever is the last one to leave. An operator can
> do so with a small "oops". That is why an operator, IMO must go
> through several years of training so they CAN'T make stupid mistakes.

I agree that console commands are free from any sort of QA, however,
there are ways and means to ensure that mistakes are minimised.
Automation products can help here, or, if they're not available, an
application program can write out WTO or WTOR messages with meaningful
text, which can also help an operator make a decision.

Training does not, and never will ensure that mistakes are never made.
Training educates, and helps people understand better, but it never,
ever eradicates mistakes from any process.

> Its possible that a programmer could write a program that
> misdiagnoses a test (health) result and yes that could lead to the
> persons death, but presumably there are other fingers in the stew to
> catch the errors.

I agree with that, and, broadly, that's the point I was trying to
make. There should be enough tech support/ops support/sys progs around
to see what went wrong, and implement some sort of contingency to
rectify the mistake with the minimum of outage/cost to the company, be
that restoring data, re-IPLing a system, or whatever.

> In the case of an operator there is no way to catch all errors that could 
> cause a major issue.

There are ways to catch all operator entries from the console via
various automation products which can interrogate what has been
entered, and take appropriate measures.

> Catching a Vary is a small part of any possible error. Catching a bad date at 
> lets
> say early on in the IPL process  is impossible by any of the suggestions
> mentioned as the exits (programs) are not available then.

I agree, but, if the wrong date, or IPL parm, or whatever is entered,
then the chances are you're going to have to re-IPL to rectify the
situation. As you said above, if RACF doesn't start, you can go back
to see why, and take steps to fix the issue.

There must always be contingency plans in place to catch human errors,
but, to go back to the original

Re: VARY too many devices offline

2007-10-22 Thread Ed Gould

On Oct 22, 2007, at 11:28 PM, Mark Post wrote:


On Mon, Oct 22, 2007 at 11:31 PM, in message

<[EMAIL PROTECTED]>, Ed Gould
<[EMAIL PROTECTED]> wrote:
-snip-

I don't believe that its a matter of "first error" but what the error
was.


This was your argument for most of this thread.  Good to see you  
abandon it.


Even so, I'm glad I never worked in the same group or even company  
as you.  I would have left as soon as possible once I understood  
there was no room for human error.




Mark,

As I stated before errors are OK just as long as they don't hurt. Now  
the question comes in degrees of HURT.
BTW I agreed in someways about errors and any decision to fire   
person to a degree. It was essentially (to me) how bad was the  
command and if it did any damage. I know of one time an operator  
typed in quiesce and it did exactly what it was supposed to do put  
the system in a wait state.  Did it cause damage no  (unless you  
count response time) that to me was a tossup but it did impact  
several hundred data entry types and 100 or so programmers. Was he  
fired? NO but he was talked to rather sharply. Did he screw up again,  
nope. Lesson learned.


Ed

--
For IBM-MAIN subscribe / signoff / archive access instructions,
send email to [EMAIL PROTECTED] with the message: GET IBM-MAIN INFO
Search the archives at http://bama.ua.edu/archives/ibm-main.html


Re: VARY too many devices offline

2007-10-22 Thread Mark Post
>>> On Mon, Oct 22, 2007 at 11:31 PM, in message
<[EMAIL PROTECTED]>, Ed Gould
<[EMAIL PROTECTED]> wrote: 
-snip-
> I don't believe that its a matter of "first error" but what the error  
> was.

This was your argument for most of this thread.  Good to see you abandon it.

Even so, I'm glad I never worked in the same group or even company as you.  I 
would have left as soon as possible once I understood there was no room for 
human error.


Mark Post

--
For IBM-MAIN subscribe / signoff / archive access instructions,
send email to [EMAIL PROTECTED] with the message: GET IBM-MAIN INFO
Search the archives at http://bama.ua.edu/archives/ibm-main.html


Re: VARY too many devices offline

2007-10-22 Thread Ed Gould

On Oct 22, 2007, at 8:00 PM, Mark Post wrote:


On Mon, Oct 22, 2007 at  8:17 PM, in message

<[EMAIL PROTECTED]>, Ed Gould
<[EMAIL PROTECTED]> wrote:
-snip-

How would you handle a person that scrutinizes blood for a living and
mistakes a diagnosis ?


I'm sure it happens multiple times a day, all over the country.  I  
doubt very much they get fired for the first error.


That is when medical suits come into play, a lot of stuff is covered  
up by them. When is the last time you have ever heard of operator  
insurance?





-snip-

There
are degrees of error of course some are who cares to a possible
company going bankrupt there are in the last case MANY people being
out of work (possibly 1000's or more) would you not fire the person?


If it were their first error, probably not.
The big bosses don't know/care who they just want someone to take the  
fall. I believe the answer to "not their first error" would be fire  
the guy. Its CYA time.





-snip-

That is why an operator, IMO must go
through several years of training so they CAN'T make stupid mistakes.


Ah, you're one of those people that believe humans can be trained  
into not making mistakes.  Dr. Deming would be shaking his head in  
sadness over that, if he were still alive.  It is, at best, a very  
misguided notion.


Not sure Dr Deming is (or care)  misguided I will leave to the  
reader.  It all comes down to the seriousness of the error. I have  
done things by accident (no one was impacted other than myself) I  
have done other things that have impacted people. I admitted the  
mistake to management. Was it life or death, no, was it important  
sort of but no real damage was done.  If it had been I would have  
expected to get fired. So should an operator.


-snip-

Being an operator is a lot like being an anethesiologist the skill to
know how much sleeping  gas to administer so the patient does not
awake or die. The surgeon is one part (major) of the operation an
operator is a helper. That is what an operator is a helper.


I guess you've never heard of death by "medical misadventure."   
That also happens nearly every day.  Not too many of them get fired  
either, for their first mistake.
Hmmm... there was a case (in Chicago) earlier this year about a drunk  
doctor during an operation. He was fired. The person died.  There was  
also a case of a doctor not treating a man because he was Jewish. BIG  
time lawsuit, As I recall the guy didn't die (but could be wrong) the  
doctor was fired.


I don't believe that its a matter of "first error" but what the error  
was. The computer operator does not make life and death decisions he/ 
she run the system but as a consequence of entering incorrect command  
or response can cause major damage. Do you really want any joe blow  
guy off the street running a console and no consequences for not  
doing a bad job? That is why operators, IMO must be trained. Giving  
anyone improper training is a whole side issue and can be discussed  
forever. I would like to take the position that operators are highly  
trained individuals and they should be given pay commensurate with  
duties and *strictly* held responsible for their watch.


Ed

--
For IBM-MAIN subscribe / signoff / archive access instructions,
send email to [EMAIL PROTECTED] with the message: GET IBM-MAIN INFO
Search the archives at http://bama.ua.edu/archives/ibm-main.html


Re: VARY too many devices offline

2007-10-22 Thread (IBM Mainframe Discussion List)
 
 
In a message dated 10/22/2007 5:09:38 P.M. Central Daylight Time,  
[EMAIL PROTECTED] writes:
>Operators (especially console operators) are extremely competent and if  
they screw up, IMO, they need to be fired.
>That's just  ridiculous.

 
I have seen operators who knew much more about what certain messages meant  
than I, the resident guru, did.  I have seen midnight shift operators  retire 
to the underground parking structure to smoke marijuana.  I worked  at a DASD 
vendor shop with NO operators, 110 LPARs, extremely bright and  competent guru 
developers who had unlimited hands-on access to consoles,  and anyone who 
deleted a non-deletable I/O error message before the  microcoders could look at 
it 
would be fired.  Different environments have  different requirements.  What 
should happen to a computer operator in the  US Marine Corps whose typo causes 
an infantry platoon to be destroyed by  friendly fire or a human error 
resulting in a $1 loss?  One size does  not fit all.  Let the punishment fit 
the 
crime.

 
Bill  Fairchild
Plainfield, IL



** See what's new at http://www.aol.com

--
For IBM-MAIN subscribe / signoff / archive access instructions,
send email to [EMAIL PROTECTED] with the message: GET IBM-MAIN INFO
Search the archives at http://bama.ua.edu/archives/ibm-main.html


Re: VARY too many devices offline

2007-10-22 Thread Mark Post
>>> On Mon, Oct 22, 2007 at  8:17 PM, in message
<[EMAIL PROTECTED]>, Ed Gould
<[EMAIL PROTECTED]> wrote: 
-snip-
> How would you handle a person that scrutinizes blood for a living and  
> mistakes a diagnosis ?

I'm sure it happens multiple times a day, all over the country.  I doubt very 
much they get fired for the first error.

-snip-
> There  
> are degrees of error of course some are who cares to a possible  
> company going bankrupt there are in the last case MANY people being  
> out of work (possibly 1000's or more) would you not fire the person?  

If it were their first error, probably not.

-snip-
> That is why an operator, IMO must go  
> through several years of training so they CAN'T make stupid mistakes.  

Ah, you're one of those people that believe humans can be trained into not 
making mistakes.  Dr. Deming would be shaking his head in sadness over that, if 
he were still alive.  It is, at best, a very misguided notion.

-snip-
> Being an operator is a lot like being an anethesiologist the skill to  
> know how much sleeping  gas to administer so the patient does not  
> awake or die. The surgeon is one part (major) of the operation an  
> operator is a helper. That is what an operator is a helper.

I guess you've never heard of death by "medical misadventure."  That also 
happens nearly every day.  Not too many of them get fired either, for their 
first mistake.


Mark Post

--
For IBM-MAIN subscribe / signoff / archive access instructions,
send email to [EMAIL PROTECTED] with the message: GET IBM-MAIN INFO
Search the archives at http://bama.ua.edu/archives/ibm-main.html


Re: VARY too many devices offline

2007-10-22 Thread Ed Gould

On Oct 22, 2007, at 5:41 PM, Kenny Fogarty wrote:

It’s the concept of "one strike, you're out!" that I find amazing.  
What
constitutes a mistake serious enough to put someone's livelihood at  
stake?
With that kind of pressure hanging over anyone, how can they be  
expected to

learn, and grow and enhance the company they're working for?


Kevin:
How would you handle a person that scrutinizes blood for a living and  
mistakes a diagnosis ?
In some case an operator is just as "guilty" as the blood analyzer.  
If you say thats not the same, I would agree but not in all  
circumstances. If an operator put in a wrong date at IPL and (because  
of that) RACF refuses to come up and there is no backout or even  
worse datasets gets scratched because of the operator error which  
leads to a fine from say the SEC (or take you pick of agency). There  
are degrees of error of course some are who cares to a possible  
company going bankrupt there are in the last case MANY people being  
out of work (possibly 1000's or more) would you not fire the person?  
The gravity of the error is all important, of course.




I think we've all seen enough horror stories over the years, where a
command, or parameter has been entered in error, but from those horror
stories, procedures got tightened, people were educated, things got  
better.
People learn from mistakes, and go on to pass on their findings to  
their
colleagues, to their peers and people act on that, and learn and  
improve.


If you hang a sword of Damocles like that over everyone within an
organisation, nothing would get done.

Within a programming environment, has anyone ever written a 100%  
bug-free
piece of code, that has lasted for all eternity, never once needing  
to be
optimized or re-compiled or re-linked? Are bugs that are exposed  
over time
by new releases of the operating system, or various subsystems  
classed as
being serious enough to have your job and reputation absolutely  
caned? I

just think it’s a nonsense concept and a nonsense approach.


I think you are comparing apples and oranges. An operator can by  
mistake put the company out of business, a programmer can cause loss  
revenue and yes possibly a fine . BUT that should have been found in  
QA before the program goes live. In other words their work is checked  
by others. An operator does not have this luxury. Yes programmers can  
make mistakes but (in most cases) its not a shut the front doors and  
turn off the power whoever is the last one to leave. An operator can  
do so with a small "oops". That is why an operator, IMO must go  
through several years of training so they CAN'T make stupid mistakes.  
Its possible that a programmer could write a program that  
misdiagnoses a test (health) result and yes that could lead to the  
persons death, but presumably there are other fingers in the stew to  
catch the errors. In the case of an operator there is no way to catch  
all errors that could cause a major issue. Catching a Vary is a small  
part of any possible error. Catching a bad date at lets say early on  
in the IPL process  is impossible by any of the suggestions mentioned  
as the exits (programs) are not available then.


I would suggest an incorrect VARY command might cause some amount of  
grief but (probably) not enough get fired over. I know some places  
have intercepts to catch an incorrect vary command. That is almost as  
bad (IMO) as saying your fired as you don't trust your operators so I  
am going to attempt to intercept commands and double check them.  
There is no guarantee that the devices were not the right devices  
even with an exit.


Being an operator is a lot like being an anethesiologist the skill to  
know how much sleeping  gas to administer so the patient does not  
awake or die. The surgeon is one part (major) of the operation an  
operator is a helper. That is what an operator is a helper.


Ed

--
For IBM-MAIN subscribe / signoff / archive access instructions,
send email to [EMAIL PROTECTED] with the message: GET IBM-MAIN INFO
Search the archives at http://bama.ua.edu/archives/ibm-main.html


Re: VARY too many devices offline

2007-10-22 Thread Kenny Fogarty
It’s the concept of "one strike, you're out!" that I find amazing. What
constitutes a mistake serious enough to put someone's livelihood at stake? 
With that kind of pressure hanging over anyone, how can they be expected to
learn, and grow and enhance the company they're working for?

I think we've all seen enough horror stories over the years, where a
command, or parameter has been entered in error, but from those horror
stories, procedures got tightened, people were educated, things got better.
People learn from mistakes, and go on to pass on their findings to their
colleagues, to their peers and people act on that, and learn and improve.

If you hang a sword of Damocles like that over everyone within an
organisation, nothing would get done.

Within a programming environment, has anyone ever written a 100% bug-free
piece of code, that has lasted for all eternity, never once needing to be
optimized or re-compiled or re-linked? Are bugs that are exposed over time
by new releases of the operating system, or various subsystems classed as
being serious enough to have your job and reputation absolutely caned? I
just think it’s a nonsense concept and a nonsense approach.

"Oh, that's a typo right there, on the console. It's been nice working with
you, take some typing lessons next time, and leave your card on the table.
Bye!"

 

No virus found in this outgoing message.
Checked by AVG Free Edition. 
Version: 7.5.488 / Virus Database: 269.15.5/1085 - Release Date: 22/10/2007
10:35

--
For IBM-MAIN subscribe / signoff / archive access instructions,
send email to [EMAIL PROTECTED] with the message: GET IBM-MAIN INFO
Search the archives at http://bama.ua.edu/archives/ibm-main.html


Re: VARY too many devices offline

2007-10-22 Thread Michael J Flores
Sometimes that's called experience and the more of it one has obtained. 
The more opportunity one had for making mistakes. Normally you don't make 
the same mistake more than twice, especially when you have caused major 
damage. But that's the reason a company prefers someone with experience, 
so they will not have to pay for mistakes that an unexperienced person 
would make somewhere down the line. Although many companies are leaning 
towards trading experience for saving money.

If fired for making a mistake. 

Most  likely would result in employees concern with taking  educated 
chances and stump their growing potential. Which is a lose - lose 
situation for the employee and company. The only constant is change and 
change constitutes mistakes. 
I for one, would rather have changes and grow and make some mistakes, then 
to stay stagnant and safe.

Regards,
Michael J Flores
Triumph Performance and Technical Architecture AMEX 
 



Bob Shannon <[EMAIL PROTECTED]> 
Sent by: IBM Mainframe Discussion List 
10/22/2007 02:33 PM
Please respond to
IBM Mainframe Discussion List 


To
IBM-MAIN@BAMA.UA.EDU
cc

Subject
Re: VARY too many devices offline






> Operators (especially console operators) are extremely competent and if
> they screw up, IMO, they need to be fired.

As a systems programmer I'm glad I wasn't fired every time I screwed 
something up.

Bob Shannon
Rocket Software

--
For IBM-MAIN subscribe / signoff / archive access instructions,
send email to [EMAIL PROTECTED] with the message: GET IBM-MAIN INFO
Search the archives at http://bama.ua.edu/archives/ibm-main.html


American Express made the following annotations on Mon Oct 22 2007 15:30:49
--
"This message and any attachments are solely for the intended recipient and may 
contain confidential or privileged information. If you are not the intended 
recipient, any disclosure, copying, use, or distribution of the information 
included in this message and any attachments is prohibited. If you have 
received this communication in error, please notify us by reply e-mail and 
immediately and permanently delete this message and any attachments. Thank you."

American Express a ajouté le commentaire suivant le Mon Oct 22 2007 15:30:49

Ce courrier et toute pièce jointe qu'il contient sont réservés au seul 
destinataire indiqué et peuvent renfermer des renseignements confidentiels et 
privilégiés. Si vous n'êtes pas le destinataire prévu, toute divulgation, 
duplication, utilisation ou distribution du courrier ou de toute pièce jointe 
est interdite. Si vous avez reçu cette communication par erreur, veuillez nous 
en aviser par courrier et détruire immédiatement le courrier et les pièces 
jointes. Merci. 
**
---

--
For IBM-MAIN subscribe / signoff / archive access instructions,
send email to [EMAIL PROTECTED] with the message: GET IBM-MAIN INFO
Search the archives at http://bama.ua.edu/archives/ibm-main.html


Re: VARY too many devices offline

2007-10-22 Thread Pommier, Rex R.
 Kenny,

I can't tell if you're agreeing with the comment that operators who
screw up should be fired or if you're saying that idea/policy is
ridiculous.

All I can say is that if any/all of us who lurk on this listserv got the
axe every time we screwed something up we'd be job hopping quite often.
Something about an old saying "let him who is without sin cast the first
stone"

Rex

-Original Message-
From: IBM Mainframe Discussion List [mailto:[EMAIL PROTECTED] On
Behalf Of Kenny Fogarty
Sent: Monday, October 22, 2007 5:09 PM
To: IBM-MAIN@BAMA.UA.EDU
Subject: Re: VARY too many devices offline

Operators (especially console operators) are extremely competent and if
they screw up, IMO, they need to be fired.

That's just ridiculous.
 

No virus found in this outgoing message.
Checked by AVG Free Edition. 
Version: 7.5.488 / Virus Database: 269.15.5/1085 - Release Date:
22/10/2007
10:35

--
For IBM-MAIN subscribe / signoff / archive access instructions,
send email to [EMAIL PROTECTED] with the message: GET IBM-MAIN INFO
Search the archives at http://bama.ua.edu/archives/ibm-main.html


Re: VARY too many devices offline

2007-10-22 Thread Kenny Fogarty
Operators (especially console operators) are extremely competent and  
if they screw up, IMO, they need to be fired.

That's just ridiculous.
 

No virus found in this outgoing message.
Checked by AVG Free Edition. 
Version: 7.5.488 / Virus Database: 269.15.5/1085 - Release Date: 22/10/2007
10:35

--
For IBM-MAIN subscribe / signoff / archive access instructions,
send email to [EMAIL PROTECTED] with the message: GET IBM-MAIN INFO
Search the archives at http://bama.ua.edu/archives/ibm-main.html


Re: VARY too many devices offline

2007-10-22 Thread Bob Shannon
> Operators (especially console operators) are extremely competent and if
> they screw up, IMO, they need to be fired.

As a systems programmer I'm glad I wasn't fired every time I screwed something 
up.

Bob Shannon
Rocket Software

--
For IBM-MAIN subscribe / signoff / archive access instructions,
send email to [EMAIL PROTECTED] with the message: GET IBM-MAIN INFO
Search the archives at http://bama.ua.edu/archives/ibm-main.html


Re: VARY too many devices offline

2007-10-22 Thread Ed Gould

On Oct 22, 2007, at 3:58 PM, Howard Brazee wrote:


On 22 Oct 2007 12:56:30 -0700, [EMAIL PROTECTED] (Jon Brock) wrote:


Color me disbelieving.

I think in the 30+ years I have been around OS360 and MVS and z/os,
there has never been an operator mistake of a typo.


I liked the time where the Vax operator put in a date a century in the
future.   DIR SINCE TOMORROW found the file I created before I called
him.
or the operator who put a future date at IPL time and really screwed  
up RACF. (story I heard from another company)


Ed

--
For IBM-MAIN subscribe / signoff / archive access instructions,
send email to [EMAIL PROTECTED] with the message: GET IBM-MAIN INFO
Search the archives at http://bama.ua.edu/archives/ibm-main.html


Re: VARY too many devices offline

2007-10-22 Thread Zahir Hemini
This is exactly why there are products like CA OPS/MVS and Automan and
probably a few others. People sometimes are new to a procedure, they do
accidentally make mistakes, and read and write instructions incorrectly. It
is up to the systems management to make sure that proper process is in place
to catch and disallow mistakes that will bring the systems down. It is
perfectly legitimate to specify procedures that require certain devices to
always be online, or online at specific times and dates. It is reasonable
that these are documented and embedded into  software procedures to ensure
that they are adhered to. The operator should not be fired, but given
education. The systems management that allowed this to happen are a better
candidate for dismissal.

On 10/22/07, Howard Brazee <[EMAIL PROTECTED]> wrote:
>
> On 22 Oct 2007 12:56:30 -0700, [EMAIL PROTECTED] (Jon Brock) wrote:
>
> >Color me disbelieving.
> >
> >I think in the 30+ years I have been around OS360 and MVS and z/os,
> >there has never been an operator mistake of a typo.
>
> I liked the time where the Vax operator put in a date a century in the
> future.   DIR SINCE TOMORROW found the file I created before I called
> him.
>
> --
> For IBM-MAIN subscribe / signoff / archive access instructions,
> send email to [EMAIL PROTECTED] with the message: GET IBM-MAIN INFO
> Search the archives at http://bama.ua.edu/archives/ibm-main.html
>

--
For IBM-MAIN subscribe / signoff / archive access instructions,
send email to [EMAIL PROTECTED] with the message: GET IBM-MAIN INFO
Search the archives at http://bama.ua.edu/archives/ibm-main.html


Re: VARY too many devices offline

2007-10-22 Thread Howard Brazee
On 22 Oct 2007 12:56:30 -0700, [EMAIL PROTECTED] (Jon Brock) wrote:

>Color me disbelieving.  
> 
>I think in the 30+ years I have been around OS360 and MVS and z/os,  
>there has never been an operator mistake of a typo.

I liked the time where the Vax operator put in a date a century in the
future.   DIR SINCE TOMORROW found the file I created before I called
him.

--
For IBM-MAIN subscribe / signoff / archive access instructions,
send email to [EMAIL PROTECTED] with the message: GET IBM-MAIN INFO
Search the archives at http://bama.ua.edu/archives/ibm-main.html


Re: VARY too many devices offline

2007-10-22 Thread Ed Gould

On Oct 22, 2007, at 2:38 PM, Jon Brock wrote:


Color me disbelieving.

I think in the 30+ years I have been around OS360 and MVS and z/os,
there has never been an operator mistake of a typo.




Not buying this, either:

I have seen operators make errors but the OS has caught all of
them and no harm was done.



Jan's take is correct: outlawing mistakes does not mean they won't
happen.  (They're "mistakes," see.)  It also doesn't do much for
protecting your system integrity; nor does it signify any sort of
effective management.
Demanding 100% correctness from your people on pain of termination is
not only futile and irrational, it can be counter-productive.

I'm not saying that errors should be taken lightly, but expecting that
you'll never see someone type "V 110E-1107,OFFLINE" instead of "V
1103-1107,OFFLINE" is not realistic.  And simply advocating a
sacrificial firing or two doesn't really help.  It's awfully easy  
to be

free with someone else's livelihood.



Jon,

I find operators (in general) are an honest and a good lot. If they  
do make mistakes they generally step up to the plate and take  
responsibility.
The places I have worked not just anyone can sit at a console, you  
must earn it. That means going from a print jockey to a tape jockey  
to a master console jockey. Along the lines if they make mistakes the  
average to poor ones never get promoted. So a master console type is  
extremely competent at his/her job. I have yet to see someone at a  
console that was not competent to do their job. Competent = no mistakes.
I tend to judge people (operators) as to their position on the food  
chain. I have seen shift managers that do not fit into the competency  
issue, so I treat them *generally* less than console types.


The above goes for non military type situations or union situations,  
that is a whole different ball of wax.


Operators (especially console operators) are extremely competent and  
if they screw up, IMO, they need to be fired.


Ed

--
For IBM-MAIN subscribe / signoff / archive access instructions,
send email to [EMAIL PROTECTED] with the message: GET IBM-MAIN INFO
Search the archives at http://bama.ua.edu/archives/ibm-main.html


Re: VARY too many devices offline

2007-10-22 Thread Jon Brock
Color me disbelieving.  
 
I think in the 30+ years I have been around OS360 and MVS and z/os,  
there has never been an operator mistake of a typo.




Not buying this, either:

I have seen operators make errors but the OS has caught all of  
them and no harm was done.



Jan's take is correct: outlawing mistakes does not mean they won't
happen.  (They're "mistakes," see.)  It also doesn't do much for
protecting your system integrity; nor does it signify any sort of
effective management.  
Demanding 100% correctness from your people on pain of termination is
not only futile and irrational, it can be counter-productive.  

I'm not saying that errors should be taken lightly, but expecting that
you'll never see someone type "V 110E-1107,OFFLINE" instead of "V
1103-1107,OFFLINE" is not realistic.  And simply advocating a
sacrificial firing or two doesn't really help.  It's awfully easy to be
free with someone else's livelihood.

Jon

--
For IBM-MAIN subscribe / signoff / archive access instructions,
send email to [EMAIL PROTECTED] with the message: GET IBM-MAIN INFO
Search the archives at http://bama.ua.edu/archives/ibm-main.html


Re: VARY too many devices offline

2007-10-22 Thread Ed Gould

On Oct 22, 2007, at 7:05 AM, Jan MOEYERSONS wrote:



Not being to tongue in cheek here.. but we used to fire the operator.

Ed



And what good does that do to the integrity of your systems???

Does that prevent anyone else from making a mistake?

Did you never make a typo in any of the commands you ever entered?   
(If you

never did, then that means you never entered any...)

Jantje


There is a difference between typing an email and operator console.  
If you cannot tell the difference then you shouldn't be an operator.  
Operators (or sysprog or whomever) that have access to the systems  
console MUST be aware of the fact and type accordingly.


I think in the 30+ years I have been around OS360 and MVS and z/os,  
there has never been an operator mistake of a typo. If the operator  
was in doubt he opened the IBM Book. If that didn't answer his (her)  
question a call to his supervisor or friendly sysprog. There have  
been times when commands weren't as well documented (or weren't at  
all) that calls were made and we typed a instruction sheet for the  
operator(s). Sorry, mistakes on the console (in some cases) could be  
life threatening and therefore operators did NOT make any errors.


Ed

PS: I have seen operators make errors but the OS has caught all of  
them and no harm was done.


pps: We had a case where the operator screwed up a IPL in the specify  
system parameters reply and we had to IPL again. The operator had a  
book (literally) thrown at him. The VP apologized but told him if he  
made another mistake he was out of there. He didn't.


--
For IBM-MAIN subscribe / signoff / archive access instructions,
send email to [EMAIL PROTECTED] with the message: GET IBM-MAIN INFO
Search the archives at http://bama.ua.edu/archives/ibm-main.html


Re: VARY too many devices offline

2007-10-22 Thread Jan MOEYERSONS
>
>Not being to tongue in cheek here.. but we used to fire the operator.
>
>Ed
>

And what good does that do to the integrity of your systems???

Does that prevent anyone else from making a mistake? 

Did you never make a typo in any of the commands you ever entered?  (If you
never did, then that means you never entered any...)

Jantje.

--
For IBM-MAIN subscribe / signoff / archive access instructions,
send email to [EMAIL PROTECTED] with the message: GET IBM-MAIN INFO
Search the archives at http://bama.ua.edu/archives/ibm-main.html


Re: VARY too many devices offline

2007-10-19 Thread Ed Gould

On Oct 19, 2007, at 9:13 AM, Field, Alan C. wrote:


Yesterday someone issued an RO *ALL,V  708-7014,OFFLINE which led to a
number of unplanned IPLs.



Now mgmt wants to implement a fingerchecker.



I searched CBT for a command exit that might ask something like "you
really want to vary 26,000 devices offline?"



How do others shops handle this, have you got an exit you'd be willing
to share before we re-invent the wheel.



Thanks,



Alan




Not being to tongue in cheek here.. but we used to fire the operator.

Ed

--
For IBM-MAIN subscribe / signoff / archive access instructions,
send email to [EMAIL PROTECTED] with the message: GET IBM-MAIN INFO
Search the archives at http://bama.ua.edu/archives/ibm-main.html


Re: VARY too many devices offline

2007-10-19 Thread Chase, John
> -Original Message-
> From: IBM Mainframe Discussion List On Behalf Of Field, Alan C.
> 
> Yesterday someone issued an RO *ALL,V  708-7014,OFFLINE which 
> led to a number of unplanned IPLs. 
> 
> Now mgmt wants to implement a fingerchecker. 
> 
> I searched CBT for a command exit that might ask something 
> like "you really want to vary 26,000 devices offline?" 
> 
> How do others shops handle this, have you got an exit you'd 
> be willing to share before we re-invent the wheel.

We use a "home-grown" exit that examines DISPLAY and VARY commands,
requiring device addresses to be entered as 4-digit numbers and limiting
ranges to 32 devices, and suppresses the command along with a console
message if the operator "muffs" it.  I'll seek permission to send you a
copy off-list, if you wish.

-jc-

--
For IBM-MAIN subscribe / signoff / archive access instructions,
send email to [EMAIL PROTECTED] with the message: GET IBM-MAIN INFO
Search the archives at http://bama.ua.edu/archives/ibm-main.html


Re: VARY too many devices offline

2007-10-19 Thread McKown, John
> -Original Message-
> From: IBM Mainframe Discussion List 
> [mailto:[EMAIL PROTECTED] On Behalf Of Field, Alan C.
> Sent: Friday, October 19, 2007 9:14 AM
> To: IBM-MAIN@BAMA.UA.EDU
> Subject: VARY too many devices offline
> 
> Yesterday someone issued an RO *ALL,V  708-7014,OFFLINE which led to a
> number of unplanned IPLs. 
> 
> Now mgmt wants to implement a fingerchecker. 
> 
> I searched CBT for a command exit that might ask something like "you
> really want to vary 26,000 devices offline?" 
> 
> How do others shops handle this, have you got an exit you'd be willing
> to share before we re-invent the wheel.
> 
> Thanks,
> 
> Alan 

We use CA-OPS/MVS to validate the range on all VARY commands. Do you
have an automated operations package which can intercept operator
commands?

--
John McKown
Senior Systems Programmer
HealthMarkets
Keeping the Promise of Affordable Coverage
Administrative Services Group
Information Technology

The information contained in this e-mail message may be privileged
and/or confidential.  It is for intended addressee(s) only.  If you are
not the intended recipient, you are hereby notified that any disclosure,
reproduction, distribution or other use of this communication is
strictly prohibited and could, in certain circumstances, be a criminal
offense.  If you have received this e-mail in error, please notify the
sender by reply and delete this message without copying or disclosing
it.

--
For IBM-MAIN subscribe / signoff / archive access instructions,
send email to [EMAIL PROTECTED] with the message: GET IBM-MAIN INFO
Search the archives at http://bama.ua.edu/archives/ibm-main.html


VARY too many devices offline

2007-10-19 Thread Field, Alan C.
Yesterday someone issued an RO *ALL,V  708-7014,OFFLINE which led to a
number of unplanned IPLs. 

 

Now mgmt wants to implement a fingerchecker. 

 

I searched CBT for a command exit that might ask something like "you
really want to vary 26,000 devices offline?" 

 

How do others shops handle this, have you got an exit you'd be willing
to share before we re-invent the wheel.

 

Thanks,

 

Alan 


--
For IBM-MAIN subscribe / signoff / archive access instructions,
send email to [EMAIL PROTECTED] with the message: GET IBM-MAIN INFO
Search the archives at http://bama.ua.edu/archives/ibm-main.html