Here's the update of our events last night...sorry I didn't respond back
last night, a bit busy :-)

EMC came back and stated that there were bad sectors in the luns...9
volumes were 'toast' (900 gig overall in volume group)

EMC rebuilt  the luns effected
Brought TSM back up,  9 volumes marked off-line
jfs logs rebuilt
disabled all sessions, except for a couple of effected 4 servers that had
to have data restored (TSM Server down for some time)
We ended up with  the decision to delete volumes involved, then create new
volumes
finally started backups

AIX Engineers and EMC evaluating the cause of issue 2 drives down.

Thank You to all who responded!



Nancy Backhaus
Enterprise Systems
[EMAIL PROTECTED]
Office: (716) 887-7979
Cell: (716)  609-2138




TSM_User <[EMAIL PROTECTED]>
Sent by: "ADSM: Dist Stor Manager" <ADSM-L@VM.MARIST.EDU>
05/04/2005 06:54 PM
Please respond to "ADSM: Dist Stor Manager"


        To:     ADSM-L@VM.MARIST.EDU
        cc:
        Subject:        Re: CX700 ATA failure - Audit Storage Pool Volumes


We lost 2 drives in the same array on a CX600 and they wouldn't rebuild.
Our problem did not cause a loss of data.  EMC had to bring down the whole
thing and when it came back up the rebuild kicked off and everything was
OK.  We were told that if we had a more recent version of microcode then
the rebuild would have worked from the start.

Bottom line wait until EMC is done before you do anything because you may
not have lost any data.
Steve Schaub <[EMAIL PROTECTED]> wrote:
First, make sure the drives were really bad - we had this happen to us on
a
cx500 and it turned out that the microcode was actually bad, and the 2nd
raid5 disk failure was phony. Have your ce double check this - we lost
tons
of data from this little glitch biting us several days in a row before
they
figured it out (when it did come back up the volumes were toast).

-----Original Message-----
From: ADSM: Dist Stor Manager [mailto:[EMAIL PROTECTED] On Behalf Of
Nancy L Backhaus
Sent: Wednesday, May 04, 2005 1:09 PM
To: ADSM-L@VM.MARIST.EDU
Subject: [ADSM-L] CX700 ATA failure - Audit Storage Pool Volumes

TSM Server 5.2.3.5
AIX Operating System 5.2

We have lost two drives on a single ATA RAID array. EMC CE is on site and
working with the EMC SAC to resolve the problem. We brought TSM Server
down while EMC is working on the issue. I started receiving the
following errors before we stop all processes, backups, migrations marking
the effected diskpool volumes read-only.

05/04/05 09:06:36 ANR1411W Access mode for volume
/tsmpool39/diskpool22a now
set to "read-only" due to write error. (SESSION:
125991)

My question is >

When we bring the server back up. I want to ensure that I am doing the
right thing, or if I am missing anything.

1. I will need to make sure the volumes affected are online and in a
read/write status.
2. Audit diskpools volumes, Inspect only first 3. If there are damaged
volumes,rerun audit volume, with Fix=Yes.
4. Suggestions?


Nancy Backhaus
Enterprise Systems
[EMAIL PROTECTED]
Office: (716) 887-7979
Cell: (716) 609-2138

CONFIDENTIALITY NOTICE: This email message and any attachments are for the
sole use of the intended recipient(s) and may contain proprietary,
confidential, trade secret or privileged information. Any unauthorized
review, use, disclosure or distribution is prohibited and may be a
violation
of law. If you are not the intended recipient or a person responsible for
delivering this message to an intended recipient, please contact the
sender
by reply email and destroy all copies of the original message.


Please see the following link for the BlueCross BlueShield of Tennessee
E-mail disclaimer: http://www.bcbst.com/email_disclaimer.shtm

__________________________________________________
Do You Yahoo!?
Tired of spam?  Yahoo! Mail has the best spam protection around
http://mail.yahoo.com

Reply via email to