On 01.12.2003 20:22, Coats, Jack wrote:
Just a thought, take TSM server down.
Did so yesterday...
Use an OS native program (Microsoft Backup) using a hand fed scratch tape or two to backup the local system and restore a few files until you believe the drive will hold its own.
...and failed to get it to work with ntbackup - several problems from "too many tapes in library" to refusal to write to the tape in the drive. grrr.
I see your point (and thanks for the input), but does anyone have another approach?
It would be very much appreciated...
TIA, greets, tobi...
Then start with TSM. If the same tapes are read and written OK in other drives, then you still have a drive problem, IMHO. (Or cable, or SCSI controller). If all the errors are on the same drive, you may still have a problem.
LOL ... JC
-----Original Message----- From: Tobias Hofmann [SMTP:[EMAIL PROTECTED] Sent: Monday, December 01, 2003 11:19 AM To: [EMAIL PROTECTED] Subject: ANR8302E: How do I test new drive?
Ladies, gentlemen,
a new, replaced LTO drive gives me headaches - in the following situation:
OS: MS Win2k Advanced Server, 5.00.2195, SP2 TSM: Storage Management Server for Windows - Version 5, Release 1, Level 0.0
After having worked with no probs afaik, last week one of my two HP ultrium LTO drives died and was replaced by dell (guarantee/bronce contract). Following this, i wittnessed for two days the cumbersome process of firmware-updating the library (PV136T, equals a ADIC Scalar 100, or so I am told), after which the system seemed ok. I deleted drive and path definitions and hoped for the best. This is what I have seen two days later:
11/27/03 15:19:30 ANR0984I Process 22 for MIGRATION started in the BACKGROUND at 15:19:30. 11/27/03 15:19:30 ANR1000I Migration process 22 started for storage pool DISKPOOL. 11/27/03 15:20:17 ANR8337I LTO volume 000002L1 mounted in drive MT2.0.0.3 (mt2.0.0.3). 11/27/03 15:46:35 ANR8337I LTO volume 000005L1 mounted in drive MT1.0.0.3 (mt1.0.0.3). 11/27/03 15:46:38 ANR1340I Scratch volume 000005L1 is now defined in storage pool LTOPOOL1. 11/27/03 15:49:53 ANR8302E I/O error on drive MT1.0.0.3 (mt1.0.0.3) (OP=WRITE, Error Number=1117, CC=306, KEY=03, ASC=0C, ASCQ=00, SENSE=71.00.03.00.00.00.00.0E.00.00.00.00.0C.00 .00.00.75.0B.00.00.00.00., Description=Drive or media failure). Refer to Appendix D in the 'Messages' manual for recommended action. 11/27/03 15:49:53 ANR8359E Media fault detected on LTO volume 000005L1 in drive MT1.0.0.3 (mt1.0.0.3) of library LB0.0.0.3. 11/27/03 15:49:53 ANR1411W Access mode for volume 000005L1 now set to "read-only" due to write error. 11/27/03 15:49:53 ANR0523W Transaction failed for session 1837 for node CMS1 (WinNT) - error on output storage device. 11/27/03 15:49:59 ANR1341I Scratch volume 000005L1 has been deleted from storage pool LTOPOOL1. 11/27/03 15:50:49 ANR8468I LTO volume 000005L1 dismounted from drive MT1.0.0.3 (mt1.0.0.3) in library LB0.0.0.3. 11/27/03 15:51:41 ANR8337I LTO volume 000009L1 mounted in drive MT1.0.0.3 (mt1.0.0.3). 11/27/03 15:51:45 ANR1340I Scratch volume 000009L1 is now defined in storage pool LTOPOOL1. 11/27/03 15:53:40 ANR8302E I/O error on drive MT1.0.0.3 (mt1.0.0.3) (OP=WRITE, Error Number=1117, CC=306, KEY=03, ASC=0C, ASCQ=00, SENSE=71.00.03.00.00.00.00.0E.00.00.00.00.0C.00 .00.00.75.0B.00.00.00.00., Description=Drive or media failure). Refer to Appendix D in the 'Messages' manual for recommended action. 11/27/03 15:53:40 ANR8359E Media fault detected on LTO volume 000009L1 in drive MT1.0.0.3 (mt1.0.0.3) of library LB0.0.0.3. 11/27/03 15:53:40 ANR1411W Access mode for volume 000009L1 now set to "read-only" due to write error. 11/27/03 15:53:40 ANR0523W Transaction failed for session 1837 for node CMS1 (WinNT) - error on output storage device. 11/27/03 15:53:46 ANR1341I Scratch volume 000009L1 has been deleted from storage pool LTOPOOL1. 11/27/03 15:54:16 ANR8468I LTO volume 000009L1 dismounted from drive MT1.0.0.3 (mt1.0.0.3) in library LB0.0.0.3. 11/27/03 15:55:06 ANR8337I LTO volume 000010L1 mounted in drive MT1.0.0.3 (mt1.0.0.3). 11/27/03 15:55:11 ANR1340I Scratch volume 000010L1 is now defined in storage pool LTOPOOL1. 11/27/03 15:56:47 ANR8302E I/O error on drive MT1.0.0.3 (mt1.0.0.3) (OP=WEOF, Error Number=1117, CC=306, KEY=03, ASC=0C, ASCQ=00, SENSE=70.00.03.00.00.00.00.0E.00.00.00.00.0C.00 .00.00.75.0B.00.00.00.00., Description=Drive or media failure). Refer to Appendix D in the 'Messages' manual for recommended action. 11/27/03 15:56:47 ANR8359E Media fault detected on LTO volume 000010L1 in drive MT1.0.0.3 (mt1.0.0.3) of library LB0.0.0.3. 11/27/03 15:56:47 ANR1401W Mount request denied for volume 000010L1 - mount failed. 11/27/03 15:57:18 ANR8468I LTO volume 000010L1 dismounted from drive MT1.0.0.3 (mt1.0.0.3) in library LB0.0.0.3. 11/27/03 16:00:04 ANR8302E I/O error on drive MT1.0.0.3 (mt1.0.0.3) (OP=READ, Error Number=1117, CC=306, KEY=03, ASC=14, ASCQ=00, SENSE=F0.00.03.00.00.00.50.0E.00.00.00.00.14.00 .00.00.50.8F.00.00.00.00., Description=Drive or media failure). Refer to Appendix D in the 'Messages' manual for recommended action. 11/27/03 16:00:04 ANR8355E I/O error reading label for volume 000010L1 in drive MT1.0.0.3 (mt1.0.0.3). 11/27/03 16:00:11 ANR0482W Session 1836 for node CMS1 (WinNT) terminated - idle for more than 15 minutes. 11/27/03 16:00:35 ANR8381E LTO volume 000010L1 could not be mounted in drive MT1.0.0.3 (mt1.0.0.3). 11/27/03 16:00:35 ANR1402W Mount request denied for volume 000010L1 - volume unavailable. 11/27/03 16:00:35 ANR1410W Access mode for volume 000010L1 now set to "unavailable". ...
This is taking up my already few scratch tapes too fast...
(By the way: What is the correct way to turn these tapes into scratch tapes again? move data into the storagepool, and then manually turn them to scratch-status again? or do i miss something there?)
Tests done with the Dell-provided software tools have not given any hint on problems occuring in the library. I have now been advised to swap the two drives (one becomes two and vice versa) and see if the problem persists - but my problem now is that I would not know how to do reasonable testing.
I don,t know how to trigger migration process manually (searched the admin ref pdf to no avail), but even if I did, there is only so many migrations I can do, and then I am stumped.
I don,t know how to advise TSM to move data using a defined drive - say, from 1 to 2, and not from 2 to one - how can this be done?
I am quite limited in space and don,t have copy pools, so I am a bit hesitant to move data with a possibly faulty drive and breaking something - is there a clever approach doing a copy onto a single tape volume? Again, I checked the guide but did not see anything wrt this...
Any input would be very much appreciated,
greets, tobi
-- ---------------------------------------------------------------------- Dipl.-Ing. Tobias Hofmann Bauhaus-Universitaet Weimar D99423 Weimar Professur fuer Graphische Datenverarbeitung Projekt medienquadrat SnailMail: Bauhaus-Universitaet Weimar, Fak. Medien, D99421 Weimar Location: D99423 Weimar Karl-Haussknechtstr. 7 Zimmer 111 Fon: ++49-(0)3643-58-3780 Fax : -3701 e-mail: mailto:[EMAIL PROTECTED] ----------------------------------------------------------------------
-- ---------------------------------------------------------------------- Dipl.-Ing. Tobias Hofmann Bauhaus-Universitaet Weimar D99423 Weimar Professur fuer Graphische Datenverarbeitung Projekt medienquadrat SnailMail: Bauhaus-Universitaet Weimar, Fak. Medien, D99421 Weimar Location: D99423 Weimar Karl-Haussknechtstr. 7 Zimmer 111 Fon: ++49-(0)3643-58-3780 Fax : -3701 e-mail: mailto:[EMAIL PROTECTED] ----------------------------------------------------------------------