Re: TSM architecture
Also, please consider system maintenance. TSM is usually maintained during the day, when backups are not running. Ask your newbie architect whether you can take this system down during the day for OS maintenance, problems, etc.

ROBERT R. PRICE
TSM Administrator, CSC
Phone: 412-342-1947 Fax: 412-342-1755
rpric...@csc.com

This is a PRIVATE message. If you are not the intended recipient, please delete without copying and kindly advise us by e-mail of the mistake in delivery. NOTE: Regardless of content, this e-mail shall not operate to bind CSC to any order or other contract unless pursuant to explicit written agreement or government initiative expressly permitting the use of e-mail for such purpose.

From: Wanda Prather wprat...@jasi.com
Sent by: ADSM: Dist Stor Manager ads...@vm.marist.EDU
To: ADSM-L@VM.MARIST.EDU
Date: 09/18/2009 12:05 PM
Subject: Re: TSM architecture
Please respond to ADSM: Dist Stor Manager ads...@vm.marist.EDU

FWIW, when you upgrade to 6.1, your TSM server will be running WebSphere (for the ISC) and DB2, plus TSM. I think your current way of stacking via LPARs is a better choice. W

On Fri, Sep 18, 2009 at 12:01 PM, Shawn Drew shawn.d...@americas.bnpparibas.com wrote:

All of this depends on the amount of data involved, not the number of nodes. We have some TSM instances with 50 nodes that finish their backups by 3AM or so, and all the housekeeping finishes by 6AM. Regards, Shawn Drew

From: rrho...@firstenergycorp.com
Sent by: ADSM-L@VM.MARIST.EDU
To: ADSM-L
Date: 09/18/2009 11:49 AM
Subject: Re: [ADSM-L] TSM architecture
Please respond to ADSM-L@VM.MARIST.EDU

Our TSM servers are busy around the clock. In fact, 6am-noon is some of the busiest time, with migrations going on.
From: Haberstroh, Debbie (IT) habe...@voughtaiRCRAFT.COM
Sent by: ADSM: Dist Stor Manager ads...@vm.marist.EDU
To: ADSM-L@VM.MARIST.EDU
Date: 09/18/2009 11:36 AM
Subject: TSM architecture
Please respond to ADSM: Dist Stor Manager ads...@vm.marist.EDU

Hi All, My current environment is TSM 5.5.3, 1 library manager, 3 database servers. These are installed on a P550 AIX 5.3 system in separate LPARs. We have 355 clients, of which 200+ are active. My current TSM databases are 100GB, 65-82% utilized. We are going to be doing a large business object installation which will add 30-50 new clients, including multiple Oracle databases. Our proposal was to add an additional TSM server to handle the new requirements.

We have a new architect who is not very familiar with TSM, and his proposal is to stack TSM on another server that is running a different application. His argument is that TSM does most of its work at night and the application (which one is TBD) does most of its work during the day. From what I know, due to TSM's resource utilization, it should be on its own hardware. Has anyone tried to do this, and what were your results? I would love to get some good arguments to take back that would support our original position to install on separate hardware. Thanks to everyone for your ideas.

Debbie Haberstroh
TSM Server Administration

The information contained in this message is intended only for the personal and confidential use of the recipient(s) named above. If the reader of this message is not the intended recipient or an agent responsible for delivering it to the intended recipient, you are hereby notified that you have received this document in error and that any review, dissemination, distribution, or copying of this message is strictly prohibited. If you have received this communication in error, please notify us immediately, and delete the original message.

This message and any attachments (the message) is intended solely for the addressees and is confidential.
If you receive this message in error, please delete it and immediately notify the sender. Any use not in accord with its purpose, any dissemination or disclosure, either whole or partial, is prohibited except formal approval. The internet can not guarantee the integrity of this message. BNP PARIBAS (and its subsidiaries) shall
Re: TSM library problem caused by IBM3584 virtual I/O
Hi John, We are seeing similar issues, albeit on a much smaller library. Two of our TSM Servers run with 3584 libraries on version 5.4.2.0. The nine-drive LTO3 library does not have virtual I/O enabled and we see no problems, but the ten-drive LTO3 library with virtual I/O enabled does get the ANR8300E errors that you documented. The effect we see is tapes going into unavailable status. To correct the problem, we need to idle tape mounts, audit the library and update the tape status to read/write. But it would be better to find the root cause of the problem and correct it.

Robert R. Price
ADSM/TSM Administrator, Computer Sciences Corporation
Phone: 412-374-3247 Fax: 412-374-6371
[EMAIL PROTECTED]

From: Schneider, John [EMAIL PROTECTED] ERCY.NET
Sent by: ADSM: Dist Stor Manager [EMAIL PROTECTED] .EDU
To: ADSM-L@VM.MARIST.EDU
Date: 09/05/2008 12:51 PM
Subject: TSM library problem caused by IBM3584 virtual I/O
Please respond to ADSM: Dist Stor Manager [EMAIL PROTECTED] .EDU

Greetings, We are running TSM 5.4.3.0 on AIX 5.3ML5. We have a library master instance, and 9 other TSM instances that are library clients. They all share an IBM3584 with 24 LTO4 tape drives, and an EMC EDL virtual library emulating an IBM3584 with 128 LTO1 drives.

Recently our IBM CE told us we should be running with virtual I/O, a feature of the IBM3584 library. The reason he recommended it is that we frequently have more than 32 outgoing tapes every day, and sometimes the Operators don't get around to taking the tapes out of the I/O doors, and checkouts have to wait. With virtual I/O turned on, the checkouts go ahead and run to completion, even though the tapes don't actually go into the I/O doors. Then later, when the I/O doors get empty, the tape library moves the rest of the tapes into the I/O doors. That part seems to be working as expected.

After we turned virtual I/O on, we started getting weird symptoms in TSM, like tapes that we would check back in to the library, but later TSM could not find them. So we decided that maybe virtual I/O changed the element number map, and we should have redefined the library to TSM. So we:

1) Deleted the drive paths, drives, library path, and library on the library master instance, and all client instances.
2) From the Tape library Web interface, performed a complete library inventory (just in case).
3) Defined the library, library path, drives, and drive paths on the library master instance, and all client instances.
4) Checked back in the scratch tapes.
5) Checked back in the private tapes.
6) Did an Audit library on the library master and all library clients.

It was only a few days later that we started getting errors from TSM of the form:

09/04/08 22:00:56 ANR8300E I/O error on library SUN2079 (OP=6C03, CC=314, KEY=05, ASC=3B, ASCQ=0E, SENSE=70.00.05.00.00.00.00.0A.00.00.00.00.3B.0E.00.C0.00.04., Description=The source slot or drive was empty in an attempt to move a volume). Refer to Appendix C in the 'Messages' manual for recommended action. (SESSION: 395703, PROCESS: 487)
09/04/08 22:00:56 ANR8312E Volume 101781L4 could not be located in library SUN2079. (SESSION: 395703, PROCESS: 487)
09/04/08 22:00:56 ANR8358E Audit operation is required for library SUN2079. (SESSION: 395703, PROCESS: 487)
09/04/08 22:00:56 ANR8381E NAS volume 101781L4 could not be mounted in drive LTO4_F2_D09 (c576t0l0). (SESSION: 395703, PROCESS: 487)
09/04/08 22:00:56 ANR1402W Mount request denied for volume 101781L4 - volume unavailable. (SESSION: 395703, PROCESS: 487)
09/04/08 22:00:56 ANR1410W Access mode for volume 101781L4 now set to unavailable. (SESSION: 395703, PROCESS: 487)

It is different tapes every time, so we now have over a dozen tapes that are missing on account of this. Did we do something wrong when we turned on virtual I/O for this library? I found this technote, that sounds like it is supported. It also says we need to restart the TSM
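When comparing many of these ANR8300E messages across tapes, it can help to pull the fields out mechanically. The following is only a sketch: the function name parse_io_error is made up for illustration, the message text is the one quoted in this thread with the line-wrap hyphen removed, and the byte offsets assume the SENSE string is standard fixed-format SCSI sense data (sense key in the low nibble of byte 2, ASC in byte 12, ASCQ in byte 13).

```python
import re

# Message text copied from the activity log quoted in this thread.
ANR8300E = (
    "ANR8300E I/O error on library SUN2079 (OP=6C03, CC=314, KEY=05, "
    "ASC=3B, ASCQ=0E, "
    "SENSE=70.00.05.00.00.00.00.0A.00.00.00.00.3B.0E.00.C0.00.04., "
    "Description=The source slot or drive was empty in an attempt "
    "to move a volume)."
)


def parse_io_error(message):
    """Return the NAME=VALUE pairs of an I/O error message as a dict.

    Hypothetical helper, not part of TSM. Values run up to the next
    comma or closing parenthesis.
    """
    return dict(re.findall(r"(\w+)=([^,)]+)", message))


fields = parse_io_error(ANR8300E)

# Decode the dotted hex string into raw sense bytes.
sense = bytes.fromhex(fields["SENSE"].replace(".", ""))

# Assumption: fixed-format sense data, so these offsets apply.
sense_key = sense[2] & 0x0F
asc, ascq = sense[12], sense[13]

print(fields["KEY"], fields["ASC"], fields["ASCQ"])
print(f"{sense_key:02X} {asc:02X} {ascq:02X}")
```

Both print statements should agree, which is a quick sanity check that the KEY/ASC/ASCQ values TSM reports are simply taken from the sense bytes returned by the library.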
Re: Improving TSM performance - memory related setting
We have experienced this phenomenon as well at 5.3.5.2 on a Solaris TSM Server. The system has 32GB memory, 6 CPU. We used 20GB for bufferpool and verified that there was no paging going on. Performance was terrible! As a measure, expiration took about a week. All other processes, sessions, queries were similarly bad. Reduced bufpool to about 1/2 GB and performance is now great. Expiration now runs in about 3 hours. Other measures improved dramatically as well.

Current DB stats:

Available Space (MB): 204,800
Assigned Capacity (MB): 202,860
Maximum Extension (MB): 1,940
Maximum Reduction (MB): 75,212
Page Size (bytes): 4,096
Total Usable Pages: 51,932,160
Used Pages: 32,695,965
Pct Util: 63.0
Max. Pct Util: 63.0
Physical Volumes: 16
Buffer Pool Pages: 131,072
Total Buffer Requests: 306,981,255
Cache Hit Pct.: 99.72
Cache Wait Pct.: 0.00

Robert R. Price
TSM Administrator, Computer Sciences Corporation
Phone: 412-374-3247 Fax: 412-374-6371
[EMAIL PROTECTED]
Computer Sciences Corporation Registered Office: 3170 Fairview Park Drive, Falls Church, Virginia 22042, USA Registered in Nevada, USA No: C-489-59

From: Matthew Glanville matthew.glanvill [EMAIL PROTECTED]
Sent by: ADSM: Dist Stor Manager [EMAIL PROTECTED] .EDU
To: ADSM-L@VM.MARIST.EDU
Date: 04/07/2008 09:43 AM
Subject: Re: Improving TSM performance - memory related setting
Please respond to ADSM: Dist Stor Manager [EMAIL PROTECTED] .EDU

Just a warning: don't necessarily go to 1/2 or 1/8th of your total physical memory. If your server has 64 GB of memory, 8 GB (1/8th) for BUFPOOLSIZE is probably too high. I would keep it below 1 GB unless you prove to yourself with some testing that it is helping speed up the backups or restores. Don't look only at the database cache hit rate.

I think there's some inefficiency in how the TSM database cache works, as it may have to search through that cached memory to find a hit. When you are caching 8 GB, it takes some time to search through all that. You'll see this in the dsmserv process: its CPU use increases as BUFPOOLSIZE gets larger. If it takes longer to search through that memory than it does to read the page from disk, you are hurting performance, not helping it, by caching in memory. Maybe DB2 in TSM version 6 will help this issue.

I tried to put a problem in for it a few years back, but couldn't get beyond level 2 since no one understood what I was talking about. They kept thinking the server was slow due to the operating system using its own page file to make up for the large memory setting, even though the 64 bit server had 32 GB of physical memory... Ahh well. I can use that other 30 GB for something else. Or is it fixed now? Matt G.

ADSM: Dist Stor Manager ADSM-L@VM.MARIST.EDU wrote on 04/04/2008 03:40:56 AM:

Bufpoolsize should be set between 1/2 and 1/8 of the amount of total physical memory; if I remember well, that is what best practices recommend. Also, if you modify this parameter you will have to take a look at the cache hit percentage (q db f=d), which can increase when you modify the bufpoolsize, or perhaps decrease if the setting is incorrect. The cache hit percentage must be above 99 %. I would also recommend taking a look at the performance tuning guide, which will tell you more about the many settings that can increase the performance of your TSM server. Regards, Bernaldo.

- Original message -
From: Paul Dudley [EMAIL PROTECTED]
To: ADSM-L@VM.MARIST.EDU
Sent: Friday, April 4, 2008 5:44:02
Subject: Re: [ADSM-L] Improving TSM
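The "about 1/2 GB" figure and the 63.0 Pct Util in the database statistics quoted in this thread can be checked with a little arithmetic. This is just a worked sketch using the numbers from the q db f=d output; the variable names are mine, and the kilobyte figure is only useful if BUFPOOLSIZE is expressed in kilobytes on your server level, which I believe is the case for 5.x but have not verified here.

```python
# Values copied from the "Current DB stats" output in this thread.
PAGE_SIZE_BYTES = 4_096
BUFFER_POOL_PAGES = 131_072
ASSIGNED_CAPACITY_MB = 202_860
TOTAL_USABLE_PAGES = 51_932_160
USED_PAGES = 32_695_965

# Buffer pool size implied by the page count.
buffer_pool_bytes = BUFFER_POOL_PAGES * PAGE_SIZE_BYTES
buffer_pool_mb = buffer_pool_bytes // 2**20   # megabytes
buffer_pool_kb = buffer_pool_bytes // 2**10   # kilobytes

# A 4,096-byte page means 256 pages per megabyte of assigned capacity.
pages_per_mb = 2**20 // PAGE_SIZE_BYTES
pages_from_capacity = ASSIGNED_CAPACITY_MB * pages_per_mb

# Database utilization as a percentage of usable pages.
pct_util = round(100 * USED_PAGES / TOTAL_USABLE_PAGES, 1)

print(buffer_pool_mb, buffer_pool_kb, pages_from_capacity, pct_util)
```

So the 131,072 buffer pool pages correspond to 512 MB, which is the "about 1/2 GB" mentioned, and comfortably under the 1 GB ceiling Matt suggests.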
Run Script Query/Control
Is there a method to query whether any scripts are running? Is there a method to cancel a running script?

Robert R. Price
ADSM/TSM Administrator, Computer Sciences Corporation
Phone: 412-374-3247 Fax: 412-374-6371
[EMAIL PROTECTED]
Re: TSM 5.3 Serial Parallel commands
I haven't tried this yet, but I believe the syntax to be:

PARALLEL
command1
command2
...
SERIAL
command3

where the commands after SERIAL will wait until ALL commands between PARALLEL and SERIAL have completed.

Robert R. Price
ADSM/TSM Administrator, Computer Sciences Corporation
Phone: 412-374-3247 Fax: 412-374-6371
[EMAIL PROTECTED]

From: Timothy Hughes Timothy.Hughes@OIT.STATE.NJ.US
Sent by: ADSM: Dist Stor Manager ADSM-L
To: ADSM-L@VM.MARIST.EDU
Date: 11/18/05 10:25 AM
Subject: TSM 5.3 Serial Parallel commands
Please respond to ADSM: Dist Stor Manager

Hello guys, Has anyone tested, or is anyone using, the new Serial and Parallel commands in TSM 5.3? I am testing a script that would execute some daily routines, such as backing up storage pools and updating the backuppool (disk). Right now these routines are executed as Administrative Schedules. From what I have read, this command (Serial) is used to ensure that any preceding commands' tasks are complete before proceeding with the next command in the script. The script below is the one that I am testing and would like to execute. Can anyone find any parameter that I may be missing, if any?

/* run multiple commands in Serial and wait for each one of them to */
/* complete their tasks before proceeding with the next command */
Serial
/* Backup Diskpool, Update Disk stgpool Backup Primary Tapepool */
BA STG backuppool R3592POOL maxpr=x preview=no wait=yes
update stgpool backuppool highmig=xx lowmig=xx wait=yes
BA STG H3592POOL R3592POOL maxpr=x preview=no wait=yes
/* Wait for previous commands task to finish */

Thanks for any help in advance!
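To make the ordering described in the reply above concrete, here is a small model of it, written as a sketch rather than a description of how the server actually schedules work. The function plan_batches is hypothetical. It assumes that a script starts in serial mode, that commands following PARALLEL may run together, and that SERIAL acts as a barrier for everything started since the last PARALLEL.

```python
def plan_batches(script_lines):
    """Group script commands into batches that run one after another.

    Commands inside one batch may run concurrently; each batch starts
    only after the previous batch has finished. Hypothetical model.
    """
    batches = []
    parallel_group = None  # None means the script is in serial mode
    for raw in script_lines:
        line = raw.strip()
        if not line or line.startswith("/*"):
            continue  # skip blank lines and comments
        keyword = line.upper()
        if keyword == "PARALLEL":
            if parallel_group is None:
                parallel_group = []
        elif keyword == "SERIAL":
            if parallel_group:
                batches.append(parallel_group)  # barrier: close the group
            parallel_group = None
        elif parallel_group is not None:
            parallel_group.append(line)
        else:
            batches.append([line])  # serial mode: one command per batch
    if parallel_group:
        batches.append(parallel_group)
    return batches


script = ["PARALLEL", "command1", "command2", "SERIAL", "command3"]
print(plan_batches(script))
```

Under this model, a script that contains only SERIAL followed by commands (like the one being tested in the original question) produces one command per batch, so each step waits for the one before it.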
TSM Operational Reporting
I am trying OR (Version 5.3.0.0). What I have tried seems to work out just fine, except for one issue. I have seven TSM Servers for which I want a report at 8am. If I schedule them all at the same time (8am), then the first one works and the rest error out with "Unable to load rept dll. RC: 2", and all subsequent reports fail. If I spread out the reporting over time (one hour per server), then things work out quite nicely. I would like all of the reports at 8am, though. Has anyone seen this behavior, and perhaps more importantly, does anyone have a fix?

Robert R. Price
ADSM/TSM Administrator, Computer Sciences Corporation
Phone: 412-374-3247 Fax: 412-374-6371
[EMAIL PROTECTED]
Solaris TSM Server 5.2.4.0 and the Query Libvolume command
Can someone confirm this before I report it to IBM? There seems to be a bug in this release of TSM (5.2.4.0) with the command query libvol; see below.

A query volume wn3006 shows the tape WN3006 as being used by the ARCHTAPE storage pool. A query libvolume wn3006 shows that it is not in the library. Yet, when I executed an audit volume, the tape mounted and read without issue. If I execute a query libvolume qualstar WN3006, the tape shows up in the library. Note the difference is explicitly stating the library name, qualstar. The help file indicates that the default for the library name is all libraries. This matches how all TSM Servers worked at earlier levels. Note also that if one executes query libvolume (without any parameters) the tape in question DOES show up.

tsm: WINDSOR_TSM>q v wn3006

Volume Name   Storage     Device      Estimated   Pct    Volume
              Pool Name   Class Name  Capacity    Util   Status
                                      (MB)
-----------   ---------   ----------  ---------   -----  -------
WN3006        ARCHTAPE    AIT3        118,795.0   100.0  Filling

tsm: WINDSOR_TSM>q libv wn3006
ANR2034E QUERY LIBVOLUME: No match found using this criteria.
ANS8001I Return code 11.

tsm: WINDSOR_TSM>q libv qualstar wn3006

Library Name  Volume Name  Status   Owner  Last Use  Home     Device
                                                     Element  Type
------------  -----------  -------  -----  --------  -------  ------
QUALSTAR      WN3006       Private         Data      54

tsm: WINDSOR_TSM>help q libv

QUERY LIBVOLUME

QUERY LIBVOLUME (Query a Library Volume)

Use this command to display information about one or more volumes that are checked into an automated library for use by the Tivoli Storage Manager server.

Privilege Class
Any administrator can issue this command.

Syntax
                      .-*------------.  .-*-----------.
>>-Query LIBVolume--+--------------+--+-------------+--><
                      '-library_name-'  '-volume_name-'

Parameters
library_name
  Specifies the name of the library. You can use wildcard characters to specify this name. This parameter is optional. The default is all libraries.
volume_name
  Specifies the volume name. You can use wildcard characters to specify this name. This parameter is optional. The default is all volumes.
...

tsm: WINDSOR_TSM>q libv

Library Name  Volume Name  Status   Owner  Last Use  Home     Device
                                                     Element  Type
------------  -----------  -------  -----  --------  -------  ------
QUALSTAR      CLN044       Cleaner                   3
QUALSTAR      CLN077       Cleaner                   2
QUALSTAR      CLN078       Cleaner                   1
QUALSTAR      CLN079       Cleaner                   0
QUALSTAR      WN3006       Private         Data      54
QUALSTAR      WN3011       Private         Data      48
QUALSTAR      WN3013       Private         Data      121
...

Robert R. Price
ADSM/TSM Administrator, Computer Sciences Corporation
Phone: 412-374-3247 Fax: 412-374-6371
[EMAIL PROTECTED]
Re: AIT3 and errors
We, too, have seen many problems with AIT-3. I run 5 TSM Servers that use AIT-3 drives in a Qualstar library. These all give me problems. I also run two TSM Servers using 3590E drives in a 3494 library. Almost no problems with these two. Of the 5 AIT-3 systems, problems seem to be proportional to the load on the TSM Server. Two of these are AIX and three are Solaris. I have had problems with TSM 5.1 and 5.2, currently running 5.2.2.5 and 5.1.7.1 and 5.1.5.4. The most heavily used system presents problems absolutely every day. We replaced all of the drives - did not help. We ordered a new batch of tapes - did not help. Worked with the vendor - did not help.

Some of the specific issues that I see are:

a) Write errors (a couple per day). TSM sets volume access to readonly. Sometimes if I reset the volume access to readwrite and try again, the tape writes the next time. Once I get three write errors on a tape, I do a Move Data and reuse the tape as scratch. Almost all of the time, the next write pass works just fine on the tape. Occasionally the same tape gives me problems and I retire the tape. I have resorted to cleaning each drive once per day. Jury still out on whether this is helping.

b) Tape fails to unload. TSM reports Drive Failure with ASC/ASCQ=44/00 (internal target failure). Usually the tape unloads if I hold in the eject button for 20-30 seconds; sometimes I need to power cycle the drive to get the tape to unload. Microcode level 0205 seems to have all but eliminated this problem. But now I get ASC/ASCQ=3B/08 with about the same frequency as I had been getting the 44/00. The 3B/08 (reposition error) does not leave the tape stuck in the drive, but TSM does turn the drive offline. So it appears that the microcode upgrade traded one problem for another and helped somewhat, in that a physical visit to the stuck tape drive is no longer necessary.

c) Drive failures. I've had to replace 5 (of the 10) drives on this busiest system in the past month. Different failure modes:
   1) Drive failed to unload tapes, including a cleaning cartridge.
   2) Drive gets three flashing lights on any attempted tape load; cleaning cartridge works okay.
   3) Every time I attempted to turn this drive online, it hung the SCSI bus.
   4) One replacement drive was DOA, would not even power up.
   5) Every tape loaded resulted in ASC/ASCQ=30/02 (media incompatibility).

d) I had to replace a drive on the second most busy system today. The drive gets I/O Error Reading Label on every tape it tries to mount. This resulted in all (61) of my scratch tapes being rejected and put into private status. All of the filling status tapes were also tried, rejected and put into unavailable access. Took a while to sort all of this out. Tried cleaning the drive a couple of times, but this did not help.

Is there some way that the three of us (at least) can get together to resolve some of these issues? Any ideas?

Robert R. Price
ADSM/TSM Administrator, Computer Sciences Corporation
Phone: 412-374-3247 Fax: 412-374-6371
[EMAIL PROTECTED]

From: Barth, Terry (MBS) Terry.Barth@MORTGAGEFAMILY.COM
Sent by: ADSM: Dist Stor Manager ADSM-L
To: [EMAIL PROTECTED]
Date: 11/11/04 11:59 AM
Subject: Re: AIT3 and errors
Please respond to ADSM: Dist Stor Manager

We have just had the library replaced along with many of the tapes and we still continue to see errors. :(

-Original Message-
From: ADSM: Dist Stor Manager [mailto:[EMAIL PROTECTED] On Behalf Of David Benigni
Sent: Thursday, November 11, 2004 11:11 AM
To: [EMAIL PROTECTED]
Subject: Re: AIT3 and errors

Terry, Yes!
I'm glad to hear someone else is having problems too. I've had a number of drive failures occur too. Right now I'm working with the vendor to resolve these issues. Dave [EMAIL PROTECTED] 11/11/2004 10:02:30 AM Good Morning - Is anyone out there using AIT3 and experiencing high error issues, such as i/o errors reading labels, write errors, read errors, data integrity errors, stuck tapes? Thanks -- Terry
Re: Where did my scratch tapes go?
I have seen this problem occasionally. The sequence that I have seen is:

1) A process requests a scratch tape.
2) The tape is mounted in a drive.
3) The drive has some type of hardware error.
4) The scratch tape being mounted is marked Private so it won't be used for scratch again.
5) Another scratch tape is mounted to satisfy the original request.
6) Repeat 2-5 until no more scratch tapes are available.

The result is many tapes in Private status not assigned to any storage pool, and no scratch tapes. Perhaps this could have been your scenario?

Robert R. Price
ADSM/TSM Administrator, Computer Sciences Corporation
Phone: 412-374-3247 Fax: 412-374-6371
[EMAIL PROTECTED]

From: Bill Boyer bill.boyer@VERIZON.NET
Sent by: ADSM: Dist Stor Manager ADSM-L
To: [EMAIL PROTECTED]
Date: 11/05/04 02:17 PM
Subject: Re: Where did my scratch tapes go?
Please respond to ADSM: Dist Stor Manager

I just look for tapes that have a last use of NULL.

select volume_name from libvolumes where status='Private' and last_use is NULL

It's normally a small list I get back, then I usually do:

select * from volhistory where volume_name='volser'

to see what the last status of each volume is. If the status is Private and the last volhistory is stgdelete, then you need to figure out why the tape is private in the libvolumes... maybe write protected... not labeled... someone checked it in as private by mistake...?

Bill Boyer
"Some days you are the bug, some days you are the windshield." - ??
-Original Message-
From: ADSM: Dist Stor Manager [mailto:[EMAIL PROTECTED] Behalf Of Ben Bullock
Sent: Friday, November 05, 2004 2:11 PM
To: [EMAIL PROTECTED]
Subject: Re: Where did my scratch tapes go?

We've seen something similar before. Here's a place to start looking... Run this command to see if you have tapes that are checked in as private volumes, but actually have no client data on them.

select volume_name from libvolumes where status='Private' and libvolumes.volume_name not in (select volume_name from volumes) and libvolumes.volume_name not in (select volume_name from volhistory where type in ('BACKUPFULL', 'BACKUPINCR', 'DBSNAPSHOT', 'EXPORT'))

If you get back a list of tapes, you need to figure out how they got into that state. Perhaps they are import tapes that still have data on them. Or perhaps it is a procedural issue where your operator is checking in scratch tapes with the status=private flag, in which case TSM will never use them. If you find any tapes to clean up, you could check them out with the remove=no option and then check them back in as scratch tapes. Ben

-Original Message-
From: ADSM: Dist Stor Manager [mailto:[EMAIL PROTECTED] On Behalf Of Nancy L Backhaus
Sent: Friday, November 05, 2004 12:00 PM
To: [EMAIL PROTECTED]
Subject: Where did my scratch tapes go?

Hello, Background: TSM Version 5.2.3.4, AIX Operating System Version 5.2.2.0. 2 libraries - one scratch count is steady; my LTOLIB library scratch count is too low, privates too high. What is happening? Where do I begin looking?

11/04/04
STATUS    Unnamed[2]
-------   ----------
Cleaner            1
Private          350
Scratch           24

10/04/04
STATUS    Unnamed[2]
-------   ----------
Cleaner            1
Private          250
Scratch           59

Nancy Backhaus
Enterprise Systems
[EMAIL PROTECTED]
Office: (716) 887-7979 Cell: (716) 609-2138

CONFIDENTIALITY NOTICE: This email message and any attachments are for the sole use of the intended recipient(s) and may contain proprietary, confidential, trade secret or privileged information.
Any unauthorized review, use, disclosure or distribution is prohibited and may be a violation of law. If you are not the intended recipient or a person responsible for delivering this message to an intended recipient, please contact the sender by reply email and destroy all copies of the original message.
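For anyone who wants to see what the two select statements in this thread pick out before running them against a live server, here is a rough mock-up using Python's sqlite3 module. Everything about the data is invented: the three tables are cut down to the columns the queries touch, and the volume names A00001 through A00004 are hypothetical. SQLite is only standing in for the TSM SQL interface, so treat this as an illustration of the query logic, not of TSM itself.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE libvolumes (volume_name TEXT, status TEXT, last_use TEXT);
CREATE TABLE volumes (volume_name TEXT);
CREATE TABLE volhistory (volume_name TEXT, type TEXT);

-- Hypothetical library contents
INSERT INTO libvolumes VALUES ('A00001', 'Private', 'Data');
INSERT INTO libvolumes VALUES ('A00002', 'Private', 'DbBackup');
INSERT INTO libvolumes VALUES ('A00003', 'Private', NULL);
INSERT INTO libvolumes VALUES ('A00004', 'Scratch', NULL);

-- A00001 belongs to a storage pool; A00002 holds a database backup
INSERT INTO volumes VALUES ('A00001');
INSERT INTO volhistory VALUES ('A00002', 'BACKUPFULL');
INSERT INTO volhistory VALUES ('A00003', 'STGDELETE');
""")

# Bill's check: private tapes with no recorded last use.
NULL_LAST_USE = """
select volume_name from libvolumes
 where status='Private' and last_use is NULL
"""

# Ben's check: private tapes that hold neither pool data nor
# database backups, snapshots or exports.
NO_DATA = """
select volume_name from libvolumes
 where status='Private'
   and libvolumes.volume_name not in (select volume_name from volumes)
   and libvolumes.volume_name not in
       (select volume_name from volhistory
         where type in ('BACKUPFULL', 'BACKUPINCR', 'DBSNAPSHOT', 'EXPORT'))
"""

null_last_use = [row[0] for row in conn.execute(NULL_LAST_USE)]
no_data = [row[0] for row in conn.execute(NO_DATA)]
print(null_last_use, no_data)
```

In this made-up library both queries single out A00003, the tape that is Private but belongs to nothing, which is the kind of volume that would then be checked out with remove=no and checked back in as scratch.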
Re: D2D vs. tape backups with TSM?
I know that I must be missing something here, but can someone explain why this D2D backup scheme utilizes what I assume to be the sequential FILE device type? Why not just use the (random) DISK storage pool type? This would eliminate any co-location issues and reclaims, and would not waste ANY disk space with deleted files. Would someone enlighten me?

From: TSM_User tsm_user@YAHOO.COM
Sent by: ADSM: Dist Stor Manager ADSM-L
To: [EMAIL PROTECTED]
Date: 07/29/04 08:03 PM
Subject: Re: D2D vs. tape backups with TSM?
Please respond to ADSM: Dist Stor Manager

I agree about not using collocation, but this customer has always collocated everything offsite and onsite. Moving to ATA was step one. Step two is getting them to turn off collocation on all the Large File data. Step three is going to be turning off collocation for everything on disk.

Rushforth, Tim [EMAIL PROTECTED] wrote:

We have always used compression going to Disk. We use exclude.compression for things like .zip etc. You may want to use compressalways yes to avoid resending data that grows. We are not collocating at all - why would you want to? From a restore perspective (using multi-session restore) it is better to have the data spread out across multiple volumes.

-Original Message-
From: TSM_User [mailto:[EMAIL PROTECTED]
Sent: July 28, 2004 8:29 PM
To: [EMAIL PROTECTED]
Subject: Re: D2D vs. tape backups with TSM?

We are using 25 GB volumes right now. We are also still collocating the storage pools that use the file device class by node. This has worked out fine for us. Sad to admit but I wasn't aware of the Technical Exchange recommendation.
Is there a white paper from it that you could refer me to? We are contemplating turning on node compression everywhere to also help reduce disk space.

Also, I made mention in a previous post that we were reclaiming down to 50% and that was fine. Well, like always, when you make a comment like that it makes you think and then you go look. I found that we were using around 16 TB of ATA space in all when you look at the In Use numbers. When I looked at the actual disk in use it was closer to 21 TB of data. I am currently reclaiming everything down to 40 and I plan to get down to 25 again. At that point I will compare the numbers and see how much I can reduce the 21 TB in use.

Also, some somewhat interesting information. We have found that the I/O capabilities of the latest and greatest servers can really help push a lot more data to disk. We have always been told by our disk vendor that the bottleneck wasn't them. We ruled out many things except them. Finally we looked at a more detailed performance monitor of our systems and we found that we were killing the processor during times when we were pushing a lot of data to disk. With these new servers we see migrations from Fibre disk to ATA disk at over 150 GB/hr. We do have 60 TB of ATA space, though, so we have a lot of disks to write to.

Rushforth, Tim wrote:

Just curious, what size of file volumes are you using? We were originally using 25 GB, and then I listened to the Disk Only Backup Strategies Technical Exchange where they recommended 2-4 GB volumes. Thanks, Tim Rushforth, City of Winnipeg

-Original Message-
From: TSM_User [mailto:[EMAIL PROTECTED]
Sent: July 27, 2004 6:41 PM
To: [EMAIL PROTECTED]
Subject: Re: D2D vs. tape backups with TSM?

Funny, we set ours down to 25% as well just to see what would happen. This worked, but we have since set all of the ATA Pools to 50% and we just leave them there.
Theoretically what could happen is we could be wasting twice as much space but the fact is the volumes were going from 25% to 50% in a matter of days and when we looked at how many volumes were between 25% and 50% in our environment we determined there was no need to reclaim down that far. From all outward signs there was no issues with reclaiming down to 25% we just didn't think it was worth doing the extra work to get back such a small amount of disk. Disk is cheap, right! lol Rushforth, Tim wrote: We've set ours at 25%. We are just piloting an all disk backup pool for some clients on one of our
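The "wasting twice as much space" remark and the 16 TB versus 21 TB figures quoted earlier in this thread can be put side by side with a bit of arithmetic. This is a back-of-the-envelope sketch only. It assumes the percentages are reclamation thresholds in the sense of "percent of a volume that is reclaimable space", that a full volume is reclaimed as soon as it crosses the threshold, and it ignores volumes that are still filling. The function name worst_case_expansion is mine.

```python
def worst_case_expansion(reclaim_threshold_pct):
    """Upper bound on (disk occupied) / (live data) for full volumes.

    Assumption: a volume may carry up to reclaim_threshold_pct percent
    dead space before reclamation rewrites it.
    """
    live_fraction = 1 - reclaim_threshold_pct / 100
    return 1 / live_fraction


# Figures quoted in the thread: "In Use" data versus actual disk used.
in_use_tb = 16
on_disk_tb = 21

observed_expansion = on_disk_tb / in_use_tb        # disk per unit of data
dead_space_tb = on_disk_tb - in_use_tb             # space held by deleted files
dead_space_pct = round(100 * dead_space_tb / on_disk_tb, 1)

for threshold in (25, 50):
    print(threshold, round(worst_case_expansion(threshold), 2))
print(observed_expansion, dead_space_tb, dead_space_pct)
```

Under these assumptions a 50% threshold allows up to twice the live data on disk and a 25% threshold up to about 1.33 times, while the observed ratio of 21 TB to 16 TB sits at roughly 1.31, so about 5 TB (just under a quarter of the occupied disk) is reclaimable space.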
Re: AIT drive I/O problems
Same story here with AIT-3 drives. I see lots of read errors that usually go away on a retry. I see a good number of write errors as well. Some of these are repeatable on any drive and I chalk that up to defective media. We clean the drives twice a week, which is twice as often as recommended by Sony. I also see the problem with drives hanging up with dismount failures after an ASC/ASCQ=44/00. The drive shows all three lights blinking and needs a power cycle to clear it. Good to hear that Sony is looking for a firmware fix.

Robert R. Price
ADSM/TSM Administrator, Computer Sciences Corporation
Phone: 412-374-3247 Fax: 412-374-6371
[EMAIL PROTECTED]

From: Riley, Craig Riley.Craig@TCHDEN.ORG
Sent by: ADSM: Dist Stor Manager ADSM-L
To: [EMAIL PROTECTED]
Date: 06/25/04 01:26 PM
Subject: Re: AIT drive I/O problems
Please respond to ADSM: Dist Stor Manager

We have been seeing something similar with our AIT3 drives running in a Spectralogic 64K, also almost exclusively during reclamation processing. The difference with our situation is that, in addition to read errors, occasionally the volume will fail to unmount and remain stuck in the drive. We then have to power cycle the drive to eject the tape. Running SCSI traces on the drives when this issue comes up, we have found that the event is always preceded by a SCSI forward command asking the drive firmware to move the tape position forward by some increment. Sony has identified this as a firmware problem and is testing a fixed version of code right now.
Craig Riley
The Children's Hospital in Denver

-----Original Message-----
From: ADSM: Dist Stor Manager [mailto:[EMAIL PROTECTED]] On Behalf Of Steven Bridge
Sent: Friday, June 25, 2004 8:02 AM
To: [EMAIL PROTECTED]
Subject: AIT drive I/O problems

Currently running TSM server version 5.1.8 on an AIX 5.2 machine. We have a Qualstar TLS-412600 library with three AIT-2 drives. We are having continual problems with I/O errors, almost exclusively during the reclamation of tapes. Most often 98-99% of the reclaim works, but in perhaps a quarter to a half of our reclaims we see a number of read errors ( 100 ). In almost all cases, when we then perform a 'move data' on the errant tape, it reads the remaining data off without any problems. Errors have been seen on two of the drives over the past month, but I don't know the relative frequency of use of all 3 drives well enough to say whether the other drive is error free or just lucky.

What is most frustrating about this problem is that a drive experiencing read errors then hangs. The reclaim process is cancelled when the volume has had no reads 'logged' for some time, but the process usually takes between 4 and 12 hours to stop, presumably waiting on some I/O timeout. The drive can be observed performing some activity during this time (continual retries, perhaps?). If we can't wait 12 hours for the drives to be available again, the whole AIX box has to be reloaded to clear the situation. Drives have been replaced following tape jams, but the replacement drives still exhibit the same problems. We have set the drives up with a cleaning frequency of 1000 GB, so they are being cleaned every now and then.

I would be interested to hear if anyone else has experienced the same problems with these drives, assuming anyone else uses AIT drives. I wonder whether the problem is symptomatic of these drives or whether there are any firmware upgrades that might fix it. How do you find out what version is on the drive?
I would also be very interested in any suggestions for preventing the interminable hangs. Is there anywhere that this timeout can be reduced?

Examples of errors logged:

2004-06-24 15:12:14 ANR8302E I/O error on drive DRIVE0 (/dev/mt0) (OP=READ, Error Number=78, CC=205, KEY=FF, ASC=FF, ASCQ=FF, SENSE=**NONE**, Description=SCSI adapter failure). Refer to Appendix D in the 'Messages' manual for recommended action.

Then, eventually, when the cancel process completes:

2004-06-24 23:17:49 ANR8302E I/O error on drive DRIVE0 (/dev/mt0) (OP=FSR, Error Number=78, CC=205, KEY=FF, ASC=FF, ASCQ=FF, SENSE=**NONE**, Description=SCSI
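
For anyone who wants to count these errors per drive and per operation (for example, to check whether one of the three drives really is error free), the ANR8302E lines quoted above have a regular enough layout to split into fields. The following is only a rough sketch in Python: the function name `parse_anr8302e` is made up for illustration, the sample is the first message above with the mail line-wrap breaks rejoined, and the pattern is inferred from these two quoted messages only, not from the Messages manual.

```python
import re

# First message quoted above, with the mail client's wrap breaks rejoined
# ("7 8" -> "78", "Ref er" -> "Refer"). Used here purely as sample input.
SAMPLE = (
    "2004-06-24 15:12:14 ANR8302E I/O error on drive DRIVE0 (/dev/mt0) "
    "(OP=READ, Error Number=78, CC=205, KEY=FF, ASC=FF, ASCQ=FF, "
    "SENSE=**NONE**, Description=SCSI adapter failure). Refer to Appendix D "
    "in the 'Messages' manual for recommended action."
)

# Layout assumed from the quoted messages: timestamp, message number,
# drive name, device in parentheses, then a parenthesised key=value list.
PATTERN = re.compile(
    r"(?P<timestamp>\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2}) ANR8302E "
    r"I/O error on drive (?P<drive>\S+) \((?P<device>[^)]+)\) "
    r"\((?P<fields>.*?)\)\."
)


def parse_anr8302e(line):
    """Return the message fields as a dict, or None if the line does not match."""
    match = PATTERN.search(line)
    if match is None:
        return None
    record = {
        "timestamp": match["timestamp"],
        "drive": match["drive"],
        "device": match["device"],
    }
    # Split "OP=READ, Error Number=78, ..." into individual key/value pairs.
    for pair in match["fields"].split(", "):
        key, _, value = pair.partition("=")
        record[key.strip()] = value.strip()
    return record


if __name__ == "__main__":
    print(parse_anr8302e(SAMPLE))
```

Feeding every ANR8302E line from the activity log through this and tallying on `drive` and `OP` would show whether the errors cluster on particular drives or on particular operations such as READ versus FSR.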
Re: Need your thoughts on best RESTORE approach
You may want to consider defining a separate storage pool hierarchy for each of your 16 concurrent sessions (or splitting them across 8, or some other number). That way each disk pool would migrate to a different tape pool and thus to a different tape. Up to 16 restore sessions would then be possible in parallel if you have enough tape drives. For the copy pool, it's your choice whether to use a single copy pool or 16 different copy pools (or any number in between). Separate copy pools would allow up to 16 restore sessions to proceed in parallel at the disaster recovery site.

Robert R. Price
ADSM/TSM Administrator
Computer Sciences Corporation
Phone: 412-374-3247 Fax: 412-374-6371
[EMAIL PROTECTED]

Matt Cooper wrote (06/21/04 02:47 PM):
Hello all,

We back up an Oracle ERP that is about 350GB. We back it up by running a script that uses 16 concurrent sessions, each one backing up a series of file systems, with all sessions going to the disk pool. It is compressed by the client. This method works well. We back up the data in less than 2 hrs and it looks like 130GB of compressed data. I know the data will be single threaded through 1 tape drive to tape during migration, because it is all coming from the same server.
The offsite copy is then made by running tapepool to copypool (yes, I know it would be better to make the offsite copy from disk first, but I do not have enough disk for enough time to do that). I know that our 9840 tapes will be getting about 20GB each, so there should be 5 or 6 tapes (at least) containing this server's backup. What would be the optimal way of doing the RESTORE of this data? Ideally I ask TSM to just use 6 drives and all is well, or I know where the break points are on the tapes. Has anyone worked (and tested) this out yet? Our current attempts take about 18-24 hours.

Thanks in advance
Matt
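
To make the trade-off in the reply above concrete (one disk-to-tape hierarchy per session, or per group of sessions, limited by the number of tape drives), here is a small planning sketch in Python. It is illustrative only: the pool names such as `ERP_HIER01` are hypothetical, the round-robin assignment is just one possible way to split the 16 sessions, and the restore-time figure is an idealized best case that assumes the data is spread evenly and the current 18-24 hour restores are effectively single-stream.

```python
def plan_hierarchies(sessions: int, hierarchies: int) -> dict[str, list[int]]:
    """Spread backup sessions round-robin over separate disk->tape hierarchies."""
    if not 1 <= hierarchies <= sessions:
        raise ValueError("need between 1 and `sessions` hierarchies")
    # Hypothetical naming scheme; nothing in TSM requires these names.
    plan: dict[str, list[int]] = {
        f"ERP_HIER{h:02d}": [] for h in range(1, hierarchies + 1)
    }
    names = list(plan)
    for session in range(1, sessions + 1):
        plan[names[(session - 1) % hierarchies]].append(session)
    return plan


def parallel_restore_streams(hierarchies: int, tape_drives: int) -> int:
    """Each hierarchy ends up on its own tape, so drives are the other limit."""
    return min(hierarchies, tape_drives)


def ideal_restore_hours(single_stream_hours: float, streams: int) -> float:
    """Best case only: assumes an even data split and no waiting for mounts."""
    return single_stream_hours / streams


if __name__ == "__main__":
    plan = plan_hierarchies(sessions=16, hierarchies=8)
    streams = parallel_restore_streams(hierarchies=8, tape_drives=6)
    for pool, assigned in plan.items():
        print(pool, assigned)
    print("parallel restore streams:", streams)
    print("ideal restore hours:", ideal_restore_hours(24, streams))
```

With 8 hierarchies and the 6 drives mentioned in the question, the drives are the limit, so going beyond 6 hierarchies would only pay off if more drives were available at restore time.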
Re: 3494 library lifespan??
We have been running our 3494 for seven and a half years now. No sign of wearing out. Just replace broken parts when necessary.

Robert R. Price
ADSM/TSM Administrator
Computer Sciences Corporation
Phone: 412-374-3247 Fax: 412-374-6371
[EMAIL PROTECTED]

Prather, Wanda wrote (12/13/02 01:16 PM):
Does anybody have an idea what is reasonable to expect for the lifespan of a 3494 library? My manager wants to know when we should budget for a new one. But I've never heard of anybody replacing a 3494 because of age - just upgrading to new drives!