Re: [Veritas-bu] Getting a list of files from a failed backup
Hi Jeff. For the 41 issue for a specific client, there a few items you want to check: 1) With that many files, you will want to increase the CLIENT_READ_TIMEOUT value with the bp.conf file residing on the client. By default, it is 300 and you may want to try 900, etc. 2) On the client, perform the following steps: - Add VERBOSE = 5 to the /usr/openv/netbackup/bp.conf file. - Create the bpbkar directory within /usr/openv/netbackup/logs (on the client). - Create a touch file called bpbkar_path_tr within the /usr/openv/netbackup directory (on the client). - Run job twice. Check the bpbkar log file that is created. Does the backup job stop on the same file during both runs??? If so, add it to an exclude list. If not, you do not have a file or filesystem issue. 3) If 1 or 2 doesn't fix issue, check netstat -I (on client). Need to make sure Ierrs and/or Oerrs are not increasing during the backup job. If they are increasing, you have a client NIC configuration issue. Any disconnect of the backup stream (from client to media server) will cause the 41 error. Hope this info assists you. Mitch -Original Message- From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] On Behalf Of [EMAIL PROTECTED] Sent: Friday, January 04, 2008 12:00 PM To: veritas-bu@mailman.eng.auburn.edu Subject: Veritas-bu Digest, Vol 21, Issue 5 Send Veritas-bu mailing list submissions to veritas-bu@mailman.eng.auburn.edu To subscribe or unsubscribe via the World Wide Web, visit http://mailman.eng.auburn.edu/mailman/listinfo/veritas-bu or, via email, send a message with subject or body 'help' to [EMAIL PROTECTED] You can reach the person managing the list at [EMAIL PROTECTED] When replying, please edit your Subject line so it is more specific than Re: Contents of Veritas-bu digest... Today's Topics: 1. Samba share as Disk Storage Unit? (Yang Xiao) 2. ExaGrid D2D w/ NBU 5x or 6.5 (Roemmele, Scott) 3. Getting a list of files from a failed backup (Jeff Cleverley) -- Message: 1 Date: Fri, 4 Jan 2008 10:33:59 -0500 From: Yang Xiao [EMAIL PROTECTED] Subject: [Veritas-bu] Samba share as Disk Storage Unit? To: veritas-bu@mailman.eng.auburn.edu Message-ID: [EMAIL PROTECTED] Content-Type: text/plain; charset=iso-8859-1 Hi Guys, I'm running NB 6 MP5 on Win 2003, have been trying to configure a samba share as a disk storage unit and it sees the storage fine, just whenever I try to do a backup to it, I get invalid STS storage and failed to mount. I've tried both using the UNC path and a mapped drive letter when defining the DSU, but that didn't make a difference. HELP! Thanks, - Yang -- next part -- An HTML attachment was scrubbed... URL: http://mailman.eng.auburn.edu/pipermail/veritas-bu/attachments/20080104/05528663/attachment.html -- Message: 2 Date: Fri, 04 Jan 2008 10:41:06 -0500 From: Roemmele, Scott [EMAIL PROTECTED] Subject: [Veritas-bu] ExaGrid D2D w/ NBU 5x or 6.5 To: Veritas-bu@mailman.eng.auburn.edu Message-ID: [EMAIL PROTECTED] Content-Type: text/plain; charset=US-ASCII Anybody out there using D2D from ExaGrid along with NBU? How has your experience been? Speed to Backup/Restore? Using Replication between sites? What kind of compression are you seeing? Any info regarding their management scalability? Any information would be greatly appreciated. Thanks, Scott -- Message: 3 Date: Fri, 04 Jan 2008 10:54:29 -0700 From: Jeff Cleverley [EMAIL PROTECTED] Subject: [Veritas-bu] Getting a list of files from a failed backup To: veritas-bu@mailman.eng.auburn.edu Message-ID: [EMAIL PROTECTED] Content-Type: text/plain; charset=ISO-8859-1; format=flowed Greetings, I have a file system that has started to fail full backups on a regular basis. The incremental backups run fine. This file system is on a hpux 11.11 server, There is only 228 gig of used data, but it does have ~5.5m inodes in use. The backup fails with a status 41. This server has other file systems on it with more used inodes and more used space, and they don't have this problem. I've run the backup at various times when I know the network and file system load is low, but it doesn't change. The current backup environment is 5.1 mp2. It was supposed to be retired by now which is why it hasn't been upgraded. The reason I want to see what files it is backing up before it fails is that all of the backups seem to backup just over 195 gig before they fail. The file counts vary quite a bit. I'm expecting that I may have a very poorly laid out structure with hundreds of thousands of files in one directory that is causing this problem. This may also be caused by some timeout if it does hit this, but I haven't been able to find it in any of the log files. I was able to recovery an image from the tape listing by running
[Veritas-bu] Number of restore tape drives
Hi. We have a Netbackup 5.1 MP6 master/media server with 8 tape drives. We run a large standard backup (1.5 TB data size w/multistreaming and no multiplexing) that uses all 8 of the drives during the backup. The backups run successfully and we have no problem in this area. We see an issue when we attempt to perform a full restore of the above mentioned data. When the restore process occurs, all 8 drives are populated and the restore starts and appears to be laying data down on the client. Because all 8 of the drives are in use, all other backup jobs sit in the queue and cannot run. Is there anyway to limit the number of drives the restore will use during the restore process? Is there a touch file or a configuration we could put in place to accomplish this? Thanks for the help. ___ Veritas-bu maillist - Veritas-bu@mailman.eng.auburn.edu http://mailman.eng.auburn.edu/mailman/listinfo/veritas-bu
[Veritas-bu] Flashbackup concerns
Hi. We have a few Windows file servers using NBU 5.1 MP4 which we are currently running standard backups on. The amount of data and number of files on these servers has increased significantly within the last year. It is now taking NBU almost 3 days to completed a full backup on one of these servers. We have been testing the Advanced Client feature Flashbackup and we are seeing significant performance increases (3 days -- 10 hours). We would like to incorporate this feature into our environment, but wanted to see if anyone has had any issues/problems with this feature. Any info would be appreciated. Thanks. ___ Veritas-bu maillist - Veritas-bu@mailman.eng.auburn.edu http://mailman.eng.auburn.edu/mailman/listinfo/veritas-bu
[Veritas-bu] FlashBackup restore
Hi. We are testing the NB 5.1 Flashbackup Advanced Client feature and it is working well. When attempting to perform a restore of the Flashbackup data the files within the java gui window have a red circle with a slash icon next to each file name. I was thinking that this icon might mean read-only file, but am not sure. Does anyone know what it means??? Thanks. ___ Veritas-bu maillist - Veritas-bu@mailman.eng.auburn.edu http://mailman.eng.auburn.edu/mailman/listinfo/veritas-bu
[Veritas-bu] Fixed or Variable length blocks
Hi. I am working on a project to duplicate several legacy tapes that were written using a AIT-2 (SONY SDX-500C) tape drive to a different media format. I am getting read errors when attempting to perform the duplication on almost ever backup image. I have adjusted the st.conf (Solaris media server) file to the suggested values, but it has not helped. The AIT-2 drives are running the 0200 firmware, so this shouldn't be a dip switch setting issue. Does anyone know of a way to tell if the tapes may have been written in fixed-length blocks instead of variable-length blocks??? Thanks!!! ___ Veritas-bu maillist - Veritas-bu@mailman.eng.auburn.edu http://mailman.eng.auburn.edu/mailman/listinfo/veritas-bu
[Veritas-bu] Netbackup 5.1 and checkpoints
We are using Netbackup 5.1 MP4 and have enabled checkpoints for our Standard (Unix) and Windows-based backups. The checkpoints are occurring every 30 minutes I need to know if a backup job fails during the 1st attempt (example: status 84) will the 2nd attempt (retry) continue from the 1st attempt last checkpoint or start all over? Thanks for the help. Mitch ___ Veritas-bu maillist - Veritas-bu@mailman.eng.auburn.edu http://mailman.eng.auburn.edu/mailman/listinfo/veritas-bu
[Veritas-bu] RE: Netbackup monitoring
Question 6: Does anyone use external monitoring tools for netbackup? We have many custom scripts created to perform some monitoring tasks but I have been tasked with looking for something that can automate allot of the monitoring without having to write a customer script for everything. Management also wants our NOC people to take over some of the monitoring for us to allow us to get to the real work instead of worry about upping drives or restarting failed jobs. Any ideas? --- We use Nagios (formerly NetSaint) w/ Netbackup plugins. Monitors: - NBU Drive status - NBU Daemon status - NBU Queue size - When last backup occurred. (i.e. Jobs hung) - Basic core system resources (disk space, cpu load, memory, etc.) Can write shell or perl plugin for anything you want and price is right (free/Open Source) http://www.nagios.org I hope this helps. Mitch ___ Veritas-bu maillist - Veritas-bu@mailman.eng.auburn.edu http://mailman.eng.auburn.edu/mailman/listinfo/veritas-bu
[Veritas-bu] Netbackup 5.1 MP4 Vault eject failures
Hi. We have a Vault duplication job that duplicates Virtual tapes (using a EMC CDL) onto standard tapes (using 9940B drives/L700 Tape Lib). Within the Vault Profile, we have 00_009_TLD as the source Volume Group. This volume group is used by our EMC CDL (Virtual Tape library). The destination tape library we are duplicating to has a Volume Group of 00_002_TLD. This volume group is used by our STK L700 tape library and has not been entered into the Vault profile at all. When we run the Vault duplication everything is working correctly with the exception of the eject process. The tapes that are suppose to be getting ejected are not and we are seeing the following in the detail.log of the Vault session: 13:40:53.646 [8218] GenMediaEjectList(): media M01212 selected but not in robot group '00_009_TLD', skipping eject 13:40:53.646 [8218] GenMediaEjectList(): media M01443 selected but not in robot group '00_009_TLD', skipping eject 13:40:53.646 [8218] GenMediaEjectList(): media M01657 selected but not in robot group '00_009_TLD', skipping eject 13:40:53.646 [8218] GenMediaEjectList(): media M02162 selected but not in robot group '00_009_TLD', skipping eject We think the issue has something to do with the source Volume Group of 00_009_TLD mentioned above, but are not sure. The first tape, M01212, is in the 00_002_TLD volume group and we do not know why it cares that it is not in the 00_009_TLD group. Any help would be appreciated. Thanks. Mitch ___ Veritas-bu maillist - Veritas-bu@mailman.eng.auburn.edu http://mailman.eng.auburn.edu/mailman/listinfo/veritas-bu
[Veritas-bu] RE: ] list all client version
I have been using the bpgetconfig command to query our client version. We are in the mist of upgrading to version 5.1. The query shows that the clients are 5.1, How do I verify that mp4 has been applied ? --- As Wayne had stated, the only way to find out the client version is to check the /usr/openv/netbackup/bin/version file on the client. When you are looking at Client Properties with the java gui/admin console or using the equivalent command line arguments (bpgetconfig), you are only seeing the bpcd daemon version. The bpcd daemon (listening on port 13782) has the Netbackup version build into the source code and you will only get the GA version from it. Hope this helps. ___ Veritas-bu maillist - Veritas-bu@mailman.eng.auburn.edu http://mailman.eng.auburn.edu/mailman/listinfo/veritas-bu
[Veritas-bu] RE: Can't kill an Active job..
Hi Aaron. To remove the active job form the job monitor, you will want to do the following: 1) Shutdown Netbackup 2) Move the /usr/openv/netbackup/db/jobs/bpjobd.act.db file to bpjobd.act.db.date (directory listing shown below). DO NOT move the bpjobd.db file. 3) Restart Netbackup. The bpjobd.act.db will rebuild itself. The above steps were for Unix, but the equivalent can be performed in Windows. Hope this helps. Directory Listing = # pwd /usr/openv/netbackup/db/jobs # ls -la total 14784 drwxrwxrwx 6 root sys 8192 Dec 8 07:03 . drwxr-xr-x 14 root bin 8192 Nov 15 2004 .. -rw--- 1 root root863052 Dec 8 07:03 bpjobd.act.db -rw--- 1 root sys6298520 Dec 8 07:03 bpjobd.db drw-r-xr-x 2 root sys 96 Oct 31 2003 done drwxr-xr-x 3 root sys 90112 Dec 8 07:03 ffilelogs -rw--- 1 root sys 7 Dec 8 07:03 jobid -rw--- 1 root sys 16 Dec 8 07:03 jobid.lock drw-r-xr-x 2 root sys 180224 Dec 8 07:03 restart drwxr-xr-x 3 root sys 81920 Dec 8 07:03 trylogs -Original Message- From: Aaron Mills [mailto:[EMAIL PROTECTED] Sent: 07 December 2005 20:58 To: veritas-bu@mailman.eng.auburn.edu Subject: [Veritas-bu] Can't kill an Active job... I have several jobs that are stuck in active mode but not doing anything. JobID Type State Status PolicySchedule Client Dest Media Svr Active PID 9716 Backup Activeinbound ftpif foo.com foo.com 5376 9018 Backup Activeinbound ftpif foo.com foo.com 19580 9845 Backup Queued DBArchive Oracle-Policy bar.com 9844 Backup Queued DBArchive Oracle-Policy bar.com I tried to kill the job with: bpdbjobs -cancel 9716 to no avail. The FAQ-O-Matic says I need to restart NBU and manually delete the db files? Is this accurate or is there an easier way to clean these buggers up? Thanks. Aaron Mills System Administrator Return Path, Inc. 303.642.4111 [EMAIL PROTECTED] http://www.returnpath.biz http://www.returnpath.biz ___ Veritas-bu maillist - Veritas-bu@mailman.eng.auburn.edu http://mailman.eng.auburn.edu/mailman/listinfo/veritas-bu