Re: strange disk offline error
On Wed, Jul 23, 2003 at 10:57:31AM -0400, Kurt Yoder wrote:
> Looking more closely in /tmp/amanda/sendsize...debug, I see something strange:
> [...]
> sendsize[7578]: time 0.137: Usage: dump [-?CDVcgl -L[v] -a[v] -f[v] -h[dnv] -o[v] -r[dnv] -s[dnv] -t[nv] -T index1[, index2]] file(s)

On Solaris 7:

% dump
Usage: dump [-agcd:fhln:oprstvCDLT:V?] file(s)
...
% which dump
/usr/ccs/bin/dump
% ufsdump
Usage: ufsdump [0123456789fustdWwnDCcbavloS [argument]] filesystem

dump is for displaying ELF files in human-readable format. The backup program we all know and love is called ufsdump. SCO might be similar.

--
Eric Siegerman, Toronto, Ont.

When I came back around from the dark side, there in front of me would be the landing area where the crew was, and the Earth, all in the view of my window. I couldn't help but think that there in front of me was all of humanity, except me.
	- Michael Collins, Apollo 11 Command Module Pilot
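The distinction Eric draws can be checked mechanically from the usage text alone. The following is only a rough sketch: the function name `classify_dump_usage` and the keyword heuristic are assumptions made for illustration, not anything amanda itself does. It relies on the two usage messages quoted above, where the backup program takes a `filesystem` argument and the object-file dumper takes `file(s)`.

```python
def classify_dump_usage(usage_text):
    """Guess which kind of `dump` program printed this usage message.

    Heuristic only (an assumption for this sketch): a backup-style dump
    names a "filesystem" argument, while the object-file dumper quoted
    in the thread names "file(s)".
    """
    lowered = usage_text.lower()
    if "filesystem" in lowered:
        return "backup"
    if "file(s)" in lowered:
        return "object-file"
    return "unknown"


if __name__ == "__main__":
    # Usage line taken from the sendsize debug output on the SCO machine.
    sco_usage = ("Usage: dump [-?CDVcgl -L[v] -a[v] -f[v] -h[dnv] -o[v] "
                 "-r[dnv] -s[dnv] -t[nv] -T index1[, index2]] file(s)")
    print(classify_dump_usage(sco_usage))
```

Fed the SCO usage line from the debug log, the heuristic lands on the object-file side, which matches Eric's reading that /bin/dump there is probably not a backup program.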
Re: strange disk offline error
On Fri, Jul 25, 2003 at 02:37:43PM -0400, Kurt Yoder wrote:
> sendsize[505]: time 0.054: error [cannot find user backup in passwd file]

This message is a bit misleading. It really means: `getpwnam(backup) was unsuccessful'. Perhaps getpwnam() searched /etc/passwd, but perhaps it tried to use NIS or some SCOish mechanism. So there might be other things to look at beyond /etc/passwd and its symlink target.

--
Eric Siegerman, Toronto, Ont.
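One way to test Eric's point is to ask the system's own lookup routine rather than reading /etc/passwd by eye. The sketch below uses Python's standard `pwd` module, which wraps getpwnam(); the helper name `lookup_user` is made up for this example, and whether Python is available on the SCO box is an assumption. It would need to be run as the backup user to reproduce what amanda sees.

```python
import pwd


def lookup_user(name):
    """Return basic account details via getpwnam(), or None if the lookup fails.

    A None result corresponds to the situation behind amanda's
    "cannot find user ... in passwd file" message: the lookup routine
    failed, whatever data source it consulted.
    """
    try:
        entry = pwd.getpwnam(name)
    except KeyError:
        return None
    return {"name": entry.pw_name, "uid": entry.pw_uid, "home": entry.pw_dir}


if __name__ == "__main__":
    for user in ("backup", "root"):
        print(user, "->", lookup_user(user))
```

If this prints None for a user that is visibly present in the file, the problem lies in how the lookup is resolved (permissions, symlink target, or another name service), not in the file's contents.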
Re: strange disk offline error
So it looks like this has something to do with me compiling this on SCO. I've asked my local knowledgeable SCO guy and he gave me an explanation that I don't understand but perhaps one of the coders would. Should I direct this thread to amanda-hackers instead?

Kurt Yoder said:
> More information about this problem:
>
> Looking more closely in /tmp/amanda/sendsize...debug, I see something strange:
>
> sendsize[7578]: time 0.020: calculating for amname '/', dirname '/', spindle -1
> sendsize[7578]: time 0.020: getting size via dump for / level 0
> sendsize[7578]: time 0.020: calculating for device '/' with ''
> sendsize[7578]: time 0.022: running /bin/dump 0Esf 1048576 - /
> sendsize[7578]: time 0.023: running /usr/local/libexec/killpgrp
> sendsize[7578]: time 0.137: Usage: dump [-?CDVcgl -L[v] -a[v] -f[v] -h[dnv] -o[v] -r[dnv] -s[dnv] -t[nv] -T index1[, index2]] file(s)
> ...
> sendsize[7578]: time 0.138: .
> sendsize[7578]: estimate time for / level 0: 0.116
> sendsize[7578]: no size line match in /bin/dump output for /
> sendsize[7578]: .
> sendsize[7578]: estimate size for / level 0: -1 KB
> sendsize[7578]: time 0.138: asking killpgrp to terminate
> sendsize[7578]: time 1.149: done with amname '/', dirname '/', spindle -1
>
> Notice the "Usage:" message on line 6. Perhaps dump is not acting the way amanda expects? Is this my problem? I had trouble getting gnu tar to compile, but maybe I should try again so I can use it instead of dump.

--
Kurt Yoder
Sport Health network administrator
Re: strange disk offline error
So I'm still getting the disk offline error, and seeing this in my /tmp/amanda/sendsize...debug:

sendsize: debug 1 pid 502 ruid 19 euid 19: start at Fri Jul 25 14:24:16 2003
sendsize: version 2.4.4
sendsize[502]: time 0.026: waiting for any estimate child
sendsize[505]: time 0.026: calculating for amname '/', dirname '/', spindle -1
sendsize[505]: time 0.026: getting size via gnutar for / level 0
sendsize[505]: time 0.042: spawning /usr/local/libexec/runtar in pipeline
sendsize[505]: argument list: /usr/local/bin/tar --create --file /dev/null --directory / --one-file-system --listed-incremental /usr/local/var/amanda/gnutar-lists/db.shcorp.com__0.new --sparse --ignore-failed-read --totals --exclude-from /tmp/amanda/sendsize._.20030725142416.exclude .
sendsize[505]: time 0.054: error [cannot find user backup in passwd file]
sendsize[505]: time 0.056:
sendsize[505]: time 0.057: .
sendsize[505]: estimate time for / level 0: 0.015
sendsize[505]: no size line match in /usr/local/bin/tar output for /
sendsize[505]: .
sendsize[505]: estimate size for / level 0: -1 KB
sendsize[505]: time 0.057: waiting for /usr/local/bin/tar / child
sendsize[505]: time 0.160: after /usr/local/bin/tar / wait
sendsize[505]: time 0.160: done with amname '/', dirname '/', spindle -1
sendsize[502]: time 0.160: child 505 terminated normally
sendsize: time 0.160: pid 502 finish time Fri Jul 25 14:24:16 2003

I'm pretty certain my problem has to do with the "cannot find user backup in passwd file" error. But this makes no sense. I can su - backup and then view /etc/passwd. The user backup is definitely in it. Any ideas for what's causing this?

--
Kurt Yoder
Sport Health network administrator
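The two lines that matter in a log like the one above are the `error [...]` line and the `estimate size ... -1 KB` line. As a minimal sketch, assuming the line layout shown in this thread (the function name `scan_debug_log` and the regular expressions are invented here and only cover that layout), they can be pulled out like this:

```python
import re

# Patterns modelled on the sendsize lines quoted in this thread; other
# amanda programs (killpgrp, for instance) format their errors differently.
ERROR_RE = re.compile(r"^\w+\[\d+\]: time [\d.]+: error \[(.*)\]$")
SIZE_RE = re.compile(
    r"^\w+\[\d+\]: estimate size for (\S+) level (\d+): (-?\d+) KB$"
)


def scan_debug_log(lines):
    """Return (errors, failed_estimates) found in sendsize debug lines.

    errors is a list of bracketed error messages; failed_estimates is a
    list of (disk, level) pairs whose estimated size was negative.
    """
    errors = []
    failed = []
    for raw in lines:
        line = raw.strip()
        match = ERROR_RE.match(line)
        if match:
            errors.append(match.group(1))
            continue
        match = SIZE_RE.match(line)
        if match and int(match.group(3)) < 0:
            failed.append((match.group(1), int(match.group(2))))
    return errors, failed


if __name__ == "__main__":
    sample = [
        "sendsize[505]: time 0.054: error [cannot find user backup in passwd file]",
        "sendsize[505]: estimate size for / level 0: -1 KB",
    ]
    print(scan_debug_log(sample))
```

A negative estimate is what the log shows each time the estimate fails, so pairing it with the preceding error message gives the likely cause in one pass.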
Re: strange disk offline error
Joshua Baker-LePain said:
> <snipped>
> Yup, that's definitely the problem. Is amanda picking the right dump for your filesystem? You can tell by the output of ./configure (it might also be in the amandad*debug files -- I'm not sure).
>
> tar is nice for being so cross platform. If you can get that to go, it might be your better option.

OK, I got gnu tar 1.13 compiled and working. However, I still get the disk offline error. Now in /tmp/amanda/sendsize... I see this, which doesn't look good either:

sendsize: debug 1 pid 15490 ruid 19 euid 19: start at Thu Jul 24 00:30:05 2003
sendsize: version 2.4.4
sendsize[15490]: time 0.009: waiting for any estimate child
sendsize[15492]: time 0.009: calculating for amname '/', dirname '/', spindle -1
sendsize[15492]: time 0.015: getting size via gnutar for / level 0
sendsize[15492]: time 0.016: spawning /usr/local/libexec/runtar in pipeline
sendsize[15492]: argument list: /usr/local/bin/tar --create --file /dev/null --directory / --one-file-system --listed-incremental /usr/local/var/amanda/gnutar-lists/db.shcorp.com__0.new --sparse --ignore-failed-read --totals --exclude-from /tmp/amanda/sendsize._.20030724003005.exclude .
sendsize[15492]: time 0.028: error [cannot find user backup in passwd file]
sendsize[15492]: time 0.059:
sendsize[15492]: time 0.060: .
sendsize[15492]: estimate time for / level 0: 0.044
sendsize[15492]: no size line match in /usr/local/bin/tar output for /
sendsize[15492]: .
sendsize[15492]: estimate size for / level 0: -1 KB
sendsize[15492]: time 0.061: waiting for /usr/local/bin/tar / child
sendsize[15492]: time 0.061: after /usr/local/bin/tar / wait
sendsize[15492]: time 0.061: done with amname '/', dirname '/', spindle -1
sendsize[15490]: time 0.061: child 15492 terminated normally

Again I see "cannot find user backup in passwd file".

In looking at /etc, I noticed the passwd file is actually a symlink to a passwd file in another directory (For anyone who has SCO, why in GOD'S NAME do they do this symlink BS for everything? It is horrible!). Is this the only problem that's stopping me here, or is there anything else that needs fixing?

--
Kurt Yoder
Sport Health network administrator
Re: strange disk offline error
On Thu, 24 Jul 2003 at 10:26am, Kurt Yoder wrote

> > tar is nice for being so cross platform. If you can get that to go, it might be your better option.
>
> OK, I got gnu tar 1.13 compiled and working. However, I still get the disk offline error. Now in /tmp/amanda/sendsize... I see this which doesn't look good either:

tar 1.13 is Bad. Grab 1.13.25 from alpha.gnu.org.

> sendsize[15492]: time 0.028: error [cannot find user backup in passwd file]
> sendsize[15492]: time 0.059:
> sendsize[15492]: time 0.060: .
> sendsize[15492]: estimate time for / level 0: 0.044
> sendsize[15492]: no size line match in /usr/local/bin/tar output for /
>
> Again I see cannot find user backup in passwd file. In looking at /etc, I noticed the passwd file is actually a symlink to a passwd file in another directory (For anyone who has SCO, why in GOD'S NAME do they do this symlink BS for everything? It is horrible!). Is this the only problem that's stopping me here, or is there anything else that needs fixing?

I'm guessing that if you fix access to /etc/passwd, you'll be a lot closer to making this work. What are the owner and permissions on what /etc/passwd points to (and yes, that is brain dead)?

--
Joshua Baker-LePain
Department of Biomedical Engineering
Duke University
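Joshua's question about the owner and permissions of the symlink target can be answered in one step. This is a hedged sketch using only Python's standard `os` and `stat` modules; the function name `describe_passwd_target` is hypothetical, and it should be run as the backup user so that the `readable` field reflects what amanda would experience.

```python
import os
import stat


def describe_passwd_target(path="/etc/passwd"):
    """Resolve any symlinks in path and report who owns the real file.

    Returns the resolved path, its permission string, numeric owner and
    group, and whether the calling user can read it.
    """
    real = os.path.realpath(path)
    info = os.stat(real)
    return {
        "is_symlink": os.path.islink(path),
        "target": real,
        "mode": stat.filemode(info.st_mode),
        "uid": info.st_uid,
        "gid": info.st_gid,
        "readable": os.access(real, os.R_OK),
    }


if __name__ == "__main__":
    for key, value in describe_passwd_target().items():
        print(f"{key}: {value}")
```

Note that this only inspects the final file. A directory along the resolved path that the backup user cannot search would also block access, and that would show up here as an error from os.stat rather than as a field in the result.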
Re: strange disk offline error
On Thu, Jul 24, 2003 at 10:26:30AM -0400, Kurt Yoder wrote:
> Again I see cannot find user backup in passwd file. In looking at /etc, I noticed the passwd file is actually a symlink to a passwd file in another directory (For anyone who has SCO, why in GOD'S NAME do they do this symlink BS for everything? It is horrible!). Is this the only problem that's stopping me here, or is there anything else that needs fixing?

Not sure about your passwd problem, but this earlier line needs work.

> OK, I got gnu tar 1.13 compiled and working. However, I still get

1.13 is a known BAD version of gnutar for amanda. Get 1.13.19 or 1.13.25. Sorry, I forget the URL, but it is something like ftp://alpha.gnu.org. Check the archives; it is mentioned regularly.

--
Jon H. LaBadie
JG Computing
4455 Province Line Road        (609) 252-0159
Princeton, NJ 08540-4322       (609) 683-7220 (fax)
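Since three replies in this thread flag the same tar version, a quick version check may save a round trip. The sketch below is an illustration only: the helper names and the `KNOWN_BAD` set are assumptions, with the set holding just the one version (1.13) that the thread calls broken. It parses text of the form GNU tar prints for `tar --version`, assuming the version number appears on the first line.

```python
import re

# Only the version this thread identifies as broken for amanda.
KNOWN_BAD = {(1, 13)}


def parse_tar_version(version_output):
    """Extract the dotted version number from the first line of the text.

    Returns a tuple of ints such as (1, 13, 25), or None if no dotted
    version number is found.
    """
    first_line = version_output.splitlines()[0] if version_output else ""
    match = re.search(r"(\d+(?:\.\d+)+)", first_line)
    if not match:
        return None
    return tuple(int(part) for part in match.group(1).split("."))


def is_known_bad(version):
    """True if the parsed version is one the thread warns against."""
    return version in KNOWN_BAD


if __name__ == "__main__":
    for text in ("tar (GNU tar) 1.13", "tar (GNU tar) 1.13.25"):
        version = parse_tar_version(text)
        print(text, "->", version, "bad" if is_known_bad(version) else "ok")
```

Matching on the exact tuple keeps 1.13.19 and 1.13.25, the versions recommended above, out of the bad set even though they share the 1.13 prefix.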
Re: strange disk offline error
On Thursday 24 July 2003 10:26, Kurt Yoder wrote:
> Joshua Baker-LePain said:
> > <snipped>
> > Yup, that's definitely the problem. Is amanda picking the right dump for your filesystem? You can tell by the output of ./configure (it might also be in the amandad*debug files -- I'm not sure).
> >
> > tar is nice for being so cross platform. If you can get that to go, it might be your better option.
>
> OK, I got gnu tar 1.13 compiled and working.

I certainly hope that version above is a typo. 1.13 is broken; you need 1.13.25, which you can get from alpha.gnu.org.

> However, I still get the disk offline error. Now in /tmp/amanda/sendsize... I see this which doesn't look good either:
>
> sendsize: debug 1 pid 15490 ruid 19 euid 19: start at Thu Jul 24 00:30:05 2003
> sendsize: version 2.4.4
> sendsize[15490]: time 0.009: waiting for any estimate child
> sendsize[15492]: time 0.009: calculating for amname '/', dirname '/', spindle -1
> sendsize[15492]: time 0.015: getting size via gnutar for / level 0
> sendsize[15492]: time 0.016: spawning /usr/local/libexec/runtar in pipeline
> sendsize[15492]: argument list: /usr/local/bin/tar --create --file /dev/null --directory / --one-file-system --listed-incremental /usr/local/var/amanda/gnutar-lists/db.shcorp.com__0.new --sparse --ignore-failed-read --totals --exclude-from /tmp/amanda/sendsize._.20030724003005.exclude .
> sendsize[15492]: time 0.028: error [cannot find user backup in passwd file]
> sendsize[15492]: time 0.059:
> sendsize[15492]: time 0.060: .
> sendsize[15492]: estimate time for / level 0: 0.044
> sendsize[15492]: no size line match in /usr/local/bin/tar output for /
> sendsize[15492]: .
> sendsize[15492]: estimate size for / level 0: -1 KB
> sendsize[15492]: time 0.061: waiting for /usr/local/bin/tar / child
> sendsize[15492]: time 0.061: after /usr/local/bin/tar / wait
> sendsize[15492]: time 0.061: done with amname '/', dirname '/', spindle -1
> sendsize[15490]: time 0.061: child 15492 terminated normally
>
> Again I see cannot find user backup in passwd file.
>
> In looking at /etc, I noticed the passwd file is actually a symlink to a passwd file in another directory (For anyone who has SCO, why in GOD'S NAME do they do this symlink BS for everything? It is horrible!). Is this the only problem that's stopping me here, or is there anything else that needs fixing?

--
Cheers, Gene
99.27% setiathome rank, not too shabby for a WV hillbilly
Yahoo.com attorneys please note, additions to this message by Gene Heskett are:
Copyright 2003 by Maurice Eugene Heskett, all rights reserved.
Re: strange disk offline error
More information about this problem:

Looking more closely in /tmp/amanda/sendsize...debug, I see something strange:

sendsize[7578]: time 0.020: calculating for amname '/', dirname '/', spindle -1
sendsize[7578]: time 0.020: getting size via dump for / level 0
sendsize[7578]: time 0.020: calculating for device '/' with ''
sendsize[7578]: time 0.022: running /bin/dump 0Esf 1048576 - /
sendsize[7578]: time 0.023: running /usr/local/libexec/killpgrp
sendsize[7578]: time 0.137: Usage: dump [-?CDVcgl -L[v] -a[v] -f[v] -h[dnv] -o[v] -r[dnv] -s[dnv] -t[nv] -T index1[, index2]] file(s)
...
sendsize[7578]: time 0.138: .
sendsize[7578]: estimate time for / level 0: 0.116
sendsize[7578]: no size line match in /bin/dump output for /
sendsize[7578]: .
sendsize[7578]: estimate size for / level 0: -1 KB
sendsize[7578]: time 0.138: asking killpgrp to terminate
sendsize[7578]: time 1.149: done with amname '/', dirname '/', spindle -1

Notice the "Usage:" message on line 6. Perhaps dump is not acting the way amanda expects? Is this my problem? I had trouble getting gnu tar to compile, but maybe I should try again so I can use it instead of dump.

--
Kurt Yoder
Sport Health network administrator
Re: strange disk offline error
Kurt Yoder wrote:
> Weird errors here. The root user is definitely in the passwd file. Could this be part of the problem? Thanks for any ideas on fixing this...

Maybe a silly question, but can your amanda backup user read the passwd file on that system?

Marc
Re: strange disk offline error
On Wed, 23 Jul 2003 at 10:57am, Kurt Yoder wrote

> sendsize[7578]: time 0.020: calculating for amname '/', dirname '/', spindle -1
> sendsize[7578]: time 0.020: getting size via dump for / level 0
> sendsize[7578]: time 0.020: calculating for device '/' with ''
> sendsize[7578]: time 0.022: running /bin/dump 0Esf 1048576 - /
> sendsize[7578]: time 0.023: running /usr/local/libexec/killpgrp
> sendsize[7578]: time 0.137: Usage: dump [-?CDVcgl -L[v] -a[v] -f[v] -h[dnv] -o[v] -r[dnv] -s[dnv] -t[nv] -T index1[, index2]] file(s)
> ..

*snip*

> Notice line 6 Usage: message. Perhaps dump is not acting the way amanda expects? Is this my problem? I had trouble getting gnu tar to compile, but maybe I should try again so I can use it instead of dump.

Yup, that's definitely the problem. Is amanda picking the right dump for your filesystem? You can tell by the output of ./configure (it might also be in the amandad*debug files -- I'm not sure).

tar is nice for being so cross platform. If you can get that to go, it might be your better option.

--
Joshua Baker-LePain
Department of Biomedical Engineering
Duke University
strange disk offline error
(That previous message should have had this subject; sorry, mail client troubles)

Hello list

I just compiled amanda on a SCO Unix machine (uname -a shows SCO_SV shcorp 3.2 5.0.6 i386) and tried to follow instructions to install it, instructing amanda to back up both of its disks. Everything appears successful, and the machine passes amcheck tests. When I run amdump at night, my other linux, freebsd, and windows machines dump successfully. However, on my SCO machine, I get the message:

shcorp.shc /stand lev FAILED [disk /stand offline on shcorp.shcorp.com?]
shcorp.shc / lev FAILED [disk / offline on shcorp.shcorp.com?]

I've looked in google, and found the following suggestions:

(from faq-o-matic, http://amanda.sourceforge.net/fom-serve/cache/10.html)

is disk really offline?
    Answer appears to be no. After all, I'm using this machine throughout the day. So I'd assume it should be available for backup, since no-one touches the machine at night.

filesystem error?
    Well, I suppose there *could* be. But the fact that it happens on both disks seems to indicate that this is not the problem. (I also installed the same compiled version on a separate sco machine, and it does the exact same thing).

filesystem too large?
    Does not seem to be. /stand is only 15 megabytes, but still fails.

conflicting user name?
    Doesn't seem to be it. I configured with user backup. This only shows up once in the passwd file, and this box does not have any external sources for authentication (no nis, ldap, etc)

don't have dump installed?
    This isn't it. I compiled by hand, and the config.log shows that amanda found the dump program. I suppose it could conceivably be something that amanda doesn't like about SCO's dump program though. How can I check if this might be the problem?

(from an archived post: http://groups.yahoo.com/group/amanda-users/message/40200)

permissions on /etc/fstab ok?
    On SCO, the file seems to be /etc/mnttab. It is unix mode 644, so this shouldn't be a problem.

So, I looked at the logs in /tmp/amanda. For the last failed dump, I see these logs:

-rw--- 1 root backup 231 Jul 22 00:30 killpgrp.20030722043007.debug
-rw--- 1 root backup 231 Jul 22 00:30 killpgrp.20030722043009.debug
-rw--- 1 root sys 2108 Jul 22 00:30 sendsize.20030722003007.debug
-rw--- 1 root sys 2275 Jul 22 00:30 amandad.20030722003005.debug

(strange that these are owned by root instead of backup; is this a problem?)

sendsize ends with

sendsize[1383]: time 2.300: child 1388 terminated normally
sendsize: time 2.300: pid 1383 finish time Tue Jul 22 00:30:10 2003

Looks ok to me

amandad ends with

amandad: time 4.281: got packet: Amanda 2.4 ACK HANDLE 00E-00A00608 SEQ 1058848217
amandad: time 4.281: pid 1382 finish time Tue Jul 22 00:30:10 2003

seems fine, or?

first killpgrp

killpgrp: debug 1 pid 1386 ruid 19 euid 0: start at Tue Jul 22 04:30:07 2003
/usr/local/libexec/killpgrp: version 2.4.4
killpgrp: error [cannot find user root in passwd file]
killpgrp: pid 1386 finish time Tue Jul 22 04:30:07 2003

second killpgrp

killpgrp: debug 1 pid 1389 ruid 19 euid 0: start at Tue Jul 22 04:30:09 2003
/usr/local/libexec/killpgrp: version 2.4.4
killpgrp: error [cannot find user root in passwd file]
killpgrp: pid 1389 finish time Tue Jul 22 04:30:09 2003

Weird errors here. The root user is definitely in the passwd file. Could this be part of the problem?

Thanks for any ideas on fixing this...

--
Kurt Yoder
Sport Health network administrator