Re: strange disk offline error

2003-07-29 Thread Eric Siegerman
On Wed, Jul 23, 2003 at 10:57:31AM -0400, Kurt Yoder wrote:
> Looking more closely in /tmp/amanda/sendsize...debug, I see
> something strange:
> [...]
> sendsize[7578]: time 0.137: Usage: dump [-?CDVcgl -L[v] -a[v] -f[v]
> -h[dnv] -o[v] -r[dnv] -s[dnv] -t[nv] -T index1[, index2]] file(s)

On Solaris 7:
% dump
Usage: dump [-agcd:fhln:oprstvCDLT:V?] file(s) ...
% which dump
/usr/ccs/bin/dump
% ufsdump
Usage: ufsdump [0123456789fustdWwnDCcbavloS [argument]] filesystem

dump is for displaying ELF files in human-readable format.  The
backup program we all know and love is called ufsdump.
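
The difference shows up in the usage text itself, so a script can tell the two programs apart. What follows is only a rough Python sketch: the function name and the heuristic (a backup dump names a "filesystem" operand, an object-file dumper names "file(s)") are assumptions drawn from the Solaris output above, not anything Amanda itself does.

```python
def classify_dump_usage(usage: str) -> str:
    """Guess what kind of 'dump' printed this usage text.

    Heuristic only: the backup program's usage (e.g. ufsdump) names a
    'filesystem' operand, while the object-file dumper's names 'file(s)'.
    """
    text = usage.lower()
    if "filesystem" in text:
        return "backup"
    if "file(s)" in text:
        return "object-file"
    return "unknown"


# Usage lines copied from the Solaris 7 session quoted in this message
solaris_dump = "Usage: dump [-agcd:fhln:oprstvCDLT:V?] file(s) ..."
solaris_ufsdump = "Usage: ufsdump [0123456789fustdWwnDCcbavloS [argument]] filesystem"

print(classify_dump_usage(solaris_dump))
print(classify_dump_usage(solaris_ufsdump))
```

The usage text in Kurt's sendsize log also ends in "file(s)", so by this heuristic /bin/dump on his machine would land in the object-file category.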

SCO might be similar.

--

|  | /\
|-_|/ Eric Siegerman, Toronto, Ont.
|  |  /
When I came back around from the dark side, there in front of me would
be the landing area where the crew was, and the Earth, all in the view
of my window. I couldn't help but think that there in front of me was
all of humanity, except me.
- Michael Collins, Apollo 11 Command Module Pilot



Re: strange disk offline error

2003-07-29 Thread Eric Siegerman
On Fri, Jul 25, 2003 at 02:37:43PM -0400, Kurt Yoder wrote:
> sendsize[505]: time 0.054: error [cannot find user backup in passwd file]

This message is a bit misleading.  It really means:
`getpwnam("backup") was unsuccessful'.  Perhaps getpwnam()
searched /etc/passwd, but perhaps it tried to use NIS or some
SCOish mechanism.  So there might be other things to look at
beyond /etc/passwd and its symlink target.
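
To see what getpwnam() returns outside of Amanda, the same lookup can be repeated by hand while logged in as the backup user. A minimal Python sketch, assuming Python is available on the machine; its pwd module goes through the platform's own account database lookup, so it should consult the same sources. The helper name lookup_user is made up for illustration.

```python
import pwd


def lookup_user(name: str):
    """Return the passwd entry for name, or None if the lookup finds nothing."""
    try:
        return pwd.getpwnam(name)
    except KeyError:
        # Raised when the name is not in whatever database the C library consults
        return None


entry = lookup_user("backup")  # "backup" is the Amanda user named in the thread
if entry is None:
    print("getpwnam('backup') found nothing")
else:
    print(f"uid={entry.pw_uid} gid={entry.pw_gid} home={entry.pw_dir}")
```

If this succeeds from a login shell but sendsize still fails, the difference is more likely in the environment or privileges sendsize runs with than in the passwd file itself.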

--

|  | /\
|-_|/ Eric Siegerman, Toronto, Ont.
|  |  /



Re: strange disk offline error

2003-07-28 Thread Kurt Yoder

So it looks like this has something to do with me compiling this on
SCO. I've asked my local knowledgeable SCO guy and he gave me an
explanation that I don't understand but perhaps one of the coders
would. Should I direct this thread to amanda-hackers instead?

Kurt Yoder said:
> More information about this problem:
>
> Looking more closely in /tmp/amanda/sendsize...debug, I see
> something strange:
>
> sendsize[7578]: time 0.020: calculating for amname '/', dirname '/',
> spindle -1
> sendsize[7578]: time 0.020: getting size via dump for / level 0
> sendsize[7578]: time 0.020: calculating for device '/' with ''
> sendsize[7578]: time 0.022: running /bin/dump 0Esf 1048576 - /
> sendsize[7578]: time 0.023: running /usr/local/libexec/killpgrp
> sendsize[7578]: time 0.137: Usage: dump [-?CDVcgl -L[v] -a[v] -f[v]
> -h[dnv] -o[v] -r[dnv] -s[dnv] -t[nv] -T index1[, index2]] file(s)
> ...
> sendsize[7578]: time 0.138: .
> sendsize[7578]: estimate time for / level 0: 0.116
> sendsize[7578]: no size line match in /bin/dump output for /
> sendsize[7578]: .
> sendsize[7578]: estimate size for / level 0: -1 KB
> sendsize[7578]: time 0.138: asking killpgrp to terminate
> sendsize[7578]: time 1.149: done with amname '/', dirname '/',
> spindle -1
>
> Notice line 6 "Usage:" message. Perhaps dump is not acting the way
> amanda expects? Is this my problem? I had trouble getting gnu tar to
> compile, but maybe I should try again so I can use it instead of
> dump.

-- 
Kurt Yoder
Sport & Health network administrator



Re: strange disk offline error

2003-07-25 Thread Kurt Yoder
So I'm still getting the disk offline error, and seeing this in my
/tmp/amanda/sendsize...debug:




sendsize: debug 1 pid 502 ruid 19 euid 19: start at Fri Jul 25
14:24:16 2003
sendsize: version 2.4.4
sendsize[502]: time 0.026: waiting for any estimate child
sendsize[505]: time 0.026: calculating for amname '/', dirname '/',
spindle -1
sendsize[505]: time 0.026: getting size via gnutar for / level 0
sendsize[505]: time 0.042: spawning /usr/local/libexec/runtar in
pipeline
sendsize[505]: argument list: /usr/local/bin/tar --create --file
/dev/null --directory / --one-file-system --listed-incremental
/usr/local/var/amanda/gnutar-lists/db.shcorp.com__0.new --sparse
--ignore-failed-read --totals --exclude-from
/tmp/amanda/sendsize._.20030725142416.exclude .
sendsize[505]: time 0.054: error [cannot find user backup in passwd
file]
sendsize[505]: time 0.056:
sendsize[505]: time 0.057: .
sendsize[505]: estimate time for / level 0: 0.015
sendsize[505]: no size line match in /usr/local/bin/tar output for /
sendsize[505]: .
sendsize[505]: estimate size for / level 0: -1 KB
sendsize[505]: time 0.057: waiting for /usr/local/bin/tar / child
sendsize[505]: time 0.160: after /usr/local/bin/tar / wait
sendsize[505]: time 0.160: done with amname '/', dirname '/',
spindle -1
sendsize[502]: time 0.160: child 505 terminated normally
sendsize: time 0.160: pid 502 finish time Fri Jul 25 14:24:16 2003




I'm pretty certain my problem has to do with the "cannot find user
backup in passwd file" error. But this makes no sense. I can "su - backup"
and then view /etc/passwd. The user backup is definitely in it. Any
ideas for what's causing this?
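
For anyone digging through several of these debug files, a small script can pull out the two things that matter here: the "error [...]" lines and the estimates that came back as -1 KB. This is a rough Python sketch; the patterns are taken only from the excerpts in this thread, and the function name is made up for illustration.

```python
import re

# Patterns taken from the log excerpts in this thread
ERROR_RE = re.compile(r"error \[(.+?)\]")
ESTIMATE_RE = re.compile(r"estimate size for (\S+) level (\d+): (-?\d+) KB")


def scan_sendsize_log(text: str):
    """Return (errors, failed_estimates) found in a sendsize debug log."""
    # Re-join lines that a mail client wrapped: a new record starts with "sendsize"
    records = []
    for line in text.splitlines():
        if line.startswith("sendsize") or not records:
            records.append(line.strip())
        else:
            records[-1] += " " + line.strip()
    errors, failed = [], []
    for rec in records:
        m = ERROR_RE.search(rec)
        if m:
            errors.append(m.group(1))
        m = ESTIMATE_RE.search(rec)
        if m and int(m.group(3)) < 0:
            # A negative size means sendsize could not get an estimate
            failed.append((m.group(1), int(m.group(2))))
    return errors, failed


sample = """\
sendsize[505]: time 0.054: error [cannot find user backup in passwd
file]
sendsize[505]: estimate time for / level 0: 0.015
sendsize[505]: estimate size for / level 0: -1 KB
"""
errors, failed = scan_sendsize_log(sample)
print(errors)
print(failed)
```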

-- 
Kurt Yoder
Sport & Health network administrator



Re: strange disk offline error

2003-07-24 Thread Kurt Yoder

Joshua Baker-LePain said:

<snipped>

> Yup, that's definitely the problem.  Is amanda picking the right dump for
> your filesystem?  You can tell by the output of ./configure (it might also
> be in the amandad*debug files -- I'm not sure).
>
> tar is nice for being so cross platform.  If you can get that to go, it
> might be your better option.

OK, I got gnu tar 1.13 compiled and working. However, I still get
the disk offline error. Now in /tmp/amanda/sendsize... I see this
which doesn't look good either:

sendsize: debug 1 pid 15490 ruid 19 euid 19: start at Thu Jul 24
00:30:05 2003
sendsize: version 2.4.4
sendsize[15490]: time 0.009: waiting for any estimate child
sendsize[15492]: time 0.009: calculating for amname '/', dirname
'/', spindle -1
sendsize[15492]: time 0.015: getting size via gnutar
for / level 0
sendsize[15492]: time 0.016: spawning /usr/local/libexec/runtar in
pipeline
sendsize[15492]: argument list: /usr/local/bin/tar --create --file
/dev/null --directory / --one-file-system --listed-incremental
/usr/local/var/amanda/gnutar-lists/db.shcorp.com__0.new --sparse
--ignore-failed-read --totals --exclude-from
/tmp/amanda/sendsize._.20030724003005.exclude .
sendsize[15492]: time 0.028: error [cannot find user backup in
passwd file]
sendsize[15492]: time 0.059:
sendsize[15492]: time 0.060: .
sendsize[15492]: estimate time for / level 0: 0.044
sendsize[15492]: no size line match in /usr/local/bin/tar output for
/
sendsize[15492]: .
sendsize[15492]: estimate size for / level 0: -1 KB
sendsize[15492]: time 0.061: waiting for /usr/local/bin/tar / child
sendsize[15492]: time 0.061: after /usr/local/bin/tar / wait
sendsize[15492]: time 0.061: done with amname '/', dirname '/',
spindle -1
sendsize[15490]: time 0.061: child 15492 terminated normally


Again I see "cannot find user backup in passwd file". In looking at
/etc, I noticed the passwd file is actually a symlink to a passwd
file in another directory (For anyone who has SCO, why in GOD'S NAME
do they do this symlink BS for everything? It is horrible!). Is this
the only problem that's stopping me here, or is there anything else
that needs fixing?

-- 
Kurt Yoder
Sport & Health network administrator


Re: strange disk offline error

2003-07-24 Thread Joshua Baker-LePain
On Thu, 24 Jul 2003 at 10:26am, Kurt Yoder wrote

> > tar is nice for being so cross platform.  If you can get that to go, it
> > might be your better option.
>
> OK, I got gnu tar 1.13 compiled and working. However, I still get
> the disk offline error. Now in /tmp/amanda/sendsize... I see this
> which doesn't look good either:

tar 1.13 is Bad.  Grab 1.13.25 from alpha.gnu.org.

> sendsize[15492]: time 0.028: error [cannot find user backup in passwd file]
> sendsize[15492]: time 0.059:
> sendsize[15492]: time 0.060: .
> sendsize[15492]: estimate time for / level 0: 0.044
> sendsize[15492]: no size line match in /usr/local/bin/tar output for /
>
> Again I see "cannot find user backup in passwd file". In looking at
> /etc, I noticed the passwd file is actually a symlink to a passwd
> file in another directory (For anyone who has SCO, why in GOD'S NAME
> do they do this symlink BS for everything? It is horrible!). Is this
> the only problem that's stopping me here, or is there anything else
> that needs fixing?

I'm guessing that if you fix access to /etc/passwd, you'll be a lot closer 
to making this work.  What are the owner and permissions on what 
/etc/passwd points to (and yes, that is brain dead)?
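
One way to answer that question without chasing the link by hand is sketched below in Python. It resolves the symlink chain and reports the owner and mode of the file at the end of it. The function name is hypothetical, and nothing here is specific to SCO or to Amanda.

```python
import os
import stat


def describe_target(path: str) -> dict:
    """Follow symlinks from path and report the final file's owner and mode."""
    real = os.path.realpath(path)   # resolves every symlink in the chain
    st = os.stat(real)
    return {
        "path": path,
        "is_symlink": os.path.islink(path),
        "target": real,
        "uid": st.st_uid,
        "gid": st.st_gid,
        "mode": stat.filemode(st.st_mode),  # permission string in ls -l style
    }


if os.path.exists("/etc/passwd"):
    print(describe_target("/etc/passwd"))
```

Run it as the backup user: if a directory along the resolved path is not searchable by that user, the os.stat() call itself will raise a permission error, which is also useful information.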

-- 
Joshua Baker-LePain
Department of Biomedical Engineering
Duke University



Re: strange disk offline error

2003-07-24 Thread Jon LaBadie
On Thu, Jul 24, 2003 at 10:26:30AM -0400, Kurt Yoder wrote:
 
> Again I see "cannot find user backup in passwd file". In looking at
> /etc, I noticed the passwd file is actually a symlink to a passwd
> file in another directory (For anyone who has SCO, why in GOD'S NAME
> do they do this symlink BS for everything? It is horrible!). Is this
> the only problem that's stopping me here, or is there anything else
> that needs fixing?

Not sure about your passwd problem, but this earlier line needs work.

> OK, I got gnu tar 1.13 compiled and working. However, I still get

1.13 is a known BAD version of gnutar for amanda.

Get 1.13.19 or 1.13.25.
Sorry I forget the url, but it is something like ftp://alpha.gnu.org.
Check the archives, it is mentioned regularly.
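
A quick check for the bad release could look like the Python sketch below. It assumes that the first line of "tar --version" has the form "tar (GNU tar) 1.13.25"; that format, the sample strings, and the function names are assumptions for illustration, and the tar path is the one shown in the sendsize logs.

```python
import re
import subprocess


def parse_tar_version(first_line: str):
    """Extract the version as a tuple of ints, e.g. (1, 13, 25); None if absent."""
    m = re.search(r"(\d+(?:\.\d+)+)", first_line)
    return tuple(int(p) for p in m.group(1).split(".")) if m else None


def is_known_bad(version) -> bool:
    """Plain 1.13 is the release this thread warns against."""
    return version == (1, 13)


def installed_tar_version(tar="/usr/local/bin/tar"):
    """Run 'tar --version' and parse its first line (path taken from the logs)."""
    out = subprocess.run([tar, "--version"], capture_output=True, text=True).stdout
    lines = out.splitlines()
    return parse_tar_version(lines[0]) if lines else None


# Sample version lines (assumed format, not copied from a real run)
print(is_known_bad(parse_tar_version("tar (GNU tar) 1.13")))
print(is_known_bad(parse_tar_version("tar (GNU tar) 1.13.25")))
```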

-- 
Jon H. LaBadie
 JG Computing
 4455 Province Line Road(609) 252-0159
 Princeton, NJ  08540-4322  (609) 683-7220 (fax)


Re: strange disk offline error

2003-07-24 Thread Gene Heskett
On Thursday 24 July 2003 10:26, Kurt Yoder wrote:
>Joshua Baker-LePain said:
>
><snipped>
>
>> Yup, that's definitely the problem.  Is amanda picking the right dump for
>> your filesystem?  You can tell by the output of ./configure (it might also
>> be in the amandad*debug files -- I'm not sure).
>>
>> tar is nice for being so cross platform.  If you can get that to go, it
>> might be your better option.
>
>OK, I got gnu tar 1.13 compiled and working.

I certainly hope that version above is a typo. 1.13 is broken; you
need 1.13.25, which you can get from alpha.gnu.org.

-- 
Cheers, Gene
99.27% setiathome rank, not too shabby for a WV hillbilly
Yahoo.com attornies please note, additions to this message
by Gene Heskett are:
Copyright 2003 by Maurice Eugene Heskett, all rights reserved.



Re: strange disk offline error

2003-07-23 Thread Kurt Yoder
More information about this problem:

Looking more closely in /tmp/amanda/sendsize...debug, I see
something strange:

sendsize[7578]: time 0.020: calculating for amname '/', dirname '/',
spindle -1
sendsize[7578]: time 0.020: getting size via dump for / level 0
sendsize[7578]: time 0.020: calculating for device '/' with ''
sendsize[7578]: time 0.022: running /bin/dump 0Esf 1048576 - /
sendsize[7578]: time 0.023: running /usr/local/libexec/killpgrp
sendsize[7578]: time 0.137: Usage: dump [-?CDVcgl -L[v] -a[v] -f[v]
-h[dnv] -o[v] -r[dnv] -s[dnv] -t[nv] -T index1[, index2]] file(s)
...
sendsize[7578]: time 0.138: .
sendsize[7578]: estimate time for / level 0: 0.116
sendsize[7578]: no size line match in /bin/dump output for /
sendsize[7578]: .
sendsize[7578]: estimate size for / level 0: -1 KB
sendsize[7578]: time 0.138: asking killpgrp to terminate
sendsize[7578]: time 1.149: done with amname '/', dirname '/',
spindle -1

Notice line 6 "Usage:" message. Perhaps dump is not acting the way
amanda expects? Is this my problem? I had trouble getting gnu tar to
compile, but maybe I should try again so I can use it instead of
dump.

-- 
Kurt Yoder
Sport & Health network administrator



Re: strange disk offline error

2003-07-23 Thread Marc Mengel
Kurt Yoder wrote:

> Weird errors here. The root user is definitely in the passwd file.
> Could this be part of the problem?
>
> Thanks for any ideas on fixing this...

Maybe a silly question, but, can your amanda backup user read the passwd file
on that system?

Marc




Re: strange disk offline error

2003-07-23 Thread Joshua Baker-LePain
On Wed, 23 Jul 2003 at 10:57am, Kurt Yoder wrote

> sendsize[7578]: time 0.020: calculating for amname '/', dirname '/',
> spindle -1
> sendsize[7578]: time 0.020: getting size via dump for / level 0
> sendsize[7578]: time 0.020: calculating for device '/' with ''
> sendsize[7578]: time 0.022: running /bin/dump 0Esf 1048576 - /
> sendsize[7578]: time 0.023: running /usr/local/libexec/killpgrp
> sendsize[7578]: time 0.137: Usage: dump [-?CDVcgl -L[v] -a[v] -f[v]
> -h[dnv] -o[v] -r[dnv] -s[dnv] -t[nv] -T index1[, index2]] file(s)
> ..
*snip*

> Notice line 6 "Usage:" message. Perhaps dump is not acting the way
> amanda expects? Is this my problem? I had trouble getting gnu tar to
> compile, but maybe I should try again so I can use it instead of
> dump.

Yup, that's definitely the problem.  Is amanda picking the right dump for 
your filesystem?  You can tell by the output of ./configure (it might also 
be in the amandad*debug files -- I'm not sure).

tar is nice for being so cross platform.  If you can get that to go, it 
might be your better option.

-- 
Joshua Baker-LePain
Department of Biomedical Engineering
Duke University




strange disk offline error

2003-07-22 Thread Kurt Yoder
(That previous message should have had this subject; sorry, mail
client troubles)

Hello list

I just compiled amanda on a SCO Unix machine (uname -a shows SCO_SV
shcorp 3.2 5.0.6 i386) and tried to follow instructions to install
it, instructing amanda to back up both of its disks. Everything
appears successful, and the machine passes amcheck tests. When I run
amdump at night, my other linux, freebsd, and windows machines dump
successfully. However on my SCO machine, I get the message:

shcorp.shc /stand lev  FAILED [disk /stand offline on shcorp.shcorp.com?]
shcorp.shc / lev  FAILED [disk / offline on shcorp.shcorp.com?]

I've looked in google, and found the following suggestions:

(from faq-o-matic,
http://amanda.sourceforge.net/fom-serve/cache/10.html)

is disk really offline?
Answer appears to be no. After all, I'm using this machine
throughout the day. So I'd assume it should be available for backup,
since no-one touches the machine at night.

filesystem error?
Well, I suppose there *could* be. But the fact that it happens on
both disks seems to indicate that this is not the problem. (I also
installed the same compiled version on a separate sco machine, and
it does the exact same thing).

filesystem too large?
Does not seem to be. /stand is only 15 megabytes, but still fails.

conflicting user name?
Doesn't seem to be it. I configured with user "backup". This only
shows up once in the passwd file, and this box does not have any
external sources for authentication (no NIS, LDAP, etc.)

don't have dump installed?
This isn't it. I compiled by hand, and the config.log shows that
amanda found the dump program. I suppose it could conceivably be
something that amanda doesn't like about SCO's dump program though.
How can I check if this might be the problem?

(from an archived post:
http://groups.yahoo.com/group/amanda-users/message/40200)
permissions on /etc/fstab ok?
On SCO, the file seems to be /etc/mnttab. It is unix mode 644, so
this shouldn't be a problem.



So, I looked at the logs in /tmp/amanda. For the last failed dump, I
see these logs:

-rw-------   1 root     backup       231 Jul 22 00:30 killpgrp.20030722043007.debug
-rw-------   1 root     backup       231 Jul 22 00:30 killpgrp.20030722043009.debug
-rw-------   1 root     sys         2108 Jul 22 00:30 sendsize.20030722003007.debug
-rw-------   1 root     sys         2275 Jul 22 00:30 amandad.20030722003005.debug

(strange that these are owned by root instead of backup; is this a
problem?)

sendsize ends with

sendsize[1383]: time 2.300: child 1388 terminated normally
sendsize: time 2.300: pid 1383 finish time Tue Jul 22 00:30:10 2003

Looks ok to me

amandad ends with

amandad: time 4.281: got packet:

Amanda 2.4 ACK HANDLE 00E-00A00608 SEQ 1058848217


amandad: time 4.281: pid 1382 finish time Tue Jul 22 00:30:10 2003

seems fine, or?

first killpgrp

killpgrp: debug 1 pid 1386 ruid 19 euid 0: start at Tue Jul 22
04:30:07 2003
/usr/local/libexec/killpgrp: version 2.4.4
killpgrp: error [cannot find user root in passwd file]
killpgrp: pid 1386 finish time Tue Jul 22 04:30:07 2003


second killpgrp

killpgrp: debug 1 pid 1389 ruid 19 euid 0: start at Tue Jul 22
04:30:09 2003
/usr/local/libexec/killpgrp: version 2.4.4
killpgrp: error [cannot find user root in passwd file]
killpgrp: pid 1389 finish time Tue Jul 22 04:30:09 2003


Weird errors here. The root user is definitely in the passwd file.
Could this be part of the problem?
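
The first line of each debug file records which real and effective uid the program ran with, which is worth comparing across the helpers. A small Python sketch for pulling those fields out; the pattern is based only on the header lines shown above, and the function name is made up for illustration.

```python
import re

# Matches headers such as "killpgrp: debug 1 pid 1386 ruid 19 euid 0: start at ..."
HEADER_RE = re.compile(r"^(\w+): debug \d+ pid (\d+) ruid (\d+) euid (\d+)")


def parse_debug_header(line: str):
    """Return (program, pid, ruid, euid) from the first line of a debug file."""
    m = HEADER_RE.match(line)
    if not m:
        return None
    prog, pid, ruid, euid = m.groups()
    return prog, int(pid), int(ruid), int(euid)


for header in (
    "killpgrp: debug 1 pid 1386 ruid 19 euid 0: start at Tue Jul 22",
    "killpgrp: debug 1 pid 1389 ruid 19 euid 0: start at Tue Jul 22",
):
    print(parse_debug_header(header))
```

In both killpgrp runs the real uid is 19 and the effective uid is 0, and the user the error names (root) is the one that goes with the effective uid.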

Thanks for any ideas on fixing this...


-- 
Kurt Yoder
Sport & Health network administrator