On Saturday, Nov 25th 2006 at 10:29 -0500, quoth Steven W. Orr:
=>On Friday, Nov 24th 2006 at 22:33 -0500, quoth Steven W. Orr:
=>
=>=>On Wednesday, Nov 22nd 2006 at 10:51 -0500, quoth Charlie Brady:
=>=>
=>=>=>
=>=>=>On Wed, 22 Nov 2006, Steven W. Orr wrote:
=>=>=>
=>=>=>> =>Whatever is feeding the standard input of that process has not
terminated.
=>=>=>> =>What does "ps fax" tell you?
=>=>=>> =>
=>=>=>> =>> Do we need to modify flexbackup to set SIG_IGN for SIGCHLD?
=>=>=>> =>
=>=>=>> =>I don't know why you are suggesting that.
=>=>=>>
=>=>=>> Right. It's not a zombie like I said above, but since it's not, you're
=>=>=>> correct that the issue of SIG_IGN for SIGCHLD would be a red herring.
From
=>=>=>> the ps output above, it's in a sleep state. Your question about who the
=>=>=>> parent is is good. I don't remember because I just killed the process
=>=>=>> after I sent this message but I believe (from previous incidents) it is
=>=>=>> the child of flexbackup. So the tree should be
=>=>=>>
=>=>=>> cron
=>=>=>> \_bash
=>=>=>> \_flexbackup
=>=>=>> \_gzip
=>=>=>
=>=>=>No, the tree should never be just that. Something should be feeding gzip,
=>=>=>and gzip should be feeding something. Both "somethings" should be
children
=>=>=>of flexbackup. The exact identity of the "somethings" will depend on your
=>=>=>configuration.
=>=>=>
=>=>=>> What I think is happening is that flexbackup is waiting for gzip to
=>=>=>> complete before it exits. But gzip doesn't exit because it's waiting
for
=>=>=>> more input, not knowing that more isn't coming.
=>=>=>
=>=>=>Yes, and you need to determine why no more input is coming, and yet the
=>=>=>program providing such input to gzip has not exited.
=>=>=>
=>=>=>> Sometimes I can go a month without a hangup, and sometimes it hangs
=>=>=>> multiple times per week. Do we need to wait for a reoccurance or is
this
=>=>=>> enough to be able to work with?
=>=>=>
=>=>=>It's not enough because you haven't given us the full information. Since
=>=>=>you've killed the gzip process, we can't determine what was feeding it
=>=>=>input and why it was blocked. If you can show the actual process tree
=>=>=>rather than what you think "should" be there, then we can provide more
=>=>=>debugging instructions.
=>=>=>
=>=>=>Perhaps if you describe your configuration someone can speculate about
=>=>=>what process was blocked and why.
=>=>
=>=>Ok. I got a new one today and I'm leaving it around so we can figure this
=>=>thing out.
=>=>
=>=>Here's the cron tree:
=>=>
=>=> 3480 ? Ss 0:01 crond
=>=>16571 ? S 0:00 \_ crond
=>=>16572 ? Ss 0:00 \_ /usr/bin/perl -w /usr/bin/flexbackup
-set backup -incremental
=>=>16846 ? Z 0:00 | \_ [sh] <defunct>
=>=>16573 ? S 0:00 \_ /usr/sbin/sendmail -FCronDaemon -i -odi
-oem -oi -t
=>=>
=>=>And here's the gzip:
=>=>
=>=>16860 ? S 0:01 gzip -9
=>=>
=>=>and ps -ef shows
=>=>
=>=>root 16860 1 0 03:31 ? 00:00:01 gzip -9
=>=>root 16571 3480 0 03:31 ? 00:00:00 crond
=>=>root 16572 16571 0 03:31 ? 00:00:00 /usr/bin/perl -w
/usr/bin/flexbackup -set backup -incremental
=>=>smmsp 16573 16571 0 03:31 ? 00:00:00 /usr/sbin/sendmail
-FCronDaemon -i -odi -oem -oi -t
=>=>root 16846 16572 0 03:31 ? 00:00:00 [sh] <defunct>
=>=>
=>=>which shows that gzip is now the child of init which means that his parent
=>=>exited and orphaned him. And 16846 seems to not be getting cleanup up by
=>=>flexbackup.
=>=>
=>=>Anyone have an idea of what this all means?
=>
=>Next day and we got lucky. It happened again
=>
=>
=>root 3480 0.0 0.0 2668 468 ? Ss Aug27 0:01 crond
=>root 16571 0.0 0.0 3292 988 ? S Nov24 0:00 \_ crond
=>root 16572 0.0 0.5 8320 5960 ? Ss Nov24 0:00 | \_
/usr/bin/perl -w /usr/bin/flexbackup -set backup -incremental
=>root 16846 0.0 0.0 0 0 ? Z Nov24 0:00 | | \_
[sh] <defunct>
=>smmsp 16573 0.0 0.2 7344 2744 ? S Nov24 0:00 | \_
/usr/sbin/sendmail -FCronDaemon -i -odi -oem -oi -t
=>root 21193 0.0 0.0 3292 988 ? S 03:31 0:00 \_ crond
=>root 21194 0.0 0.5 8320 5952 ? Ss 03:31 0:00 \_
/usr/bin/perl -w /usr/bin/flexbackup -set backup -differential
=>root 21377 0.0 0.0 0 0 ? Z 03:31 0:00 | \_
[sh] <defunct>
=>smmsp 21195 0.0 0.2 7344 2728 ? S 03:31 0:00 \_
/usr/sbin/sendmail -FCronDaemon -i -odi -oem -oi -t
=>
=>and we now have two gzips owned by init.
=>
=>526 > ps -ef | grep gzip
=>root 16860 1 0 Nov24 ? 00:00:01 gzip -9
=>root 21419 1 0 03:31 ? 00:00:03 gzip -9
=>steveo 5813 9890 0 10:28 pts/3 00:00:00 grep gzip
=>527 >
Ok. I promise I won't post any more examples. I just had one more from
last night.
3480 ? Ss 0:01 crond
16571 ? S 0:00 \_ crond
16572 ? Ss 0:00 | \_ /usr/bin/perl -w /usr/bin/flexbackup -set
backup -incremental
16846 ? Z 0:00 | | \_ [sh] <defunct>
16573 ? S 0:00 | \_ /usr/sbin/sendmail -FCronDaemon -i -odi -oem
-oi -t
21193 ? S 0:00 \_ crond
21194 ? Ss 0:00 | \_ /usr/bin/perl -w /usr/bin/flexbackup -set
backup -differential
21377 ? Z 0:00 | | \_ [sh] <defunct>
21195 ? S 0:00 | \_ /usr/sbin/sendmail -FCronDaemon -i -odi -oem
-oi -t
17332 ? S 0:00 \_ crond
17333 ? Ss 0:00 \_ /usr/bin/perl -w /usr/bin/flexbackup -set
backup -incremental
17705 ? Z 0:00 | \_ [sh] <defunct>
17334 ? S 0:00 \_ /usr/sbin/sendmail -FCronDaemon -i -odi -oem
-oi -t
524 > ps -ef | grep gzip
root 16860 1 0 Nov24 ? 00:00:01 gzip -9
root 21419 1 0 Nov25 ? 00:00:03 gzip -9
root 17719 1 0 03:32 ? 00:00:01 gzip -9
steveo 12378 3292 0 10:17 pts/5 00:00:00 grep gzip
525 >
Also, I included my flexbackup.conf, if that helps. Thanks.
$type = 'afio';
$set{'backup'} = "/e/web /usr/share/emacs/site-lisp /usr/local /etc /boot
/root /var/spool/mail /var/log";
$prune{'/e/web'} = "steveo/mpg";
$compress = 'gzip'; # one of false/gzip/bzip2/lzop/zip/compress/hardware
$compr_level = '9'; # compression level (1-9) (for gzip/bzip2/lzop/zip)
$buffer = 'buffer'; # one of false/buffer/mbuffer
$buffer_megs = '10'; # buffer memory size (in megabytes)
$buffer_fill_pct = '75'; # start writing when buffer this percent full
$buffer_pause_usec = '100'; # pause after write (tape devices only)
$device = '/d2/backup';
$blksize = '10';
$mt_blksize = "0";
$pad_blocks = 'true';
$remoteshell = 'ssh'; # command for remote shell (rsh/ssh/ssh2)
$remoteuser = ''; # if non-null, secondary username for remote shells
$label = 'true'; # somehow store identifying label in archive?
$verbose = 'true'; # echo each file?
$sparse = 'true'; # handle sparse files?
$indexes = 'true'; # false to turn off all table-of-contents support
$staticfiles = 'false';
$atime_preserve = 'false';
$traverse_fs = 'false';
$exclude_expr[0] = '.*/[Cc]ache/.*';
$exclude_expr[1] = '.*~$';
$erase_tape_set_level_zero = 'true';
$erase_rewind_only = 'false';
$logdir = '/var/log/flexbackup'; # directory for log files
$comp_log = 'bzip2'; # compress log?
false/gzip/bzip2/lzop/compress/zip
$staticlogs = 'false'; # static log filenames w/ no date stamp
$prefix = ''; # log files will start with this prefix
$tmpdir = '/tmp'; # used for temporary refdate files, etc
$stampdir = '/var/lib/flexbackup'; # directory for backup timestamps
$index = '/var/lib/flexbackup/index'; # DB filename for tape indexes
$keyfile = '00-index-key'; # filename for keyfile if archiving to dir
$sprefix = ''; # stamp files will start with this prefix
$afio_nocompress_types = 'mp3 MP3 Z z gz gif GIF zip ZIP lha jpeg jpg JPG
taz tgz deb rpm bz2 lzo png';
$afio_echo_block = 'false';
$afio_compress_threshold = '3';
$afio_compress_cache_size = '2';
$tar_echo_record_num = 'false';
$cpio_format = 'newc';
$dump_length = '0';
$dump_use_dumpdates = 'false';
$star_fifo = 'true';
$star_acl = 'true';
$star_format = 'exustar';
$star_echo_block_num = 'false';
$pax_format = 'ustar';
$zip_nocompress_types = 'mp3 MP3 Z z gz gif zip ZIP lha jpeg jpg JPG taz tgz
deb rpm bz2 lzo';
$pkgdelta_archive_list = 'rootonly';
$pkgdelta_archive_unowned = 'true';
$pkgdelta_archive_changed = 'true';
1;
-------------------------------------------------------------------------
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT & business topics through brief surveys - and earn cash
http://www.techsay.com/default.php?page=join.php&p=sourceforge&CID=DEVDEV
_______________________________________________
flexbackup-help mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/flexbackup-help