I applied the 3-line patch to the 0831 snapshot and ran a full backup on
both machines, with 4 file systems each. All 8 completed successfully with
no "strange" messages.

Next, I commented out the 3 new lines and tried the backup again on one of
the machines. This time all 4 file systems failed; e.g.:

1251825776.552104: sendbackup: pid 24374 ruid 150 euid 150 version
2.6.2alpha: start at Tue Sep  1 11:22:56 2009
1251825776.552688: sendbackup: Version 2.6.2alpha
1251825776.575326: sendbackup: pid 24374 ruid 150 euid 150 version
2.6.2alpha: rename at Tue Sep  1 11:22:56 2009
1251825776.577315: sendbackup:   Parsed request as: program `DUMP'
1251825776.577352: sendbackup:                      disk `/'
1251825776.577377: sendbackup:                      device `/'
1251825776.577400: sendbackup:                      level 0
1251825776.577424: sendbackup:                      since NODATE
1251825776.577446: sendbackup:                      options `'
1251825776.578210: sendbackup: start: zirconium.example.com:/ lev 0
1251825776.579426: sendbackup: dumping device '/dev/rsd1a' with 'ffs'
1251825776.582918: sendbackup: Spawning "/sbin/dump dump 0usf 1048576 -
/dev/rsd1a" in pipeline
1251825776.587089: sendbackup: Started backup
1251825776.619734: sendbackup:  90:  normal(|):   DUMP: Date of this level 0
dump: Tue Sep  1 11:22:56 2009
1251825776.632529: sendbackup: Started index creator: "/sbin/restore -tvf -
2>&1 | sed -e '
s/^leaf[        ]*[0-9]*[       ]*\.//
t
/^dir[  ]/ {
s/^dir[         ]*[0-9]*[       ]*\.//
s%$%/%
t
}
d
'"
1251825776.721646: sendbackup:  90:  normal(|):   DUMP: Date of last level 0
dump: the epoch
1251825776.722686: sendbackup:  90:  normal(|):   DUMP: Dumping /dev/rsd1a
(/) to standard output
1251825776.779507: sendbackup:  90:  normal(|):   DUMP: mapping (Pass I)
[regular files]
1251825778.010107: sendbackup:  90:  normal(|):   DUMP: mapping (Pass II)
[directories]
1251825778.012085: sendbackup:  90:  normal(|):   DUMP: estimated 47600 tape
blocks.
1251825778.013367: sendbackup:  90:  normal(|):   DUMP: Volume 1 started at:
Tue Sep  1 11:22:58 2009
1251825778.020808: sendbackup: critical (fatal): index tee cannot write
[Resource temporarily unavailable]
1251825778.020918: sendbackup:  90:  normal(|):   DUMP: dumping (Pass III)
[directories]
1251825778.022805: sendbackup: 115: strange(?): sendbackup: index tee cannot
write [Resource temporarily unavailable]
1251825778.042344: sendbackup:  90:  normal(|):   DUMP: Broken pipe
1251825778.047559: sendbackup:  90:  normal(|):   DUMP: The ENTIRE dump is
aborted.
1251825778.048234: sendbackup: critical (fatal): error [dump (21317)
/sbin/dump returned 3]

So it seems reliable that those 3 lines fix the problem somehow.
Anything else you want to try before I ask for help on the OpenBSD list?

Thanks,
Michael

On Tue, Sep 1, 2009 at 9:32 AM, Michael Burk <bur...@gmail.com> wrote:

> I checked the errata for OpenBSD 4.5, but saw nothing that looked related.
> I applied the patch to the 0831 snapshot and am building it now. After we
> find the minimal patch, as Jean-Louis said, I'll post on the OpenBSD-misc
> list to see if anyone has an explanation.
>
> Thanks guys for working on this - I know it's a frustrating one.
>
> -- Michael
>
>
> On Tue, Sep 1, 2009 at 8:33 AM, Dustin J. Mitchell <dus...@zmanda.com>wrote:
>
>> On Tue, Sep 1, 2009 at 7:45 AM, Jean-Louis
>> Martineau<martin...@zmanda.com> wrote:
>> > We need to find a minimal patch that fix the problem.
>> > Cat you try the attached patch?
>>
>> This is starting to look like a kernel bug -- is there an associated
>> OpenBSD bug or something that we could reference in comments in the
>> code to explain this strange formulation?  Also, we'll probably need
>> to do this everywhere we fork and plumb a new process -- in
>> pipespawn.c at least.
>>
>> Dustin
>>
>> --
>> Open Source Storage Engineer
>> http://www.zmanda.com
>>
>
>

Reply via email to