Re: amrestore problem, headers ok but no data

2005-01-21 Thread Brian Cuttler
Stefan,

On Wed, Jan 19, 2005 at 10:48:54PM +0100, Stefan G. Weichinger wrote:
 BC Major imporvement though still below the acceptable threshold.
 
 BC Definitely looking like it was never an amanda issue, this looks to
 BC be an underlying architecture problem.
 
 I hope that you spot the issue soon, sounds much better already ;-)

Always the simple stuff. Block size (still need to find out how to
reset at boot time), SCSI bus length...

Confirmed with SGI engineer, with the Narrow device on the bus we
are limited to 1.5 meters, we'd have been ok with WIDE devices, 3 Meter
limit but we are clearly over.

Will see if I can't get permission to remove SDLT/shoebox and see if
the jukebox doesn't work properly. In the longer term I think we've
convinced the machine owner to purchase/install an additional SCSI card.

thank you,

Brian
---
   Brian R Cuttler [EMAIL PROTECTED]
   Computer Systems Support(v) 518 486-1697
   Wadsworth Center(f) 518 473-6384
   NYS Department of HealthHelp Desk 518 473-0773



Re: amrestore problem, headers ok but no data

2005-01-21 Thread Gene Heskett
On Friday 21 January 2005 11:37, Brian Cuttler wrote:
Stefan,

On Wed, Jan 19, 2005 at 10:48:54PM +0100, Stefan G. Weichinger wrote:
 BC Major imporvement though still below the acceptable threshold.

 BC Definitely looking like it was never an amanda issue, this
 looks to BC be an underlying architecture problem.

 I hope that you spot the issue soon, sounds much better already
 ;-)

Always the simple stuff. Block size (still need to find out how to
reset at boot time), SCSI bus length...

Put the appropriate commands, as you would exec them from a root 
shell, in the /etc/rc.d/rc.local file.  They'll work as long as there 
is a loaded tape in the drive during the bootup.

Confirmed with SGI engineer, with the Narrow device on the bus we
are limited to 1.5 meters, we'd have been ok with WIDE devices, 3
 Meter limit but we are clearly over.

What???  Is???  He??? Smoking???  I want some of that!  Its got to be 
great stuff...

The scsi bus, properly setup and terminated, can be as long as 30 some 
meters, and I have personally ran it well over 10 feet!

But with poorly matched stuff, you are often in the virgin sacrificing 
business at 18 inches or less.  This 'engineer' (note the lowercase, 
its intended) obviously does not understand that the scsi bus is a 
transmission line, and subject to the usual rules regarding control 
of vswr.  VSWR=Voltage Standing Wave Ratio, and is a measure of the 
raltive magnitudes of the signal power going down the cable, and 
coming back up the cable from the far end.  Ideally, with proper 
terms in place, there is no echo from either end as the terms 
perfectly absorb the signal by presenting a load to the cable that 
matches the cables impedance very closely, making the cable look to 
be of almost infinite length.

The bus itself is wired OR, open collector style, meaning that any 
device, anyplace on the bus, can exert a pulldown to a logic zero, 
and thos signal will propagate both ways on the cable from a device 
in the middle, with no echo's comeing back from either end of the 
cable under ideal conditions.

In order to control echos or vswr as the rf folks call it, it must be 
terminated with its characteristic impedance at both ends of the 
cable.  Note that this means the extra lines for a wide bus MUST be 
terminated at the physical end of that wider portion of the bus, 
preferably with whats known as an active termination as that 
typically matches the cables impedance much better than the usual 
powered resistor packages do.  Then the rest of the cable, the narrow 
part, needs to be terminated at its physical ends too, with no spare 
cable going on by that point because of the echos generated by the 
open end of the cable if the terms are not actually at the end of the 
path.

The card normally terminates the src end of the cable, usually with 
active terms today, but its toss a coin to see what you get from the 
aftermarket folks unless you specify, and accept and pay for, no 
less.

Let me get a view of what your logic one noise margin is Brian, grab a 
multimeter and poke into an unused connector of the cable and read 
the resting voltages of 3 or 4 of the lines and tell me what your 
meter reads please.  All lines on one side of the connector should be 
grounded, so I'm interested in the 2.x volts you'll read on the other 
lines.

Will see if I can't get permission to remove SDLT/shoebox and see if
the jukebox doesn't work properly. In the longer term I think we've
convinced the machine owner to purchase/install an additional SCSI
 card.

They aren't THAT expensive, but non-technical bean counters often 
cannot see the value obtained from the expense spent because they 
don't understand the value received is often a do, or  don't, and 
there is no middle ground scenario.  You often have to explain it in 
terms of if you want it to work, we spend x dollars to make it work.
Otherwise its no workee.  Then its simple.  Give them a middle ground 
where it might work, and they'll assume you walk on water and can 
make it work everytime.  We're quite often not that lucky in this 
business.

Worth every penny in most cases.

Good luck.

-- 
Cheers Brian, Gene
There are four boxes to be used in defense of liberty:
 soap, ballot, jury, and ammo. Please use in that order.
-Ed Howdershelt (Author)
99.32% setiathome rank, not too shabby for a WV hillbilly
Yahoo.com attorneys please note, additions to this message
by Gene Heskett are:
Copyright 2005 by Maurice Eugene Heskett, all rights reserved.


Re: amrestore problem, headers ok but no data

2005-01-19 Thread Brian Cuttler
Info update only.

Ok we've shutdown the Origin 300 and swapped the scsi daisy-chain
around a little.

At this point the SDLT in the shoebox was able to restore a large
tar file without causing a SCSI reset and hanging the system.

I then set the blocksize on the SDLT in the jukebox and relabled
the amanda tapes. I ran an amdump with only 3 DLE, one using dump
and 2 using (gnu)tar. As always, the amdump completed with (apparent)
success.

This time however I was actually able to restore the dump file (a 
relatively small level 1), and partitially restored both tar dumps
before they each hung the bus and caused a reset.

My hope is that the (NARROW) robot is still between the SDLT in the
jukebox and that swapping the scsi cable and the terminator on the
jukebox will resolve the issue. I'm waiting on the vendor of the 
jukebox on this info - if I don't hear quickly I'll just open the
cover on the jukebox and have a look inside.

Major imporvement though still below the acceptable threshold.

Definitely looking like it was never an amanda issue, this looks to
be an underlying architecture problem.

---
   Brian R Cuttler [EMAIL PROTECTED]
   Computer Systems Support(v) 518 486-1697
   Wadsworth Center(f) 518 473-6384
   NYS Department of HealthHelp Desk 518 473-0773



Re: amrestore problem, headers ok but no data

2005-01-19 Thread Stefan G. Weichinger
Hi, Brian,

on Mittwoch, 19. Jänner 2005 at 22:02 you wrote to amanda-users:

BC Major imporvement though still below the acceptable threshold.

BC Definitely looking like it was never an amanda issue, this looks to
BC be an underlying architecture problem.

I hope that you spot the issue soon, sounds much better already ;-)
-- 
best regards,
Stefan

Stefan G. Weichinger
mailto:[EMAIL PROTECTED]






Re: amrestore problem, headers ok but no data

2005-01-12 Thread Eric Siegerman
On Tuesday 11 January 2005 16:40, Jon LaBadie wrote:
Also, I think that if both types of devices exist on the same bus,
the lower performance one determines the performance of the entire
bus.

In theory, this is *not* the case.  One of the (many) selling
points of SCSI over IDE is supposed to be that a SCSI bus can run
each device at its own speed (though perhaps later versions of
the IDE spec have caught up, as they have in some other respects;
I dunno).  Of course, the slower/narrower device will consume
more of the SCSI bus's available bandwidth to carry the same
amount of data, even if they don't directly affect performance of
the faster/wider devices.

In practice, according to the excellent SCSI FAQ, it depends on
the devices in question.  See these questions in particular:
  - Can I connect a SCSI-3 disk to my SCSI-1 host adapter?
[...]  (which isn't Brian's precise situation, but the
answer might well apply)
  - How can I calculate the performance I'll get with mixed SCSI
devices?

The SCSI FAQ is dated, but still useful.  It's at
www.scsifaq.org; click on the link for Official
comp.periphs.scsi FAQ.  Sorry, but the site uses
too-smart-for-its-own-good navigation that makes it hard to post
the actual URL.


On Tue, Jan 11, 2005 at 10:48:17PM -0500, Gene Heskett wrote:
 [A SCSI bus is]
 double handicapped because the cable is, compared to a piece of well 
 built coax, pretty much a guestimate as to its operating impedance, 
 usually quoted as being in the 120 to 130 ohm territory,

This isn't supposed to be a problem either, because cable
impedence isn't supposed to be a guesstimate; it's explicitly
specified in the SCSI specs.  But in practice, what you've said
is true; there's all manner of out-of-spec junk sold as SCSI
cables.

For example, I've read that you can have problems if you put a
SCSI adapter in the middle of an internal/external chain, even if
all the termination is correct, because the internal ribbon cable
and the external cable might have different impedences, leading
to signal reflections between the two cables.

For a telling, if rather ancient, anecdote told by someone from
Adaptec, see question What is the problem with the Adaptec 1542C
and external cables? in the SCSI FAQ.


off-topic
 To many folks forget that a a scsi bus is indeed 
 an rf transmission line, subject to the usual rules about vswr.

Geez, you mean we've got an actual engineer in the e-room?
Awesome!  (Despite my rambling about impedences and such, I'm
sure no hardware guy.)

From the context it's pretty clear what vswr means, but what
does it stand for?
/off-topic

--

|  | /\
|-_|/ Eric Siegerman, Toronto, Ont.[EMAIL PROTECTED]
|  |  /
In theory, there is no difference between theory and practice.
But, in practice, there is.
- Jan van de Snepscheut 


Re: amrestore problem, headers ok but no data

2005-01-12 Thread Jon LaBadie
On Wed, Jan 12, 2005 at 11:18:00AM -0500, Eric Siegerman wrote:
 On Tuesday 11 January 2005 16:40, Jon LaBadie wrote:
 Also, I think that if both types of devices exist on the same bus,
 the lower performance one determines the performance of the entire
 bus.
 
 In theory, this is *not* the case.  ...
 
 On Tue, Jan 11, 2005 at 10:48:17PM -0500, Gene Heskett wrote:

 As to the whole chain being restricted to the speed of the slowest
 device, I've heard that, but it doesn't, on the face of it, make a
 lot of sense to me


Gene, Eric,

Thanks for the correction.
I'm glad I remembered to say I wasn't a scsi expert.
My comments provide the proof.

 
 off-topic
  To many folks forget that a a scsi bus is indeed 
  an rf transmission line, subject to the usual rules about vswr.
 
 From the context it's pretty clear what vswr means, but what
 does it stand for?
 /off-topic
 
Wondered too, as I'm posting anyway, I'll guess: v??? standing wave reflections

jl
-- 
Jon H. LaBadie  [EMAIL PROTECTED]
 JG Computing
 4455 Province Line Road(609) 252-0159
 Princeton, NJ  08540-4322  (609) 683-7220 (fax)


Re: amrestore problem, headers ok but no data

2005-01-12 Thread Joe Lewandoski
Google says: voltage standing wave ratio

   To many folks forget that a a scsi bus is indeed 
   an rf transmission line, subject to the usual rules about vswr.
  
  From the context it's pretty clear what vswr means, but what
  does it stand for?
  /off-topic
  
 Wondered too, as I'm posting anyway, I'll guess: v??? standing wave 
 reflections
   
 jl
 -- 
 Jon H. LaBadie  [EMAIL PROTECTED]
  JG Computing
  4455 Province Line Road(609) 252-0159
  Princeton, NJ  08540-4322  (609) 683-7220 (fax)

-- 
_
Joe Lewandoski, N342, x85722
jlwndski at bellsouth dot net
lewandoskij at navo dot navy dot mil



Re: amrestore problem, headers ok but no data

2005-01-12 Thread Gene Heskett
On Wednesday 12 January 2005 14:55, Gene Heskett wrote:
On Wednesday 12 January 2005 12:04, Jon LaBadie wrote:
On Wed, Jan 12, 2005 at 11:18:00AM -0500, Eric Siegerman wrote:
 On Tuesday 11 January 2005 16:40, Jon LaBadie wrote:
 Also, I think that if both types of devices exist on the same
  bus, the lower performance one determines the performance of
  the entire bus.

 In theory, this is *not* the case.  ...

 On Tue, Jan 11, 2005 at 10:48:17PM -0500, Gene Heskett wrote:

 As to the whole chain being restricted to the speed of the
 slowest device, I've heard that, but it doesn't, on the face of
 it, make a lot of sense to me

Gene, Eric,

Thanks for the correction.
I'm glad I remembered to say I wasn't a scsi expert.
My comments provide the proof.

 off-topic

  To many folks forget that a a scsi bus is indeed
  an rf transmission line, subject to the usual rules about vswr.
 
 From the context it's pretty clear what vswr means, but what

 does it stand for?
 /off-topic

Wondered too, as I'm posting anyway, I'll guess: v??? standing wave
 reflections

VSWR=Voltage Standing Wave Reflections.  Back in rf days, because
 the signal is repetitive, very high voltages could develop at
 certain points in a transmission line driving an antenna that
 wasn't an ideal load, so they came to be known as standing waves. 
 And it doesn't take a lot of miss-match to burn a 6  1/8 75 ohm
 rigid copper coax with 30kw peak power going into it at channel 19
 (about 507mhz), making you call in a tower crew and replace the
 burnt section and clean the teflon carbon out of the other 1000
 feet of it.  Lets just say thats expen$ive...  But now lets get
 down to the much profaned scsi buss, where these same physical laws
 apply even if they don't make junk and smoke out of expensive
 copper and teflon parts.

Understand we are dealing with a bus capable of responding to a 10
nanosecond or less signal for starters, often lots less.

Some ascii art if you'll all bear with me  use a monospaced font.
The vertical scale:
3.0 Volts a solid logic 1
2.4 Volts unk upper bound, usually a 1 99% of the time
1.8 Volts unk logic state
1.2 Volts unk logic state
0.6 Volts unk lower bound, usually a 0 99% of the time
0.0 Volts a solid logic zero

Now assume a worst case condition of no termination just to make it
simple.

Input pulse from typical wired-or driver, active pulldown.

And screw it, the (insert 15 minute non-repetitive monologue of 
swearing) list server ate my tabs.
Sorry folks, I tried.
t0 t+5ns t+10ns t+15ns  t+20ns t+25ns t+30ns
1---|  |
? |  |
? |  |
? |  |
? |  |
0 |

Same pulse 18 down the cable after the pulse has reached the end of
the 24 cable and bounced because its not terminated.

   |-| - this could be clipped
   |
   | |   |-|
   | |   |
   | |   | | |-
   | |   | | |etc

|  |-| |   | | |etc

   |  | |   | | |etc
   |  | |   | | |etc
   |  | |   | |
   |  | |   | |-|etc
   |  | |
   |  | |-|
   |
   |---|

As you can see, after the main data pulse, there are several forays
into unknown territory as the rest of the circuit slowly absorbs the
unwanted echo.  Also, in the real world, there would be some wibbles
in the initial logic zero time I drew as a flat line, but most of
these are absorbed by the very solidly turned on driver transistor
 at the time.  These largly disappear if the terminations are
 correct, but only active terms really match the cable that well.

This loss of the upper boundary noise margin in particular is one of
the reasons I'd love to see a special, 5.8 volt supply made
 available just for supplying term power to scsi busses, the extra
 .8 volts to make up for the losses because the average card
 designer gets his choice of a low voltage drop schotkey diode for
 the isolator overridden by some friggin bean counter who doesn't
 understand that his arbitrary replacement of a 50 cent diode with a
 10 cent si diode, and its .7 volt loss, has just cost the product
 about 90% of its logic 1 noise margin by reducing the upoper
 resting voltage of the buss to the 2.6 volt area.  And we wind up
 advertising for virgins to sacrifice to make the darned thing work.
  Hell, even a Ge diode would be a better choice if they think they
 cannot afford that 50 cent schotkey.

Hopefully this poor ascii art might clarify the situation a bit.  If
I'm curious, I turn on my scope and look, but TBT, my 100 mhz dual
trace scope is actually a bit slow to really display stuff like
 this. Stuff like this really needs a 500mhz scope to display it
 anywhere near in real time, but the chips on a scsi buss are in
 fact that fast!

-- 
Cheers, Gene
There are four boxes to be used in defense of liberty:
 soap, ballot, jury, and ammo. Please use in that order.
-Ed Howdershelt (Author)
99.31% setiathome rank, not 

Re: amrestore problem, headers ok but no data

2005-01-11 Thread Stefan G. Weichinger
Hi, Jon,

on Montag, 10. Jänner 2005 at 22:19 you wrote to amanda-users:

JL In recent versions, amanda can work with blocksizes other than 32k.
JL I forget if it is a configure option needed during the build or
JL a parameter that can be set in amanda.conf.  I've never used it.

The parameter blocksize goes into the tapetype-section in
amanda.conf.

man amanda says about that parameter:

   blocksize int
  Default:  32 kbytes.  How much data will be written
  in each tape record.  The minimum  blocksize  value
  is  32  KBytes.   The maximum blocksize value is 32
  KBytes.  The maximum is set during  configure  with
  --with-maxtapeblocksize.

Maybe this is just documented wrong, as the minimum is the same as the
maximum, except when you set another maximum with the
configure-option.

But as I see from the other thread, Brian is on another path right
now ... maybe he'll come back to blocksizes later.

-- 
best regards,
Stefan

Stefan G. Weichinger
mailto:[EMAIL PROTECTED]






Re: amrestore problem, headers ok but no data

2005-01-11 Thread Brian Cuttler

Amanda user list,

I've spoken to Harry at Condre systems and he confirmed what we
where seeing. The jukebox robot is a narrow bus so that are two
changes we need to make.

1) We want to move the jukebox/SDLT to be the last device on
   the daisy chain.

2) We want to terminate the jukebox-robot, so we need to make
   certain that the live port on the box hits the SDLT before
   we reach (electically) the robot.

Hopefully this will have a positive effect on both SDLT drives.

---
   Brian R Cuttler [EMAIL PROTECTED]
   Computer Systems Support(v) 518 486-1697
   Wadsworth Center(f) 518 473-6384
   NYS Department of HealthHelp Desk 518 473-0773


Re: amrestore problem, headers ok but no data

2005-01-11 Thread Brian Cuttler

Jon,

Recommended buying an additional scsi card more than once to the
system's owners, haven't made the case yet (maybe now). I would
really have liked the raid array on a separate bus from the tape
drives.

I have to trust that the proper cabeling to terminate the additional
lines is present in the jukebox casing as the wiring is inaccessible
and the sdlt completely enclosed (there is a door that slides open to
admit to expel cartridges) and only well, 4 connection, 2 scsi (in
and out), power and a port for a serial console.

I agree with you completely but all I can hope for at the moment is
to get the narrowest bus as far away from the cpu as I can.

thanks,

Brian

On Tue, Jan 11, 2005 at 04:40:40PM -0500, Jon LaBadie wrote:
 Not a scsi expert here, but if you terminate the jukebox, wouldn't
 that terminate only the narrow portion of the bus leaving the extra
 lines of the wide bus unterminated.  Might there be some kind of
 device/cable to go from wide - narrow terminating the unused lines?
 
 Also, I think that if both types of devices exist on the same bus,
 the lower performance one determines the performance of the entire
 bus.  That narrow jukebox may hurt your sdlt performance just by
 being on the bus.  Some of my scsi adapters have multiple channels
 and buses.  Perhaps you could run the sdlt's on one channel, wide,
 and the jukebox on another, narrow or wide.  I added a cheap second
 controller from a dead system for a very similar purpose, external
 devices that were narrow when the main controller was driving only
 wide devices.  Its two channels were used for different speed devices.
 
 jl
 -- 
 Jon H. LaBadie  [EMAIL PROTECTED]
  JG Computing
  4455 Province Line Road(609) 252-0159
  Princeton, NJ  08540-4322  (609) 683-7220 (fax)
---
   Brian R Cuttler [EMAIL PROTECTED]
   Computer Systems Support(v) 518 486-1697
   Wadsworth Center(f) 518 473-6384
   NYS Department of HealthHelp Desk 518 473-0773



Re: amrestore problem, headers ok but no data

2005-01-11 Thread Gene Heskett
On Tuesday 11 January 2005 16:40, Jon LaBadie wrote:
On Tue, Jan 11, 2005 at 03:40:47PM -0500, Brian Cuttler wrote:
 Amanda user list,

 I've spoken to Harry at Condre systems and he confirmed what we
 where seeing. The jukebox robot is a narrow bus so that are two
 changes we need to make.

 1) We want to move the jukebox/SDLT to be the last device on
the daisy chain.

 2) We want to terminate the jukebox-robot, so we need to make
certain that the live port on the box hits the SDLT before
we reach (electically) the robot.

 Hopefully this will have a positive effect on both SDLT drives.

Not a scsi expert here, but if you terminate the jukebox, wouldn't
that terminate only the narrow portion of the bus leaving the extra
lines of the wide bus unterminated.  Might there be some kind of
device/cable to go from wide - narrow terminating the unused lines?

Also, I think that if both types of devices exist on the same bus,
the lower performance one determines the performance of the entire
bus.  That narrow jukebox may hurt your sdlt performance just by
being on the bus.  Some of my scsi adapters have multiple channels
and buses.  Perhaps you could run the sdlt's on one channel, wide,
and the jukebox on another, narrow or wide.  I added a cheap second
controller from a dead system for a very similar purpose, external
devices that were narrow when the main controller was driving only
wide devices.  Its two channels were used for different speed
 devices.

jl

Generally speaking Jon, thats very good advice. In any event, where 
there is a transition to narrow devices, the unused lines must be 
properly terminated. To many folks forget that a a scsi bus is indeed 
an rf transmission line, subject to the usual rules about vswr.  And 
double handicapped because the cable is, compared to a piece of well 
built coax, pretty much a guestimate as to its operating impedance, 
usually quoted as being in the 120 to 130 ohm territory, hence the 
commonly used resistive termination of a pair of resistors attached 
to the data line, a 220 ohm connected to the +5 volt line, and a 330 
ohm connected to ground.  It works quite well provided the resting 
voltage on the data lines can be held well above the 2.6 volt region, 
preferably at nearly 3.0 volts, giving an equal noise margin for both 
a logic zero and a logic one.  Having that isolation diode in series 
with this feed often drops that voltage into the very low noise 
margin region below 2.6 volts.  At that point you start sacrificing 
virgins etc to make it work.

As to the whole chain being restricted to the speed of the slowest 
device, I've heard that, but it doesn't, on the face of it, make a 
lot of sense to me given that the drivers are properly written to 
auto-detect whats narrow and slow, and whats wide and fast.  But 
thats a given I wouldn't swear about, at maybe...  Driver quality 
does vary unfortunately depending on the authors whims and expertise 
at the time.

If Brian has an old card he can stick in just to run the robot, it 
sure would be worth the try.

-- 
Cheers, Gene
There are four boxes to be used in defense of liberty:
 soap, ballot, jury, and ammo. Please use in that order.
-Ed Howdershelt (Author)
99.31% setiathome rank, not too shabby for a WV hillbilly
Yahoo.com attorneys please note, additions to this message
by Gene Heskett are:
Copyright 2005 by Maurice Eugene Heskett, all rights reserved.


Re: amrestore problem, headers ok but no data

2005-01-10 Thread Brian Cuttler

The following (just scripted, with commentary added) is a little long
and rather un-good.


samar 9# script
Script started, file is typescript

Position us at the start of the next volume. This volume has been
relabeled after setting the block size on the tape (last week).


samar 1# /usr/local/sbin/amcheck samar
Amanda Tape Server Host Check
-
ERROR: holding disk /usr5/dumps/amanda: statfs: No such file or directory
amcheck-server: slot 7: date Xlabel SAMAR08 (exact label match)
NOTE: skipping tape-writable test
Tape SAMAR08 label ok
Server check took 4.311 seconds

Amanda Backup Client Hosts Check

Client check: 1 host checked in 0.168 seconds, 0 problems found

(brought to you by Amanda 2.4.4p1-20030716)

Find out the current block size. I has expected 32768, it isn't.

samar 2# mt -f /dev/sdlt2 blksize

 Recommended tape I/O size: 131072 bytes (256 512-byte blocks)
 Minimum block size: 4 byte(s)
 Maximum block size: 16777212 bytes
 Current block size: Variable

Get the current label.

We failed unless we specified the blocksize of 32768.

samar 5# mt -f /dev/sdlt2 rewind

samar 2# dd if=/dev/sdlt2 of=./scratch
Read error: Invalid argument
0+0 records in
0+0 records out

samar 3# dd if=/dev/sdlt2 of=./scratch bs=32768
1+0 records in
1+0 records out

Many null characters removed from cat output.

samar 6# dd if=/dev/sdlt2 of=./scratch bs=32768
1+0 records in
1+0 records out
samar 7# cat -evt scratch | more
AMANDA: TAPESTART DATE X TAPE SAMAR08$
^L$
[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
PROTECTED]@^@

Set the blocksize to the drive recommended size.

samar 8# mt -f /dev/sdlt2 setblksz 131072


samar 9# dd if=./scratch of=/dev/sdlt2 obs 131072
Bad argument: obs
samar 10# dd if=./scratch of=/dev/sdlt2 obs=131072
Write error: Invalid argument
64+0 records in
0+0 records out
samar 11# mt -f /dev/sdlt2 blksize

 Recommended tape I/O size: 131072 bytes (256 512-byte blocks)
 Minimum block size: 4 byte(s)
 Maximum block size: 16777212 bytes
 Current block size: 131072 byte(s)
samar 12# dd if=/dev/sdlt2 of=./scratch bs=131072
0+0 records in
0+0 records out

samar 13# mt -f /dev/sdlt2 rewind

samar 14# dd if=/dev/sdlt2 of=./scratch bs=131072
0+0 records in
0+0 records out
samar 15# !cat
cat -evt scratch | more

samar 16# mt -f /dev/sdlt2 rewind

samar 17# dd if=/dev/sdlt2 of=./scratch bs=131072
0+0 records in
0+0 records out

samar 18# /usr/local/sbin/amcheck samar
Amanda Tape Server Host Check
-
ERROR: holding disk /usr5/dumps/amanda: statfs: No such file or directory
amcheck-server: slot 7: reading label: Invalid argument
amcheck-server: slot 8: reading label: Invalid argument

Progressed this way through the rest of the tapes.

- use amtape to reset us to slot 7.

Probing the drive I find a block size of 32768 again.

samar 25# mt -f /dev/sdlt2 setblksz 131072

samar 26# /usr/local/sbin/amlabel -f samar SAMAR08
labeling tape in slot 7 (/dev/sdlt2):
rewinding, reading label, reading label: Invalid argument
rewinding, writing label SAMAR08
amlabel: writing label: Invalid argument

Have we hit some sort of amanda buffer size limit ?
Something that didn't error during the write but did 
during the read ?

Are the new drivers in the OS failing to properly handle the drive ?

My money is on an OS problem.

Is anyone successfully using SDLT 320 drives on an SGI/IRIX system ?
Note, I have the required APD drivers installed and configured.

I'm gonna open a call with SGI.

Jan 10 10:58:54 3D:samar TapeCheckError[234379]: 
|$(409)tps1d4,SDLT320,,pid=411237,read,Aborted command,IDE message received
Jan 10 10:58:54 6D:samar tps1d4[234379]: Command aborted by target -- 
Jan 10 10:58:54 3D:samar tps1d4[234379]: Aborted command

---
   Brian R Cuttler [EMAIL PROTECTED]
   Computer Systems Support(v) 518 486-1697
   Wadsworth Center(f) 518 473-6384
   NYS Department of HealthHelp Desk 518 473-0773



Re: amrestore problem, headers ok but no data

2005-01-10 Thread Jon LaBadie
On Mon, Jan 10, 2005 at 11:48:42AM -0500, Brian Cuttler wrote:
 
 The following (just scripted, with commentary added) is a little long
 and rather un-good.
 
...
 
 Probing the drive I find a block size of 32768 again.
 
 samar 25# mt -f /dev/sdlt2 setblksz 131072
 
 samar 26# /usr/local/sbin/amlabel -f samar SAMAR08
 labeling tape in slot 7 (/dev/sdlt2):
 rewinding, reading label, reading label: Invalid argument
 rewinding, writing label SAMAR08
 amlabel: writing label: Invalid argument
 
 Have we hit some sort of amanda buffer size limit ?
 Something that didn't error during the write but did 
 during the read ?
 

This is not too surprising to me.

A tape with a blocksize of 132k will accept a write of 32k and
simply pad it to the required size.  No errors.

But on read back, it will not allow a read of 32k when the
blocksize of the taped data  is 132k.  That could be your read error.

In recent versions, amanda can work with blocksizes other than 32k.
I forget if it is a configure option needed during the build or
a parameter that can be set in amanda.conf.  I've never used it.

-- 
Jon H. LaBadie  [EMAIL PROTECTED]
 JG Computing
 4455 Province Line Road(609) 252-0159
 Princeton, NJ  08540-4322  (609) 683-7220 (fax)


Re: amrestore problem, headers ok but no data

2005-01-10 Thread Brian Cuttler
Jon,

On Mon, Jan 10, 2005 at 04:19:45PM -0500, Jon LaBadie wrote:
  Something that didn't error during the write but did 
  during the read ?
  
 
 This is not too surprising to me.
 
 A tape with a blocksize of 132k will accept a write of 32k and
 simply pad it to the required size.  No errors.
 
 But on read back, it will not allow a read of 32k when the
 blocksize of the taped data  is 132k.  That could be your read error.
 
 In recent versions, amanda can work with blocksizes other than 32k.
 I forget if it is a configure option needed during the build or
 a parameter that can be set in amanda.conf.  I've never used it.

It has been an eventful afternoon.

I'll try to get a newer version installed tomorrow.

I just learned that I have have a device ordering problem on the
scsi chain.

Looks like the robot may be interfering with my second sdlt in
the shoebox. Though I don't expect it to be causing a problem
with the sdlt within the robot case.

samar 39# scsiha -t -p 1
BUS PARMS: Selection Timeout: 25 uS, Host SCSI ID: 0, SE
TARGET PARMS: [1] async. transfers, NARROW
: [2] sync. period: 50 nS, sync. offset: 14, WIDE
: [4] sync. period: 50 nS, sync. offset: 14, WIDE
: [5] sync. period: 50 nS, sync. offset: 14, WIDE


---
   Brian R Cuttler [EMAIL PROTECTED]
   Computer Systems Support(v) 518 486-1697
   Wadsworth Center(f) 518 473-6384
   NYS Department of HealthHelp Desk 518 473-0773



Re: amrestore problem, headers ok but no data

2005-01-08 Thread Eric Siegerman
On Fri, Jan 07, 2005 at 03:11:41PM -0500, Jon LaBadie wrote:
 That ENOMEM, reported as insufficient memory sometimes used to
 throw me for a loop.  Here is the situation as I understand it.
 
 To enhance performance dd tries to do unbuffered I/O, meaning the
 data goes directly to memory in dd rather than through buffers
 in the OS and then to dd.  An upshot of this is that the buffer
 dd reads into must be at least as big as a block on the device.
 As there is no OS buffering, dd must get the entire block from
 the device in one shot, no taking little chunks at a time.  The
 devices do not send bytes on request, they send blocks.

This is correct, except for one minor quibble -- the claim that
dd *tries* to do unbuffered I/O.  As I understand it, dd has no
choice in the matter.  Character devices (usually?) behave as
you've described:  there's no buffering within the kernel; the
data flows more-or-less directly from the hardware into the
process's user-space buffer, and vice versa during writes.
Further, each read() or write() call by the process is typically
translated into a single hardware I/O request.  This imposes a
restriction on the user process: the call must read and write
only whole hardware blocks.  Specifically:
  (a) each read() or write() call must start on a block boundary
  (b) it must also end on a block boundary

For character, aka raw, disk devices, the read() and write()
cases are identical:
  (a) at the time of any read() or write() call, the seek offset
  must be a multiple of the disk's sector size
  (b) the requested length must (typically?) be a multiple of the
  same

When writing to a traditional, variable-length tape device, each
write() call writes exactly one physical block to the tape:
  (a) the new block starts wherever the head was positioned at
  the time, or somewhere after that if the drive had to skip
  past a bad spot on the tape
  (b) the length of the new block is precisely the length passed
  to write()

When reading from such a tape device, each read() call reads
exactly one physical block from the tape:
  (a) at the start of the call, the heads are guaranteed *not* to
  be positioned in the middle of a block; they might be
  precisely at the start of the next block to be read, or
  there might first be a bad spot in the tape to be skipped
  over, which doesn't contain any data.

  The hardware enforces this constraint, perhaps with help
  from the driver.  Unlike disk devices, tape devices aren't
  seekable; thus the user code has no way to position the
  tape incorrectly.  (Skipping backwards and forwards is
  always done in whole blocks -- hence the names,
  backward/forward skip *record*.)

  (b) the read buffer must be at least large enough to contain
  the entire block.  The (single) block will be read in its
  entirety, and its actual length returned by read().


Block devices are a different story; they go through the buffer
cache, the same as regular disk files do, so none of the above
restrictions apply.  The cost of that convenience is lower
performance; the kernel breaks up large read() and write() calls
into cache-buffer-sized chunks, so you might have to wait almost
an entire disk rotation for each sector, whereas with the
character device, you'd only have to pay that cost once for the
entire read() or write() call.  (Roughly analogous to
shoeshining a tape drive vs. letting it stream :-))

(I think Linux is a bit weird in this respect; its disk special
files are block devices, but seem to behave more like character
devices.  Let me be perfectly clear:  I don't know this for sure.
I'm inferring it from Linus's claim that dump can't get a
consistent view of a Linux file system.  If Linux's block disk
devices *acted* like block devices, AFAICT his claim would have
to be false.  But the claim must be true -- I mean, if anyone
knows, it's Linus, right?  Hence the initial assumption must be
false.  QED.  (His more general dissing of dump isn't fact,
it's opinion, and so has no bearing here.))

I remember once -- probably back in V6 days -- seeing a UNIX with
both block and character special files for tapes.  That is, each
drive had (at least) all of these variants:

rewinding   non-rewinding
-   -
block   /dev/mt0/dev/nmt0
character   /dev/rmt0   /dev/nrmt0

The character version of a tape device worked as described above.
The block version went through the buffer cache like any other
block device, which resulted in tapes with 512-byte blocks, no
matter how much you write()ed -- uh, wrote()? :-) -- in one
call.  That'd waste a *lot* of tape; it's not surprising that I
haven't seen a block-special file for a tape in a very long time.

The only optimization that's left for dd to perform is a small one:
if ibs and obs are the same, it can save a tiny amount of CPU
time by not using an inner loop; it can just do something like
this 

Re: amrestore problem, headers ok but no data

2005-01-08 Thread Eric Siegerman
Apologies for following up my own post.

On Sat, Jan 08, 2005 at 06:20:59PM -0500, Eric Siegerman wrote:
 [...] in neither case does [dd] need to copy the data
 from one buffer to another.  It can just have a single buffer
 that's max(ibs,obs) long [...]

Oops; I'm wrong about this.  It's only true if ibs is an exact
multiple of obs, or vice versa.  Otherwise, the data *will* need
to be copied.  I have no idea whether any dd implementations do
in fact optimize the exact-multiple case.

--

|  | /\
|-_|/ Eric Siegerman, Toronto, Ont.[EMAIL PROTECTED]
|  |  /
The animal that coils in a circle is the serpent; that's why so
many cults and myths of the serpent exist, because it's hard to
represent the return of the sun by the coiling of a hippopotamus.
- Umberto Eco, Foucault's Pendulum


Re: amrestore problem, headers ok but no data

2005-01-07 Thread Stefan G. Weichinger

Hello, Brian,

I thought of your problems this morning ;-)

Just now (on 01/07/2005 at 17:17) you wrote:

BC Oddly trying to dd if=/dev/rmt/tps... read no data

BC samar 85# mt -f /dev/rmt/tps1d4nrns rewind
BC samar 86# dd if=/dev/rmt/tps1d4nrns of=scratch
BC Read error: Invalid argument

Invalid argument !

Gene wrote:

GH Then you can use a
GH #dd if=/path/to/device of=scratch count=1

Notice the option count, dd has to be told how much to read/write
...

BC However, I ran amdump last night. Still having problems with TAR DLE
BC though oddly I was able to see that a DUMP DLE attempted to write.

BC I was able to retrieve the file, using both amrestore and Eric's
BC suggestion of manually issuing the dd command to get the file from
BC tape. I was able to open the dump file (DLE for /usr1) and saw that
BC the file kmitra was present. This I thought to be good news since
BC the only top level file on the partition is kmitra/ (note directlry
BC slash). Unfortuantely xfsdump reported the file as a regular file
BC and not a directory and I was unable to proceed from there.

Do your AMANDA-binaries point to the proper xfs-tools? Is the proper
xfsdump used?

BC I've tried to retrieve several of the TAR DLE but have been unsuccessful
BC with either method.

BC On the issue of streaming the drive vs horsepower. We have a holding
BC disk (unlike a few early runs of this config) and we seem to dump to
BC tape quickly enough once the dumper portion completes. I don't know that
BC we are polishing the tape.

BC Something very basic is wrong, looks like the 15th config is the
BC unlucky one.

What about setting up a second config on this host, with just one
small DLE and a few tapes for testing? Maybe you can dig things up
with this.

And BTW: could you please try to strip the older mails from your
replies to save bandwidth, your last mail was about 26k in size, with
only a few new lines in it. Thank you.

-- 
Best regards,
  Stefan




Re: amrestore problem, headers ok but no data

2005-01-07 Thread Brian Cuttler

Stefan,

Yes, will strip more stuff out (didn't realize it had grown so much.

The instruction I was following from Gene where pretty explicite
(not that I follow all that well) and where intended to save the
amanda header, reset the block count, restore the header and flush
the buffers. Unfortunately I failed to retrieve the headers and the
command was simple enough that I'm baffled at having incorrectly 
coded it, invalid argument, must be me. I'll try again.

 Do your AMANDA-binaries point to the proper xfs-tools? Is the proper
 xfsdump used?

Truth be told, I'd have expected dump and restore to be in the same
directory as one another, but this matches the config on other, working
Irix amanda systems.

samar 126# which xfsrestore
/sbin/xfsrestore
samar 127# which xfsdump
/usr/sbin/xfsdump

Yes, I can try a different config on the system, or for that matter
just save/edit the disklist and amanda.conf, its not like I am likely
to lose a working config.

thanks,

Brian
---
   Brian R Cuttler [EMAIL PROTECTED]
   Computer Systems Support(v) 518 486-1697
   Wadsworth Center(f) 518 473-6384
   NYS Department of HealthHelp Desk 518 473-0773



Re: amrestore problem, headers ok but no data

2005-01-07 Thread Stefan G. Weichinger
Hi, Brian,

on Freitag, 07. Jänner 2005 at 17:59 you wrote to amanda-users:

BC The instruction I was following from Gene where pretty explicite
BC (not that I follow all that well)

;-)

BC and where intended to save the
BC amanda header, reset the block count, restore the header and flush
BC the buffers. Unfortunately I failed to retrieve the headers and the
BC command was simple enough that I'm baffled at having incorrectly 
BC coded it, invalid argument, must be me. I'll try again.

Let's see.

 Do your AMANDA-binaries point to the proper xfs-tools? Is the proper
 xfsdump used?

BC Truth be told, I'd have expected dump and restore to be in the same
BC directory as one another, but this matches the config on other, working
BC Irix amanda systems.

BC samar 126# which xfsrestore
BC /sbin/xfsrestore
BC samar 127# which xfsdump
BC /usr/sbin/xfsdump

And these two binaries are found and used by the configure-script?

BC Yes, I can try a different config on the system, or for that matter
BC just save/edit the disklist and amanda.conf, its not like I am likely
BC to lose a working config.

I would do a copy to a second config and substitute the config-name
where it has to be done (amanda.conf, the paths and stuff, you know
that). Then you can fiddle around with that second config until you
figure something out.
-- 
best regards,
Stefan

Stefan G. Weichinger
mailto:[EMAIL PROTECTED]






Re: amrestore problem, headers ok but no data

2005-01-07 Thread Gene Heskett
On Friday 07 January 2005 11:59, Brian Cuttler wrote:
Stefan,

Yes, will strip more stuff out (didn't realize it had grown so much.

The instruction I was following from Gene where pretty explicite
(not that I follow all that well) and where intended to save the
amanda header, reset the block count, restore the header and flush
the buffers. Unfortunately I failed to retrieve the headers and the
command was simple enough that I'm baffled at having incorrectly
coded it, invalid argument, must be me. I'll try again.

Humm, are you saying that amlabel works and can read the label 
written, but that dd doesn't/cannot?  Back to 'man dd' as it exists 
on that Irix system would be my next suggestion.

 Do your AMANDA-binaries point to the proper xfs-tools? Is the
 proper xfsdump used?

Truth be told, I'd have expected dump and restore to be in the same
directory as one another, but this matches the config on other,
 working Irix amanda systems.

samar 126# which xfsrestore
/sbin/xfsrestore
samar 127# which xfsdump
/usr/sbin/xfsdump

Yes, I can try a different config on the system, or for that matter
just save/edit the disklist and amanda.conf, its not like I am
 likely to lose a working config.

   thanks,

   Brian
---
   Brian R Cuttler [EMAIL PROTECTED]
   Computer Systems Support(v) 518 486-1697
   Wadsworth Center(f) 518 473-6384
   NYS Department of HealthHelp Desk 518 473-0773

-- 
Cheers, Gene
There are four boxes to be used in defense of liberty:
 soap, ballot, jury, and ammo. Please use in that order.
-Ed Howdershelt (Author)
99.31% setiathome rank, not too shabby for a WV hillbilly
Yahoo.com attorneys please note, additions to this message
by Gene Heskett are:
Copyright 2004 by Maurice Eugene Heskett, all rights reserved.


Re: amrestore problem, headers ok but no data

2005-01-07 Thread Brian Cuttler
Gene,

 Humm, are you saying that amlabel works and can read the label 
 written, but that dd doesn't/cannot?  Back to 'man dd' as it exists 
 on that Irix system would be my next suggestion.

Remembering that I'd set the blocksize on the device and relabeled
yesterday I tried this.

samar 160# mt -f /dev/sdlt2 rewind
samar 161# dd of=/dev/sdlt2 bs=32k if=./scratch
1+0 records in
1+0 records out

Which worked fine.

I then tried to re-write the label with dd

samar 170# dd of=/dev/sdlt2 obs=32k if=./scratch 
64+0 records in
1+0 records out

bs block size, obs outpub BS, (there is an IBS also, which I
am afraid of developing should this not resolve soon)

Actually the xfsrestore file is a link to the executable in the /usr/sbin
directory. It does make sense, after a fashion (why not a link for the
xfsdump executable as well).

 samar 126# which xfsrestore
 /sbin/xfsrestore
 samar 127# which xfsdump
 /usr/sbin/xfsdump

I'm now going to attempt to run amdump with a much shorter disklist
and see if I can't get a rational result on this tape.

---
   Brian R Cuttler [EMAIL PROTECTED]
   Computer Systems Support(v) 518 486-1697
   Wadsworth Center(f) 518 473-6384
   NYS Department of HealthHelp Desk 518 473-0773



Re[2]: amrestore problem, headers ok but no data

2005-01-07 Thread Stefan G. Weichinger

Hey Brian,

just now (on 01/07/2005 at 19:20) you wrote:

 Humm, are you saying that amlabel works and can read the label 
 written, but that dd doesn't/cannot?  Back to 'man dd' as it exists
 on that Irix system would be my next suggestion.

BC Remembering that I'd set the blocksize on the device and relabeled
BC yesterday I tried this.

BC samar 160# mt -f /dev/sdlt2 rewind
BC samar 161# dd of=/dev/sdlt2 bs=32k if=./scratch
BC 1+0 records in
BC 1+0 records out

BC Which worked fine.

Good to hear. What size did scratch have?
Much nicer device-name now, BTW, maybe this was the problem ... Do you
use this one (the corresponding non-rewinding device) in amanda.conf
as well ?? 

BC I then tried to re-write the label with dd

BC samar 170# dd of=/dev/sdlt2 obs=32k if=./scratch 
BC 64+0 records in
BC 1+0 records out

Ok, I see, you had a bs=512 (0.5k) when you read and wrote the same
file with bs=32k (64 times bigger now).

BC Actually the xfsrestore file is a link to the executable in the /usr/sbin
BC directory. It does make sense, after a fashion (why not a link for the
BC xfsdump executable as well).

 samar 126# which xfsrestore
 /sbin/xfsrestore
 samar 127# which xfsdump
 /usr/sbin/xfsdump

As long as AMANDA actually USES it this is OK, yes.

BC I'm now going to attempt to run amdump with a much shorter disklist
BC and see if I can't get a rational result on this tape.

I am looking forward to the results ...

Stefan



Re: amrestore problem, headers ok but no data

2005-01-07 Thread Brian Cuttler
Stefan,

 Good to hear. What size did scratch have?
 Much nicer device-name now, BTW, maybe this was the problem ... Do you
 use this one (the corresponding non-rewinding device) in amanda.conf
 as well ?? 

I'll have to try to pull it again, I seem to have subsequently 
overwritten it with a zero length file.

 BC I then tried to re-write the label with dd
 
 BC samar 170# dd of=/dev/sdlt2 obs=32k if=./scratch 
 BC 64+0 records in
 BC 1+0 records out
 
 Ok, I see, you had a bs=512 (0.5k) when you read and wrote the same
 file with bs=32k (64 times bigger now).
 
 BC Actually the xfsrestore file is a link to the executable in the /usr/sbin
 BC directory. It does make sense, after a fashion (why not a link for the
 BC xfsdump executable as well).
 
  samar 126# which xfsrestore
  /sbin/xfsrestore
  samar 127# which xfsdump
  /usr/sbin/xfsdump
 
 As long as AMANDA actually USES it this is OK, yes.

I had intended (and forgot) to write, that I checked the amdump log
file and that the paths and names of the programs are correct.

 I am looking forward to the results ...

yah, me too.

---
   Brian R Cuttler [EMAIL PROTECTED]
   Computer Systems Support(v) 518 486-1697
   Wadsworth Center(f) 518 473-6384
   NYS Department of HealthHelp Desk 518 473-0773



Re: amrestore problem, headers ok but no data

2005-01-07 Thread Eric Siegerman
On Fri, Jan 07, 2005 at 11:59:54AM -0500, Brian Cuttler wrote:
 samar 126# which xfsrestore
 /sbin/xfsrestore
 samar 127# which xfsdump
 /usr/sbin/xfsdump

I suppose the theory must be that anyone can do a restore, but
only root can use [xfs]dump.

--

|  | /\
|-_|/ Eric Siegerman, Toronto, Ont.[EMAIL PROTECTED]
|  |  /
The animal that coils in a circle is the serpent; that's why so
many cults and myths of the serpent exist, because it's hard to
represent the return of the sun by the coiling of a hippopotamus.
- Umberto Eco, Foucault's Pendulum


Re: amrestore problem, headers ok but no data

2005-01-07 Thread Jon LaBadie
On Fri, Jan 07, 2005 at 02:01:26PM -0500, Eric Siegerman wrote:
 On Fri, Jan 07, 2005 at 11:17:21AM -0500, Brian Cuttler wrote:

samar 170# dd of=/dev/sdlt2 obs=32k if=./scratch
64+0 records in
1+0 records out

bs block size, obs outpub BS, (there is an IBS also, which I
am afraid of developing should this not resolve soon)


bs is just a short cut to saying obs and ibs.
So the following are the same:

   obs=32k ibs=32k
bs=32k

It is only necessary to use obs and ibs if they are different
(including an unspecified one of the default 512 bytes).

  
  Oddly trying to dd if=/dev/rmt/tps... read no data
  
  samar 85# mt -f /dev/rmt/tps1d4nrns rewind
  samar 86# dd if=/dev/rmt/tps1d4nrns of=scratch
  Read error: Invalid argument
  0+0 records in
  0+0 records out
 
 These two things might well be related.  That dd command, without
 a bs= argument, is trying to read 512-byte blocks.  But the
 physical blocks on the tape are 32 KB -- your adjustments have
 seen to that.  It would be appropriate for the read() call to
 fail in that situation, as indeed it did.  On Solaris (whose man
 pages I have access to at the moment), the error status would be
 ENOMEM; perhaps on your system it's EINVAL == Invalid argument
 instead.  (The place to look that up would likely be in the man
 page for the tape driver -- st(7) is where I found the Solaris
 version.)

That ENOMEM, reported as insufficient memory sometimes used to
throw me for a loop.  Here is the situation as I understand it.

To enhance performance dd tries to do unbuffered I/O, meaning the
data goes directly to memory in dd rather than through buffers
in the OS and then to dd.  An upshot of this is that the buffer
dd reads into must be at least as big as a block on the device.
As there is no OS buffering, dd must get the entire block from
the device in one shot, no taking little chunks at a time.  The
devices do not send bytes on request, they send blocks.

So if dd is left with a default 512 byte ibs, input block size,
but the device is using a larger block size, like an amanda tape
of 32k, dd has allocated a 512 byte piece of memory to hold the
input data.  But when dd requests the first block it unexectedly
gets 32k of data and has insufficient memory (ENOMEM).

The reverse is not really a problem.  Suppose you said ibs=128k.
dd would simply read sufficient device blocks until the buffer
was filled, four blocks in the above example.  The problems arise
when the device wants to feed dd more data per read than dd is
prepared to receive.

On output dd can make its own adjustments.  If the obs is larger,
it can move multiple input buffers to the output buffer before
doing the write.  If the reverse is true, input block size larger
than output, it can copy part of the input block to the output
buffer and do multiple outputs from a single input buffer.

HTH
jon
-- 
Jon H. LaBadie  [EMAIL PROTECTED]
 JG Computing
 4455 Province Line Road(609) 252-0159
 Princeton, NJ  08540-4322  (609) 683-7220 (fax)


Re: amrestore problem, headers ok but no data

2005-01-07 Thread Eric Siegerman
On Fri, Jan 07, 2005 at 01:20:42PM -0500, Brian Cuttler wrote:
 samar 170# dd of=/dev/sdlt2 obs=32k if=./scratch 
 64+0 records in
 1+0 records out
 
 bs block size, obs outpub BS, (there is an IBS also, which I
 am afraid of developing should this not resolve soon)

Yup, this makes sense.  Since you specified obs, the input
block size remained at the default of 512 bytes.  dd read
enough of those (64) to make a 32-KB output block and then wrote
the latter.  If the file had been longer, it would have repeated
the process -- 64 512-byte reads followed by 1 32-KB write --
until done.  It can also convert the other way, from a large ibs
to a smaller obs.

For a disk file, the block size doesn't matter much; things speed
up if you use a larger one, but you get the same result except
for dd's stats at the end.  For a tape, block size can be a lot
more important.

With a traditional tape drive that uses variable-length blocks,
each write() system call produces exactly one physical tape
block, whose length is simply the length specified to write().
Correspondingly, each read() call reads exactly one physical tape
block; the length specified to read() must be at least as large
as the block currently under the heads, or something will go
wrong (on Solaris SCSI tapes, as I mentioned before, the read
will fail with ENOMEM; I don't know whether other systems behave
differently, but one thing that almost certainly will *not*
happen is that you get the first chunk of the physical block on
this read(), and the next chunk on the next read().  One physical
block for each read() call is the rule.

This is why dd has separate ibs and obs arguments in the
first place -- to let you reblock a tape, i.e. copy it to another
tape while changing the block size, without making an
intermediate copy on disk.  When copying from one disk file to
another, or between tape and disk, specifying different input and
output block sizes doesn't really accomplish anything.

I don't know how tape drives with fixed-length blocks (or
configured for them, as in your case) work.  Perhaps the drive
does the necessary block-length conversion itself, using its
internal RAM buffer.

--

|  | /\
|-_|/ Eric Siegerman, Toronto, Ont.[EMAIL PROTECTED]
|  |  /
The animal that coils in a circle is the serpent; that's why so
many cults and myths of the serpent exist, because it's hard to
represent the return of the sun by the coiling of a hippopotamus.
- Umberto Eco, Foucault's Pendulum


Re: amrestore problem, headers ok but no data

2005-01-07 Thread Brian Cuttler

samar 5# /usr/local/sbin/amrestore -r /dev/sdlt2
amrestore:   0: skipping start of tape: date 20050107 label SAMAR05
amrestore:   1: restoring samar._usr5_amanda.20050107.1
amrestore: read error: I/O error

samar 6# mt -f /dev/sdlt2 rewind
samar 7# mt -f /dev/sdlt2 fsf 1
samar 8# dd if=/dev/sdlt2 bs=32768 of=samar._usr5_amanda.20050107.1 skip=1
Read error: I/O error
1+0 records in
1+0 records out

samar 9# mt -f /dev/sdlt2 rewind
samar 10# mt -f /dev/sdlt2 fsf 1
samar 11# dd if=/dev/sdlt2 bs=32768 of=samar._usr5_amanda.20050107.1 
Read error: I/O error
1+0 records in
1+0 records out

samar 12# mt -f /dev/sdlt2 rewind
samar 13# mt -f /dev/sdlt2 fsf 1
samar 14# cat -evt /dev/sdlt2 | more
Input error: Invalid argument (/dev/sdlt2)

samar 15# mt -f /dev/sdlt2 rewind
samar 16# mt -f /dev/sdlt2 fsf 1
samar 17# dd if=/dev/sdlt2 bs=32768| cat -evt | more
AMANDA: FILE 20050107 samar /usr5/amanda lev 1 comp .gz program /usr/local/sbin/
gnutar$
To restore, position tape at start of file and run:$
^Idd if=tape bs=32k skip=1 | /usr/sbin/gzip -dc | usr/local/sbin/gnutar -f... 
-$
^L$
[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@
[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@


 ** many null characters removed.



samar 18# mt -f /dev/sdlt2 rewind
samar 19# mt -f /dev/sdlt2 fsf 1
samar 23# dd if=/dev/sdlt2 bs=32768 skip=1 | /usr/sbin/gzip -dc | 
/usr/local/sbin/gnutar -tf - | more
Read error: I/O error
0+0 records in
0+0 records out

gzip: stdin: unexpected end of file


I had followed Gene's instructions and saved the label via DD and
restored it to the tape as shown in an earlier email. This was the
procedure to set the blksize in the tape and dd the label back.

samar 24# mt -f /dev/sdlt2 blksize

 Recommended tape I/O size: 131072 bytes (256 512-byte blocks)
 Minimum block size: 4 byte(s)
 Maximum block size: 16777212 bytes
 Current block size: Variable

I am using /dev/sdlt2 link to /dev/rmt/tps1d4nrns - no rewind, non-byte swap
device. I am no longer using the trailing 'v' device which would normally
provide variable length blocks.

I will demonstrate with the next tape SAMAR06 in slot 5.

samar 27# mt -f /dev/sdlt2 rewind
samar 28# dd if=/dev/sdlt2 of=./scratch
Read error: Invalid argument
0+0 records in
0+0 records out

Remembering that I'd setblksize yesterday prior to running amlabel -f

samar 29# dd if=/dev/sdlt2 of=./scratch bs=32k
1+0 records in
1+0 records out

samar 30# ls -las scratch
  64 -rw-r--r--1 root sys32768 Jan  7 15:36 scratch

AMANDA: TAPESTART DATE X TAPE SAMAR06$
^L$
[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@
[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@

 ** many nulls removed


samar 32# mt -f /dev/sdlt2 blksize

 Recommended tape I/O size: 131072 bytes (256 512-byte blocks)
 Minimum block size: 4 byte(s)
 Maximum block size: 16777212 bytes
 Current block size: 32768 byte(s)

samar 33# mt -f /dev/sdlt2 rewind

samar 34# mt -f /dev/sdlt2 blksize

 Recommended tape I/O size: 131072 bytes (256 512-byte blocks)
 Minimum block size: 4 byte(s)
 Maximum block size: 16777212 bytes
 Current block size: 32768 byte(s)

samar 35# dd bs=32768 if=scratch of=/dev/sdlt2
1+0 records in
1+0 records out

That out of the way I'll go back and see what the other outstanding
questions/tests where.

---
   Brian R Cuttler [EMAIL PROTECTED]
   Computer Systems Support(v) 518 486-1697
   Wadsworth Center(f) 518 473-6384
   NYS Department of HealthHelp Desk 518 473-0773



Re: amrestore problem, headers ok but no data

2005-01-07 Thread Jon LaBadie
On Fri, Jan 07, 2005 at 03:40:12PM -0500, Brian Cuttler wrote:
 
 samar 24# mt -f /dev/sdlt2 blksize
 
  Recommended tape I/O size: 131072 bytes (256 512-byte blocks)
  Minimum block size: 4 byte(s)
  Maximum block size: 16777212 bytes
  Current block size: Variable
 
 I am using /dev/sdlt2 link to /dev/rmt/tps1d4nrns - no rewind, non-byte swap
 device. I am no longer using the trailing 'v' device which would normally
 provide variable length blocks.
 

Brian,
as you were using the 'v', might the data on the tapes have been written with
some  32K block size?  All of your dd's seem to fail after the header.

How about trying a couple of mt fsf # to some file in the middle (on the
no rewind device), then do dd's with bs=128k and/or bs=1024k and/or bs=16384k.

jl
-- 
Jon H. LaBadie  [EMAIL PROTECTED]
 JG Computing
 4455 Province Line Road(609) 252-0159
 Princeton, NJ  08540-4322  (609) 683-7220 (fax)


Re: amrestore problem, headers ok but no data

2005-01-07 Thread Gene Heskett
On Friday 07 January 2005 15:40, Brian Cuttler wrote:
samar 5# /usr/local/sbin/amrestore -r /dev/sdlt2
amrestore:   0: skipping start of tape: date 20050107 label SAMAR05
amrestore:   1: restoring samar._usr5_amanda.20050107.1
amrestore: read error: I/O error

samar 6# mt -f /dev/sdlt2 rewind
samar 7# mt -f /dev/sdlt2 fsf 1
samar 8# dd if=/dev/sdlt2 bs=32768 of=samar._usr5_amanda.20050107.1
 skip=1 Read error: I/O error
1+0 records in
1+0 records out

samar 9# mt -f /dev/sdlt2 rewind
samar 10# mt -f /dev/sdlt2 fsf 1
samar 11# dd if=/dev/sdlt2 bs=32768 of=samar._usr5_amanda.20050107.1
Read error: I/O error
1+0 records in
1+0 records out

samar 12# mt -f /dev/sdlt2 rewind
samar 13# mt -f /dev/sdlt2 fsf 1
samar 14# cat -evt /dev/sdlt2 | more
Input error: Invalid argument (/dev/sdlt2)

samar 15# mt -f /dev/sdlt2 rewind
samar 16# mt -f /dev/sdlt2 fsf 1
samar 17# dd if=/dev/sdlt2 bs=32768| cat -evt | more
AMANDA: FILE 20050107 samar /usr5/amanda lev 1 comp .gz program
 /usr/local/sbin/ gnutar$
To restore, position tape at start of file and run:$
^Idd if=tape bs=32k skip=1 | /usr/sbin/gzip -dc |
 usr/local/sbin/gnutar -f... -$
^L$
[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
PROTECTED]@
[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@
 [EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
 PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
 PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
 PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
 PROTECTED]@[EMAIL PROTECTED]
@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@


 ** many null characters removed.



samar 18# mt -f /dev/sdlt2 rewind
samar 19# mt -f /dev/sdlt2 fsf 1
samar 23# dd if=/dev/sdlt2 bs=32768 skip=1 | /usr/sbin/gzip -dc |
 /usr/local/sbin/gnutar -tf - | more Read error: I/O error
0+0 records in
0+0 records out

gzip: stdin: unexpected end of file


I had followed Gene's instructions and saved the label via DD and
restored it to the tape as shown in an earlier email. This was the
procedure to set the blksize in the tape and dd the label back.

samar 24# mt -f /dev/sdlt2 blksize

This is querying the tape only, and I guess the bigger tapes like 
bigger blocksizes by default.
 Recommended tape I/O size: 131072 bytes (256 512-byte blocks)
 Minimum block size: 4 byte(s)
 Maximum block size: 16777212 bytes
 Current block size: Variable
This I think, is not good.

I am using /dev/sdlt2 link to /dev/rmt/tps1d4nrns - no rewind,
 non-byte swap device. I am no longer using the trailing 'v' device
 which would normally provide variable length blocks.

I will demonstrate with the next tape SAMAR06 in slot 5.

samar 27# mt -f /dev/sdlt2 rewind
samar 28# dd if=/dev/sdlt2 of=./scratch
Read error: Invalid argument
0+0 records in
0+0 records out

Remembering that I'd setblksize yesterday prior to running amlabel
 -f

samar 29# dd if=/dev/sdlt2 of=./scratch bs=32k
1+0 records in
1+0 records out

samar 30# ls -las scratch
  64 -rw-r--r--1 root sys32768 Jan  7 15:36 scratch

Ok, what happens if you don't spec the blocksize, but just a count of 
1 on a rewound tape.  Then what size of scratch file do you get?

AMANDA: TAPESTART DATE X TAPE SAMAR06$
^L$
[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
PROTECTED]@
[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@
 [EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
 PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
 PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
 PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
 PROTECTED]@[EMAIL PROTECTED]
@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@

 ** many nulls removed


samar 32# mt -f /dev/sdlt2 blksize

 Recommended tape I/O size: 131072 bytes (256 512-byte blocks)
 Minimum block size: 4 byte(s)
 Maximum block size: 16777212 bytes
 Current block size: 32768 byte(s)
Aha, this one should be ok.

samar 33# mt -f /dev/sdlt2 rewind

samar 34# mt -f /dev/sdlt2 blksize

 Recommended tape I/O size: 131072 bytes (256 512-byte blocks)
 Minimum block size: 4 byte(s)
 Maximum block size: 16777212 bytes
 Current block size: 32768 byte(s)

samar 35# dd bs=32768 if=scratch of=/dev/sdlt2
1+0 records in
1+0 records out
Which should have obtained the tapes label block, probably with about 
32700 bytes of nulls appended.

That out of the way I'll go back and see what the other outstanding
questions/tests 

Re: amrestore problem, headers ok but no data

2005-01-07 Thread Eric Siegerman
On Fri, Jan 07, 2005 at 03:40:12PM -0500, Brian Cuttler wrote:
 samar 5# /usr/local/sbin/amrestore -r /dev/sdlt2
 amrestore:   0: skipping start of tape: date 20050107 label SAMAR05
 amrestore:   1: restoring samar._usr5_amanda.20050107.1
 amrestore: read error: I/O error
 [likewise with dd bs=32k]

Ooo, I don't like the look of I/O error at all.  That suggests
a hardware or media problem, as opposed to anything that software
has any control over.

If you haven't done so (can't recall; maybe you've already
described this), look in the system logs for errors from that
device.  If there aren't any, it couldn't hurt to take a look at
the driver's man page (and pages for the SCSI subsystem in
general -- it is a SCSI drive, isn't it?) to see if more verbose
error reporting can be enabled.

If you come up with anything, please post it.

 samar 12# mt -f /dev/sdlt2 rewind
 samar 13# mt -f /dev/sdlt2 fsf 1
 samar 14# cat -evt /dev/sdlt2 | more
 Input error: Invalid argument (/dev/sdlt2)

This one's to be expected.  cat uses a smaller block size --
and unlike dd, you can't change it.

 samar 15# mt -f /dev/sdlt2 rewind
 samar 16# mt -f /dev/sdlt2 fsf 1
 samar 17# dd if=/dev/sdlt2 bs=32768| cat -evt | more
 AMANDA: FILE 20050107 samar /usr5/amanda lev 1 comp .gz program 
 /usr/local/sbin/
 gnutar$
 To restore, position tape at start of file and run:$
 ^Idd if=tape bs=32k skip=1 | /usr/sbin/gzip -dc | usr/local/sbin/gnutar 
 -f... 
 -$
 ^L$
 [EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
 PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
 PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
 PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
 PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
 PROTECTED]@
 [EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
 PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
 PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
 PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
 PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
 PROTECTED]@
 
 
  ** many null characters removed.

Hmmm, I'd have expected to see Read error: I/O error again
here, seeing as you get it on every other 32-KB read.  Odd that
the message is missing while the main output is still as though
the error had occurred.

 samar 18# mt -f /dev/sdlt2 rewind
 samar 19# mt -f /dev/sdlt2 fsf 1
 samar 23# dd if=/dev/sdlt2 bs=32768 skip=1 | /usr/sbin/gzip -dc | 
 /usr/local/sbin/gnutar -tf - | more
 Read error: I/O error
 0+0 records in
 0+0 records out
 
 gzip: stdin: unexpected end of file

Yeah, more of same... If dd can't get the bits off the tape,
there's not much point trying to do different things with its
nonexistent output :-(

 I had followed Gene's instructions

I've never had to use these, so my comments here are pretty
tentative.

 [Commands to show that the tape drive is set for
 variable-length blocks, stash the label in ./scratch, and
 verify that the label was stashed ok.]

I don't see the command to set the drive to 32768-byte blocks,
but it's presumably there becuse:

 samar 32# mt -f /dev/sdlt2 blksize
 
  Recommended tape I/O size: 131072 bytes (256 512-byte blocks)
  Minimum block size: 4 byte(s)
  Maximum block size: 16777212 bytes
  Current block size: 32768 byte(s)
 
 samar 33# mt -f /dev/sdlt2 rewind
 
 samar 34# mt -f /dev/sdlt2 blksize
 
  Recommended tape I/O size: 131072 bytes (256 512-byte blocks)
  Minimum block size: 4 byte(s)
  Maximum block size: 16777212 bytes
  Current block size: 32768 byte(s)
 
 samar 35# dd bs=32768 if=scratch of=/dev/sdlt2
 1+0 records in
 1+0 records out

If I recall from Gene's description of the problem, it's only
when you go to *read* a tape that your setting gets magically
zapped.  So it couldn't hurt to do a final:
mt rewind
dd bs=32k /dev/sdlt2 ./scratch2
mt -f /dev/sdlt2 blksize
to verify that that hasn't happened.

--

|  | /\
|-_|/ Eric Siegerman, Toronto, Ont.[EMAIL PROTECTED]
|  |  /
The animal that coils in a circle is the serpent; that's why so
many cults and myths of the serpent exist, because it's hard to
represent the return of the sun by the coiling of a hippopotamus.
- Umberto Eco, Foucault's Pendulum


Re: amrestore problem, headers ok but no data

2005-01-01 Thread Brian Cuttler

Gene,

Thanks for the tape label sequence, will re-label the next couple
of tapes and see if I don't get different results from the next
amdump run. (commands reproduced below).

If this solves the problem perhaps it should be incorporated into amlabel.

Alexander,

Hadn't realized that the 1st chunk on disk was broken into the
amanda header followed by the collected chuncks. Makes sense if
you think about it.

New information--

The amdump run I started yesterday completed in record time. 51 hours
of dump time and only 33 hours of elapse time.

I tried to amrestore several dumps from the tape to disk, oddly
the restores of the DUMP DLE both failed, the restore of the TAR
DLE sort of worked, I can... I'm an idiot.

I was able to take the DLE restored from tape and # tar -tf the
tar ball but was unable to restore it with # tar -xvf. On second
look I've realized that I can, and MUST, use gnutar and was then
able to restore the files.

Suddenly its looking like TAR DLEs are a non-issue. (well, move below)

Things are still anomalous with the DUMP DLEs though.

samar 22# mt -f /dev/sdlt2 rewind
samar 23# /usr/local/sbin/amrestore /dev/sdlt2 samar /usr1
amrestore:   0: skipping start of tape: date 20041229 label SAMAR24
amrestore:   1: skipping samar._usr5_lalor.20041229.0
amrestore:   2: skipping samar._usr5_tapu.20041229.1
amrestore:   3: skipping samar._.20041229.1
amrestore:   4: skipping samar._usr5_amanda.20041229.1
amrestore:   5: skipping samar._usr5_bimal.20041229.1
amrestore:   6: skipping samar._usr5_allengr.20041229.2
amrestore:   7: skipping samar._usr5_skaur.20041229.1
amrestore:   8: skipping samar._usr5_leith.20041229.1
amrestore:   9: skipping samar._usr5_hxgao.20041229.1
amrestore:  10: skipping samar._usr5_ninggao.20041229.1
amrestore:  11: restoring samar._usr1.20041229.0
amrestore: read error: I/O error

gzip: stdin: unexpected end of file

This created a zero length file for samar._usr1.20041229.0

Same result for the DUMP DLE for the root partition.

Same result when I tried to pipe directly to xfsrestore.

samar 31# /usr/local/sbin/amrestore -p /dev/sdlt2 samar /usr5/bimal | 
/usr/local/sbin/gnutar -tf -
amrestore:   0: skipping start of tape: date 20041229 label SAMAR24
amrestore:   1: skipping samar._usr5_lalor.20041229.0
amrestore:   2: skipping samar._usr5_tapu.20041229.1
amrestore:   3: skipping samar._.20041229.1
amrestore:   4: skipping samar._usr5_amanda.20041229.1
amrestore:   5: restoring samar._usr5_bimal.20041229.1
./
./cvs_test/
./cvs_test/spider/
./cvs_test/spider/CVS/
./cvs_test/spider/man/
amrestore./cvs_test/spider/man/CVS/
: read error: I/O error./cvs_test/spider/src/

./cvs_test/spider/src/CVS/
./hpc/
./hpc/eve/
./hpc/eve/hpc/
./hpc/eve/hpc/dynamic/
./hpc/eve/hpc/dynamic/new/

 --lines removed for brevity (like that'll help now).

./pick_part/classify/Power_Spectra/power/
./pick_part/classify/Reconstruction/
./pick_part/classify/Refinement/
./pick_part/final/
./pick_part/final/0deg_proj_asym_mask/
./pick_part/final/0deg_proj_asym_mask/interpolated/
./pick_part/final/0deg_proj_asym_mask/interpolated/new/
./pick_part/final/0deg_proj_sym_mask/

gzip: stdin: unexpected end of file
./pick_part/final/0deg_proj_sym_mask/interpolated/
/usr/local/sbin/gnutar: Unexpected EOF in archive
/usr/local/sbin/gnutar: Error is not recoverable: exiting now

Actually at this point I'm not sure... that file name is familiar
and I don't know we didn't error while writing the file to tape
in the first place.

No errors reported in the amdump report, read of disk was completely
clean, unlike the previous output that I posted.

We are close, but the results of this run seem to weaken the argument
for disk block/record issues.








 I can switch to a fixed length device by removing the trailing 'v'
 in the device name. It hasn't been an issue on other systems though
 I admit that this SDLT on IRIX is unique at my site.
 
 Those are commands in the 'mt' arsenal Brian.  And there have been 
 known to be platform diffs, so consult the manpages for your 
 version, and, make sure you have the latest version.  
 
 Mine possibly isn't quite fresh and 100% green, but it works and returns:
 
 mt-st v. 0.7
 
 for an mt --version command
 
  Back when I was running real tapes, I found that a 32kilobyte
  record was measureable faster than the default of 512 bytes.  But,
  if everything isn't set to the same record length, the whole thing
  blew up for me.  Is there any way you can make it run fixed length
  records on that lashup?
 
 Block size on tape. Where is that set, is this a function of the
 underlying DD command within Amanda or is this elsewhere ?
 
 You can spec the block size used by dd with the bs option, but the 
 tape itself must know about it first by useing the appropriate mt 
 command.  Again, there are platform diffs, but here is the -h 
 output from my copy:
 [EMAIL PROTECTED] root]# mt -h
 usage: mt [-v] [--version] [-h] [ -f device ] command [ count ]
 commands: weof, wset, 

Re: amrestore problem, headers ok but no data

2005-01-01 Thread Gene Heskett
On Friday 31 December 2004 10:49, Brian Cuttler wrote:
Gene,

Thanks for the tape label sequence, will re-label the next couple
of tapes and see if I don't get different results from the next
amdump run. (commands reproduced below).

If this solves the problem perhaps it should be incorporated into
 amlabel.

Alexander,

Hadn't realized that the 1st chunk on disk was broken into the
amanda header followed by the collected chuncks. Makes sense if
you think about it.

New information--

The amdump run I started yesterday completed in record time. 51
 hours of dump time and only 33 hours of elapse time.

I tried to amrestore several dumps from the tape to disk, oddly
the restores of the DUMP DLE both failed, the restore of the TAR
DLE sort of worked, I can... I'm an idiot.

I was able to take the DLE restored from tape and # tar -tf the
tar ball but was unable to restore it with # tar -xvf. On second
look I've realized that I can, and MUST, use gnutar and was then
able to restore the files.

Suddenly its looking like TAR DLEs are a non-issue. (well, move
 below)

Things are still anomalous with the DUMP DLEs though.

samar 22# mt -f /dev/sdlt2 rewind
samar 23# /usr/local/sbin/amrestore /dev/sdlt2 samar /usr1
amrestore:   0: skipping start of tape: date 20041229 label SAMAR24
amrestore:   1: skipping samar._usr5_lalor.20041229.0
amrestore:   2: skipping samar._usr5_tapu.20041229.1
amrestore:   3: skipping samar._.20041229.1
amrestore:   4: skipping samar._usr5_amanda.20041229.1
amrestore:   5: skipping samar._usr5_bimal.20041229.1
amrestore:   6: skipping samar._usr5_allengr.20041229.2
amrestore:   7: skipping samar._usr5_skaur.20041229.1
amrestore:   8: skipping samar._usr5_leith.20041229.1
amrestore:   9: skipping samar._usr5_hxgao.20041229.1
amrestore:  10: skipping samar._usr5_ninggao.20041229.1
amrestore:  11: restoring samar._usr1.20041229.0
amrestore: read error: I/O error

gzip: stdin: unexpected end of file

This created a zero length file for samar._usr1.20041229.0

Same result for the DUMP DLE for the root partition.

Same result when I tried to pipe directly to xfsrestore.

samar 31# /usr/local/sbin/amrestore -p /dev/sdlt2 samar /usr5/bimal
 | /usr/local/sbin/gnutar -tf - amrestore:   0: skipping start of
 tape: date 20041229 label SAMAR24 amrestore:   1: skipping
 samar._usr5_lalor.20041229.0
amrestore:   2: skipping samar._usr5_tapu.20041229.1
amrestore:   3: skipping samar._.20041229.1
amrestore:   4: skipping samar._usr5_amanda.20041229.1
amrestore:   5: restoring samar._usr5_bimal.20041229.1
./
./cvs_test/
./cvs_test/spider/
./cvs_test/spider/CVS/
./cvs_test/spider/man/
amrestore./cvs_test/spider/man/CVS/

: read error: I/O error./cvs_test/spider/src/

./cvs_test/spider/src/CVS/
./hpc/
./hpc/eve/
./hpc/eve/hpc/
./hpc/eve/hpc/dynamic/
./hpc/eve/hpc/dynamic/new/

 --lines removed for brevity (like that'll help now).

./pick_part/classify/Power_Spectra/power/
./pick_part/classify/Reconstruction/
./pick_part/classify/Refinement/
./pick_part/final/
./pick_part/final/0deg_proj_asym_mask/
./pick_part/final/0deg_proj_asym_mask/interpolated/
./pick_part/final/0deg_proj_asym_mask/interpolated/new/
./pick_part/final/0deg_proj_sym_mask/

gzip: stdin: unexpected end of file
./pick_part/final/0deg_proj_sym_mask/interpolated/
/usr/local/sbin/gnutar: Unexpected EOF in archive
/usr/local/sbin/gnutar: Error is not recoverable: exiting now

Actually at this point I'm not sure... that file name is familiar
and I don't know we didn't error while writing the file to tape
in the first place.

No errors reported in the amdump report, read of disk was completely
clean, unlike the previous output that I posted.

We are close, but the results of this run seem to weaken the
 argument for disk block/record issues.

But, otoh, its strengthening the argument to use gnutar only.

And I'd make the suggestion that somewhere in this setup, there is 
insufficient iron to do the job.  I don't know anything about an 
SDLT, but if its taking 33 hours of real time, that drive has got to 
be doing a lot of shoe shining where it collects data till the 
buffer is about full, starts up and dumps that to the tape, then 
stops and does a short rewind so that it doesn't waste a lot of blank 
tape in between data writes.  The server should have enough bandwidth 
at the drive cable to keep the pipe full so that once it starts, it 
streams data at the rate the drive can take it, at least for the 
duration of that dle's dump from the holding disk.  A 150 mhz cyrix 
hasn't the power, and here, a 400mhz k6-II wasn't able to keep a puny 
little DDS2 drive 100% busy either.

Here, using virtual tapes on a dedicated hard drive, a 16GB before 
compression backup run that fits in a bit over 9GB after gzip gets 
through with it, starts a few minutes after midnight, and is often 
done by 2:15 am.  But, during the dump time from the holding disk to 
the virtual tape, the machine is virtually frozen, only the xwindow 
clock seems to keep on counting 

Re: amrestore problem, headers ok but no data

2004-12-30 Thread Stefan G. Weichinger
Hi,
on Wednesday 29 December 2004 16:29, Brian Cuttler wrote:

Amanda2.4.4p1-20030716
SGI/IRIX  6.5.19m
mtx   1.3.8
8 Slot jukebox with SDLT 320 (Quantum)
gnutar1.13.25

I suppose this configuration looks familiar to some of you.

I was in the process of writing an email to thank Stefan Weichinger,
Gene Heskett and Eric Siegerman for the terrific and patient help
 they gave me with my new amanda config.

Welcome ;-)

I am able to see the DLEs on tape but I'm unable to retried any
 data, either from the dump or the tar partitions.

samar 144# mt -f /dev/sdlt2 rewind
samar 145# /usr/local/sbin/amrestore /dev/sdlt2 samar /usr5/dtaylor
amrestore:   0: skipping start of tape: date 20041227 label SAMAR23
amrestore:   1: skipping samar._usr1.20041227.1
amrestore:   2: skipping samar._usr5_lalor.20041227.1
amrestore:   3: skipping samar._usr5_amanda.20041227.0
amrestore:   4: restoring samar._usr5_dtaylor.20041227.1
amrestore: read error: I/O error

gzip: stdin: unexpected end of file

And you don't get a file with this command?
I once had this behavior on one of my machines, just have to remember
...

You could also try to specify the blocksize with the amrestore-option
-b, maybe this is not detected correctly.

Do you use DUMP or TAR for samar._usr5_dtaylor?

I am able to use amrestore with the -r option which results in
files like this


samar 77# more samar._usr5_lalor.20041227.1
AMANDA: FILE 20041227 samar /usr5/lalor lev 1 comp .gz program
 /usr/local/sbin/gnutar To restore, position tape at start of file
 and run:
dd if=tape bs=32k skip=1 | /usr/sbin/gzip -dc |
 usr/local/sbin/gnutar -f... -


where the rest of the file is empty.

Tried this with one of my holdingdisk-chunks, the rest is not empty
here.

 I should be more explicite.

samar 82# ls -las samar._usr5_lalor.20041227.1
  64 -rw-r-   1 root sys 32768 Dec 29 15:36
 samar._usr5_lalor.20041227.1

This one is only 32768 kB in size ... a header.


samar 79# cat -evt samar._usr5_lalor.20041227.1 | head
AMANDA: FILE 20041227 samar /usr5/lalor lev 1 comp .gz program
 /usr/local/sbin/g nutar$
To restore, position tape at start of file and run:$
^Idd if=tape bs=32k skip=1 | /usr/sbin/gzip -dc |
 usr/local/sbin/gnutar -f... -$
^L$
[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
PROTECTED]@[EMAIL PROTECTED]@
[EMAIL PROTECTED]@
 [EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
 PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
 PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
 PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
 PROTECTED]@[EMAIL PROTECTED]
@[EMAIL PROTECTED]@
 [EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
 PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
 PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
 PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
 PROTECTED]@[EMAIL PROTECTED]
@[EMAIL PROTECTED]@
 [EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
 PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
 PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
 PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
 PROTECTED]@[EMAIL PROTECTED]
@[EMAIL PROTECTED]@

The file headers are there, I just don't know where the data has
 gotten to. I certainly see multiple chunksize files in the work
 area when I'm running amdump.

Are the chunks still there? This would explain why the headers have
made it to the tape, while the data (or parts of it) is still in the
holdingdisk. (This is also a lev1-dump, so the question is, if there
were any changes at all?)

What did the REPORT-mail say?

I've otherwise tested the tape drive with the native version of #
 tar and have been able to write and recover data. Besides the
 amrestore listing is out there.

# mt commands seem to run progressively longer as I read the tape,
 I'm apparently writing lots of something between EOF marks, I'm
 just hoping that its actual data.

# ls -las  /dev/sdlt2
lrwxr-xr-x1 root  sys  20 Nov 22 14:45 /dev/sdlt2 -
 /dev/rmt/tps1d4nrnsv

ie, bus 1, target (SCSI id) 4, non-rewinding, non-bytes-wapping,
 variable length records. Similar to what I run on other systems.

Seems ok so far, let's debug the upper points first.
-- 
best regards,
Stefan

Stefan G. Weichinger
mailto:[EMAIL PROTECTED]






Re: amrestore problem, headers ok but no data

2004-12-30 Thread Brian Cuttler

Gene,
Stefan,

* I'm gonna remove a lot of text from the middle of this.

On Wednesday 29 December 2004 16:29, Brian Cuttler wrote:
## Amanda2.4.4p1-20030716
## SGI/IRIX  6.5.19m
## mtx   1.3.8
## 8 Slot jukebox with SDLT 320 (Quantum)
## gnutar1.13.25
## 
## I'm planning to dump 2-3 times per week but have been starting the
## process manually (via the # at command) and have seen dumps in only
## 50 hours, down from hundreds when I initially started the server up.
## 
## By all accounts everything is working properly, some tuning is still
## in order but there are no apparent errors. Or where not until I
##  tried to test the file restore process.
## 
## I am able to see the DLEs on tape but I'm unable to retried any
##  data, either from the dump or the tar partitions.
## 
## samar 144# mt -f /dev/sdlt2 rewind
## samar 145# /usr/local/sbin/amrestore /dev/sdlt2 samar /usr5/dtaylor
## amrestore:   0: skipping start of tape: date 20041227 label SAMAR23
## amrestore:   1: skipping samar._usr1.20041227.1
## amrestore:   2: skipping samar._usr5_lalor.20041227.1
## amrestore:   3: skipping samar._usr5_amanda.20041227.0
## amrestore:   4: restoring samar._usr5_dtaylor.20041227.1
## amrestore: read error: I/O error
## 
## gzip: stdin: unexpected end of file
## 
## 
## I am able to use amrestore with the -r option which results in
## files like this
## 
## 
## samar 77# more samar._usr5_lalor.20041227.1
## AMANDA: FILE 20041227 samar /usr5/lalor lev 1 comp .gz program
##  /usr/local/sbin/gnutar To restore, position tape at start of file
##  and run:
## dd if=tape##  bs=32k skip=1 | /usr/sbin/gzip -dc |
##  usr/local/sbin/gnutar -f... -
## 
## 
## where the rest of the file is empty. I should be more explicite.
## 
## samar 82# ls -las samar._usr5_lalor.20041227.1
##   64 -rw-r-   1 root sys 32768 Dec 29 15:36
##  samar._usr5_lalor.20041227.1
## 
## samar 79# cat -evt samar._usr5_lalor.20041227.1 | head
## AMANDA: FILE 20041227 samar /usr5/lalor lev 1 comp .gz program
##  /usr/local/sbin/g nutar$
## To restore, position tape at start of file and run:$
## ^Idd if=tape##  bs=32k skip=1 | /usr/sbin/gzip -dc |
##  usr/local/sbin/gnutar -f... -$
## ^L$
## [EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
PROTECTED]@[EMAIL PROTECTED]@
## [EMAIL PROTECTED]@
##  [EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
PROTECTED]@[EMAIL PROTECTED]
## @[EMAIL PROTECTED]@
##  [EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
PROTECTED]@[EMAIL PROTECTED]
## @[EMAIL PROTECTED]@
##  [EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
PROTECTED]@[EMAIL PROTECTED]
## @[EMAIL PROTECTED]@
## 
## The file headers are there, I just don't know where the data has
##  gotten to. I certainly see multiple chunksize files in the work
##  area when I'm running amdump.
## 
## I've otherwise tested the tape drive with the native version of #
##  tar and have been able to write and recover data. Besides the
##  amrestore listing is out there.
## 
## # mt commands seem to run progressively longer as I read the tape,
##  I'm apparently writing lots of something between EOF marks, I'm
##  just hoping that its actual data.
## 
## # ls -las  /dev/sdlt2
## lrwxr-xr-x1 root  sys  20 Nov 22 14:45 /dev/sdlt2 -## 
##  /dev/rmt/tps1d4nrnsv

 The two things I'd question are: what version of tar?  We've found 
 that tar-1.13-19 and 1.13-25.  We've had a report of 1.14 not working 
 a week or 2 back up the log.  And earlier versions than 1.13-19 were 
 mistakenly putting a number of some kind at the start of the header, 
 which wasn't compatible with manda.

Confirm, I am using GNU tar 1.13.25

samar 35# /usr/local/sbin/gnutar --version  
tar (GNU tar) 1.13.25

 And I'd observe that amanda generally uses a fixed length record.

I can switch to a fixed length device by removing the trailing 'v'
in the device name. It hasn't been an issue on other systems though
I admit that this SDLT on IRIX is unique at my site.

 Back when I was running real tapes, I found that a 32kilobyte record 
 was 

Re: amrestore problem, headers ok but no data

2004-12-30 Thread Gene Heskett
On Thursday 30 December 2004 14:55, Brian Cuttler wrote:
Gene,
Stefan,

* I'm gonna remove a lot of text from the middle of this.

On Wednesday 29 December 2004 16:29, Brian Cuttler wrote:
## Amanda2.4.4p1-20030716
## SGI/IRIX  6.5.19m
## mtx   1.3.8
## 8 Slot jukebox with SDLT 320 (Quantum)
## gnutar1.13.25
##
## I'm planning to dump 2-3 times per week but have been starting
 the ## process manually (via the # at command) and have seen dumps
 in only ## 50 hours, down from hundreds when I initially started
 the server up. ##
## By all accounts everything is working properly, some tuning is
 still ## in order but there are no apparent errors. Or where not
 until I ##  tried to test the file restore process.
##
## I am able to see the DLEs on tape but I'm unable to retried any
##  data, either from the dump or the tar partitions.
##
## samar 144# mt -f /dev/sdlt2 rewind
## samar 145# /usr/local/sbin/amrestore /dev/sdlt2 samar
 /usr5/dtaylor ## amrestore:   0: skipping start of tape: date
 20041227 label SAMAR23 ## amrestore:   1: skipping
 samar._usr1.20041227.1
## amrestore:   2: skipping samar._usr5_lalor.20041227.1
## amrestore:   3: skipping samar._usr5_amanda.20041227.0
## amrestore:   4: restoring samar._usr5_dtaylor.20041227.1
## amrestore: read error: I/O error
##
## gzip: stdin: unexpected end of file
##
##
## I am able to use amrestore with the -r option which results in
## files like this
##
##
## samar 77# more samar._usr5_lalor.20041227.1
## AMANDA: FILE 20041227 samar /usr5/lalor lev 1 comp .gz program
##  /usr/local/sbin/gnutar To restore, position tape at start of
 file ##  and run:
## dd if=tape##  bs=32k skip=1 | /usr/sbin/gzip -dc |
##  usr/local/sbin/gnutar -f... -
##
##
## where the rest of the file is empty. I should be more explicite.
##
## samar 82# ls -las samar._usr5_lalor.20041227.1
##   64 -rw-r-   1 root sys 32768 Dec 29 15:36
##  samar._usr5_lalor.20041227.1
##
## samar 79# cat -evt samar._usr5_lalor.20041227.1 | head
## AMANDA: FILE 20041227 samar /usr5/lalor lev 1 comp .gz program
##  /usr/local/sbin/g nutar$
## To restore, position tape at start of file and run:$
## ^Idd if=tape##  bs=32k skip=1 | /usr/sbin/gzip -dc |
##  usr/local/sbin/gnutar -f... -$
## ^L$
##
 [EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
 PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
 PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
 PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
 PROTECTED]@[EMAIL PROTECTED]
@ ## [EMAIL PROTECTED]@
## 
 [EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
 PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
 PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
 PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
 PROTECTED]@[EMAIL PROTECTED]
 ## @[EMAIL PROTECTED]@
## 
 [EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
 PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
 PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
 PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
 PROTECTED]@[EMAIL PROTECTED]
 ## @[EMAIL PROTECTED]@
## 
 [EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
 PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
 PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
 PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
 PROTECTED]@[EMAIL PROTECTED]
 ## @[EMAIL PROTECTED]@
##
## The file headers are there, I just don't know where the data has
##  gotten to. I certainly see multiple chunksize files in the
 work ##  area when I'm running amdump.
##
## I've otherwise tested the tape drive with the native version of #
##  tar and have been able to write and recover data. Besides the
##  amrestore listing is out there.
##
## # mt commands seem to run progressively longer as I read the
 tape, ##  I'm apparently writing lots of something between EOF
 marks, I'm ##  just hoping that its actual data.
##
## # ls -las  /dev/sdlt2
## lrwxr-xr-x1 root  sys  20 Nov 22 14:45 /dev/sdlt2 -##
##  /dev/rmt/tps1d4nrnsv

 The two things I'd question are: what version of tar?  We've found
 that tar-1.13-19 and 1.13-25.  We've had a report of 1.14 not
 working a week or 2 back up the log.  And earlier versions than
 1.13-19 were mistakenly putting a number of some kind at the start
 of the header, which wasn't compatible with manda.

Confirm, I am using GNU tar 1.13.25

Good version

samar 35# /usr/local/sbin/gnutar --version  
tar (GNU tar) 1.13.25

 And I'd observe that amanda generally uses a fixed length record.

I can switch to a fixed length device by removing the trailing 'v'
in the device name. It hasn't been an issue on other systems though
I admit that this SDLT on IRIX is unique at my site.


amrestore problem, headers ok but no data

2004-12-29 Thread Brian Cuttler

Amanda2.4.4p1-20030716
SGI/IRIX  6.5.19m
mtx   1.3.8
8 Slot jukebox with SDLT 320 (Quantum)
gnutar1.13.25

I suppose this configuration looks familiar to some of you.

I was in the process of writing an email to thank Stefan Weichinger,
Gene Heskett and Eric Siegerman for the terrific and patient help they
gave me with my new amanda config.

I don't wish to diminish the thanks but must sadly ask for further help.
Unfortuanately I have yet another issue... but we'll get to that later.

Currently we are using the jukebox, have DLE types of both xfsdump for
the root and /usr1 partitions and of type TAR (gnutar) for DLEs that
match the (top of partition) user directories on the Raid Array (.86 Tbytes).

I've assigned a directory on the raid array to act as the work area
(but was careful to not list it as a DLE) and have increased the maxdump
to something larger than 1. We are still client contrained but 3 concurrent
dumps is an aweful lot better than one. [I should also say that this
system is the sole client of the server resident on this system]. I have
also set the dump cycle to 2 weeks rather than one.

I'm planning to dump 2-3 times per week but have been starting the
process manually (via the # at command) and have seen dumps in only
50 hours, down from hundreds when I initially started the server up.

By all accounts everything is working properly, some tuning is still
in order but there are no apparent errors. Or where not until I tried
to test the file restore process.

I am able to see the DLEs on tape but I'm unable to retried any data,
either from the dump or the tar partitions.

samar 144# mt -f /dev/sdlt2 rewind
samar 145# /usr/local/sbin/amrestore /dev/sdlt2 samar /usr5/dtaylor
amrestore:   0: skipping start of tape: date 20041227 label SAMAR23
amrestore:   1: skipping samar._usr1.20041227.1
amrestore:   2: skipping samar._usr5_lalor.20041227.1
amrestore:   3: skipping samar._usr5_amanda.20041227.0
amrestore:   4: restoring samar._usr5_dtaylor.20041227.1
amrestore: read error: I/O error

gzip: stdin: unexpected end of file


I am able to use amrestore with the -r option which results in
files like this


samar 77# more samar._usr5_lalor.20041227.1
AMANDA: FILE 20041227 samar /usr5/lalor lev 1 comp .gz program 
/usr/local/sbin/gnutar
To restore, position tape at start of file and run:
dd if=tape bs=32k skip=1 | /usr/sbin/gzip -dc | usr/local/sbin/gnutar 
-f... -


where the rest of the file is empty. I should be more explicite.

samar 82# ls -las samar._usr5_lalor.20041227.1
  64 -rw-r-   1 root sys 32768 Dec 29 15:36 samar._usr5_lalor.20041227.1

samar 79# cat -evt samar._usr5_lalor.20041227.1 | head
AMANDA: FILE 20041227 samar /usr5/lalor lev 1 comp .gz program /usr/local/sbin/g
nutar$
To restore, position tape at start of file and run:$
^Idd if=tape bs=32k skip=1 | /usr/sbin/gzip -dc | usr/local/sbin/gnutar -f... 
-$
^L$
[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
PROTECTED]@[EMAIL PROTECTED]@
[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
PROTECTED]@[EMAIL PROTECTED]@
[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
PROTECTED]@[EMAIL PROTECTED]@
[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
PROTECTED]@[EMAIL PROTECTED]@

The file headers are there, I just don't know where the data has gotten to.
I certainly see multiple chunksize files in the work area when I'm running
amdump.

I've otherwise tested the tape drive with the native version of # tar
and have been able to write and recover data. Besides the amrestore
listing is out there.

# mt commands seem to run progressively longer as I read the tape, I'm
apparently writing lots of something between EOF marks, I'm just hoping
that its actual data.

# ls -las  /dev/sdlt2
lrwxr-xr-x1 root  sys  20 Nov 22 14:45 /dev/sdlt2 - /dev/rmt/tps1d4nrnsv

ie, bus 1, target (SCSI id) 4, non-rewinding, non-bytes-wapping, variable
length records. Similar to what I run on other systems.

Thank you all for your help, past, 

Re: amrestore problem, headers ok but no data

2004-12-29 Thread Gene Heskett
On Wednesday 29 December 2004 16:29, Brian Cuttler wrote:
Amanda2.4.4p1-20030716
SGI/IRIX  6.5.19m
mtx   1.3.8
8 Slot jukebox with SDLT 320 (Quantum)
gnutar1.13.25

I suppose this configuration looks familiar to some of you.

I was in the process of writing an email to thank Stefan Weichinger,
Gene Heskett and Eric Siegerman for the terrific and patient help
 they gave me with my new amanda config.

I don't wish to diminish the thanks but must sadly ask for further
 help. Unfortuanately I have yet another issue... but we'll get to
 that later.

Currently we are using the jukebox, have DLE types of both xfsdump
 for the root and /usr1 partitions and of type TAR (gnutar) for DLEs
 that match the (top of partition) user directories on the Raid
 Array (.86 Tbytes).

I've assigned a directory on the raid array to act as the work area
(but was careful to not list it as a DLE) and have increased the
 maxdump to something larger than 1. We are still client contrained
 but 3 concurrent dumps is an aweful lot better than one. [I should
 also say that this system is the sole client of the server resident
 on this system]. I have also set the dump cycle to 2 weeks rather
 than one.

I'm planning to dump 2-3 times per week but have been starting the
process manually (via the # at command) and have seen dumps in only
50 hours, down from hundreds when I initially started the server up.

By all accounts everything is working properly, some tuning is still
in order but there are no apparent errors. Or where not until I
 tried to test the file restore process.

I am able to see the DLEs on tape but I'm unable to retried any
 data, either from the dump or the tar partitions.

samar 144# mt -f /dev/sdlt2 rewind
samar 145# /usr/local/sbin/amrestore /dev/sdlt2 samar /usr5/dtaylor
amrestore:   0: skipping start of tape: date 20041227 label SAMAR23
amrestore:   1: skipping samar._usr1.20041227.1
amrestore:   2: skipping samar._usr5_lalor.20041227.1
amrestore:   3: skipping samar._usr5_amanda.20041227.0
amrestore:   4: restoring samar._usr5_dtaylor.20041227.1
amrestore: read error: I/O error

gzip: stdin: unexpected end of file


I am able to use amrestore with the -r option which results in
files like this


samar 77# more samar._usr5_lalor.20041227.1
AMANDA: FILE 20041227 samar /usr5/lalor lev 1 comp .gz program
 /usr/local/sbin/gnutar To restore, position tape at start of file
 and run:
dd if=tape bs=32k skip=1 | /usr/sbin/gzip -dc |
 usr/local/sbin/gnutar -f... -


where the rest of the file is empty. I should be more explicite.

samar 82# ls -las samar._usr5_lalor.20041227.1
  64 -rw-r-   1 root sys 32768 Dec 29 15:36
 samar._usr5_lalor.20041227.1

samar 79# cat -evt samar._usr5_lalor.20041227.1 | head
AMANDA: FILE 20041227 samar /usr5/lalor lev 1 comp .gz program
 /usr/local/sbin/g nutar$
To restore, position tape at start of file and run:$
^Idd if=tape bs=32k skip=1 | /usr/sbin/gzip -dc |
 usr/local/sbin/gnutar -f... -$
^L$
[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
PROTECTED]@
[EMAIL PROTECTED]@
 [EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
 PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
 PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
 PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
 PROTECTED]@[EMAIL PROTECTED]
@[EMAIL PROTECTED]@
 [EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
 PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
 PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
 PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
 PROTECTED]@[EMAIL PROTECTED]
@[EMAIL PROTECTED]@
 [EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
 PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
 PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
 PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL PROTECTED]@[EMAIL 
 PROTECTED]@[EMAIL PROTECTED]
@[EMAIL PROTECTED]@

The file headers are there, I just don't know where the data has
 gotten to. I certainly see multiple chunksize files in the work
 area when I'm running amdump.

I've otherwise tested the tape drive with the native version of #
 tar and have been able to write and recover data. Besides the
 amrestore listing is out there.

# mt commands seem to run progressively longer as I read the tape,
 I'm apparently writing lots of something between EOF marks, I'm
 just hoping that its actual data.

# ls -las  /dev/sdlt2
lrwxr-xr-x1 root  sys  20 Nov 22 14:45 /dev/sdlt2 -
 /dev/rmt/tps1d4nrnsv

ie, bus 1, target (SCSI id) 4, non-rewinding, non-bytes-wapping,
 variable length