Re: [Bacula-users] Time for change

2008-12-19 Thread Alan Brown
On Thu, 18 Dec 2008, Jesper Krogh wrote:

  I'm multiplexing anything up to 20 jobs at a time. To ensure that small
  incremental and diff jobs are dumped in one hit and to ensure that full
  backups are laid in as large chunks as possible, this is the kindof size
  which is required.

 Are you having that many clients or is it actually larger volumes that have
 been split up in smaller filesets?

Mostly larger volumes broken into 1Tb filesets. There are about 150
desktop boxes being backed up but their load is minor.

 Yes, I use spooling even for our +2TB volumes.

I hope you've benchmarked restore times. Between the sheer volume of data
and the possibilities of several million files, we settled on 1Tb as a
workable unit for current technology and even that's had trouble when
restoring some filesystems (10 million file restores make Bacula-dir
unhappy!)

  250Tb startup, 1Pb within 18 months, and growing past that. Filesizes
  measured in tens of Mb apiece.

 Still over Gigabit netwok or do you have better infrastructure?

Right now the links are Gb, with some LACP overlaying being done, but as
this larger fileset comes online most stuff will be moved to 8Gb/s fibre
(IP over F/O) and 10Gb/s ethernet. Prices on this gear (especially
switches!) are falling fast so it makes sense to not deploy until actually
needed.




--
___
Bacula-users mailing list
Bacula-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bacula-users


Re: [Bacula-users] Time for change

2008-12-18 Thread Arno Lehmann
Hi,

18.12.2008 06:50, Jesper Krogh wrote:
 Alan Brown wrote:
 Jesper Krogh wrote:
 I'm running spooling on a 4 drive software raid0 quite happily on a 4Gb
 3GHz P4D machine. The limiting factors are disk head seek time(*) when
 running concurrent backups to 2 LTO2 drives and available SATA ports.
 Because of that I'm considering dropping in solid state disks.
 
 I still have got to see a reasonable priced SSD' disk that can deliver 
 around 100MB/s both ways at the same time.
   
 There aren't any mechanical disks which can do it either.

 Which is why I'm not trying to do that - replacing 4 RAID0 mechanical 
 disks with 4 SSDs will provide similar sustained throughput to the 
 mechanical RAID0, but provide _much_ better performance for anything 
 where the mechanical disks had head seeking involved - such as multiple 
 simultaneous input/output streams to LTO drives.

 http://www.slashgear.com/samsung-64gb-ssd-performance-benchmarks-278717/

   
 Make sure you compare apples with something remotely looking like 
 apples. The ONLY SSds which are suitable fo this kind of use are SLCs, 
 not MLCs
 
 Ok. I havent spend enough time in that area.

In short, MLC's are cheaper but slower. There are claims they are less 
reliable, i.e. allow lesser rewrites.

 I have beefed up my director with sufficient amount of memory and 
 mounted it as a ramdisk for spooling. That doesn't impose any 
 limitations on the 2 LTO3 drives attached.
   
 How much do you regard as sufficient?

 100-200Gb ram and systems capable of addressing that amount of memory 
 are still far more expensive than a stack of flash drives, else I'd use 
 them.
 
 But do you need to spool a complete tape? In order to avoid doing evil 
 stuff to you tape drive, much less is sufficient.

Well, Alan probably runs jobs spanning more than one tape. Also, he's 
running jobs concurrently. Now, if he wants to avoid interleaving of 
different jobs, he needs to spool complete jobs, and probably several 
jobs of more than LTO3 capacity - thus the need for a rather large 
spool space.

 My concern isn't just backup run time.
 
 So you'd like to spool a complete Job? Whats you average job-size? (mine 
 is less than 8GB) if its larger, we just need a period of despooling 
 (I'd love to have concurrent spooling/despooling in bacula). Currenly I 
 use at most 32GB for spooling area, with a Job Concurrency at 4 and 2 
 tape drives.

I believe Alan is handling data sets a bit larger...

 When doing large backups(full+archive) I mostly have one
 (or two) drives in action at the same time while spooling to disk with 
 2(or 3) threads at the same time. The LTO3 drives far outperform our 
 network speed (1gbit). Transfer to tapes are in the range from 60MB/s to 
 100MB/s (and I unfortunately have no idea why they spread that much).
 
 In total numbers we're around 25TB to disk/month with monthly full and
 daily incrementals.
 
 Concerned about job run time, its my impression that spool space only 
 speeds up incremental/differential.

Depends - the important thing is that spooling allows multiple 
concurrent jobs without or with less interleaving.

 Restore times are also important and having a tape read back 1Gb, then 
 seek, then pull back another 1Gb (or even 10Gb) is a significant 
 penalty  over reading larger blocks when worst-case 75Tb+ restores are 
 considered (25-60 days on 2 drive LTO2, dpeending on the directory 
 structures being restored.)
 
 Whats the time consuming part in this? Seeking on tapes? Neither SSD's 
 or memory will change that.

Moving tapes, loading and unloading times. Seek times add to that, but 
are not that bad with LTO.

 AFAIK the spooling area is only used when 
 going TO tape, not FROM tape.

Right, but if you manage to have all jobs in one continuos block on 
tape, you minimize the number of volumes needed per job, and thus save 
much of the volume handling time.

 And spooling doesnt need any form for persistence, so its fine that its 
 gone after reboot
 Indeed.

 If it was practical I'd use ramdisks. Right now it's not. Apart from the 
 cost factor there is very little hardware which can address more than 
 128Gb of Dram. There are RAM arrays which are setup to operate as F/O 
 scsi devices, but these are currently silly money as they're marketed 
 at the world of high end, high cost databases.
 
 Again, I assume we're talking about spooling space, then try to think 
 about if you need that much.

I believe Alan thought about it (and I hope my reasoning above is 
correct :-)

 In 12 months time that may change, Ram is always falling in price - but 
 Flash drive pricing is falling faster,  performance/durability is rising 
 at the same time and there isn't the same issue with massive address 
 ranges as it just looks like more disk, vs having to change out entire 
 servers at $20k a time if RAM limits are reached.

 I'm not just looking at the issue of my current setup. Projects are 
 already pencilled onsite which will 

Re: [Bacula-users] Time for change

2008-12-18 Thread Alan Brown
On Thu, 18 Dec 2008, Jesper Krogh wrote:

  100-200Gb ram and systems capable of addressing that amount of memory are
  still far more expensive than a stack of flash drives, else I'd use them.

 But do you need to spool a complete tape? In order to avoid doing evil stuff
 to you tape drive, much less is sufficient.

I'm multiplexing anything up to 20 jobs at a time. To ensure that small
incremental and diff jobs are dumped in one hit and to ensure that full
backups are laid in as large chunks as possible, this is the kindof size
which is required.

  My concern isn't just backup run time.

 So you'd like to spool a complete Job? Whats you average job-size? (mine is
 less than 8GB)

Full backups run 500Gb to 1Tb apiece, the average nightly incremental is
about 80-150Gb - multiplied out by 90 filesystems.

You get the idea.

 Concerned about job run time, its my impression that spool space only speeds
 up incremental/differential.

It does at the moment. HOWEVER if spooling isn't used then jobs are
interleaved on the tape at record time, resulting in massive shoeshining
on restores and restore throughput rates measured in kb/Sec instead of
25-40Mb/sec

 Whats the time consuming part in this? Seeking on tapes? Neither SSD's or
 memory will change that. AFAIK the spooling area is only used when going TO
 tape, not FROM tape.

Using large spool areas increases the size of the chunks dropped to tape
and thus allows the tapes to stream for longer periods. ONE of my backups
might span 5 LTO2s

 Can you give some numbers, so we have a feeling about the sizes you talk
 about?

250Tb startup, 1Pb within 18 months, and growing past that. Filesizes
measured in tens of Mb apiece.




--
SF.Net email is Sponsored by MIX09, March 18-20, 2009 in Las Vegas, Nevada.
The future of the web can't happen without you.  Join us at MIX09 to help
pave the way to the Next Web now. Learn more and register at
http://ad.doubleclick.net/clk;208669438;13503038;i?http://2009.visitmix.com/
___
Bacula-users mailing list
Bacula-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bacula-users


Re: [Bacula-users] Time for change

2008-12-18 Thread Jesper Krogh
Thanks for the elaborate reply. Just a few more querious questions.

Alan Brown wrote:
 On Thu, 18 Dec 2008, Jesper Krogh wrote:
 
 100-200Gb ram and systems capable of addressing that amount of memory are
 still far more expensive than a stack of flash drives, else I'd use them.
 But do you need to spool a complete tape? In order to avoid doing evil 
 stuff
 to you tape drive, much less is sufficient.
 
 I'm multiplexing anything up to 20 jobs at a time. To ensure that small
 incremental and diff jobs are dumped in one hit and to ensure that full
 backups are laid in as large chunks as possible, this is the kindof size
 which is required.

Are you having that many clients or is it actually larger volumes that 
have been split up in smaller filesets?

 My concern isn't just backup run time.
 So you'd like to spool a complete Job? Whats you average job-size? (mine is
 less than 8GB)
 
 Full backups run 500Gb to 1Tb apiece, the average nightly incremental is
 about 80-150Gb - multiplied out by 90 filesystems.
 
 You get the idea.

Absolutely.

 Concerned about job run time, its my impression that spool space only speeds
 up incremental/differential.
 
 It does at the moment. HOWEVER if spooling isn't used then jobs are
 interleaved on the tape at record time, resulting in massive shoeshining
 on restores and restore throughput rates measured in kb/Sec instead of
 25-40Mb/sec

Yes, I use spooling even for our +2TB volumes.

 Whats the time consuming part in this? Seeking on tapes? Neither SSD's or
 memory will change that. AFAIK the spooling area is only used when going TO
 tape, not FROM tape.
 
 Using large spool areas increases the size of the chunks dropped to tape
 and thus allows the tapes to stream for longer periods. ONE of my backups
 might span 5 LTO2s

I can have the same for LTO3, but that due to a few large jobs running 
full backups concurrenly.

 Can you give some numbers, so we have a feeling about the sizes you talk
 about?
 
 250Tb startup, 1Pb within 18 months, and growing past that. Filesizes
 measured in tens of Mb apiece.

Still over Gigabit netwok or do you have better infrastructure?

-- 
Jesper

--
SF.Net email is Sponsored by MIX09, March 18-20, 2009 in Las Vegas, Nevada.
The future of the web can't happen without you.  Join us at MIX09 to help
pave the way to the Next Web now. Learn more and register at
http://ad.doubleclick.net/clk;208669438;13503038;i?http://2009.visitmix.com/
___
Bacula-users mailing list
Bacula-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bacula-users


Re: [Bacula-users] Time for change

2008-12-17 Thread Alan Brown
On Wed, 10 Dec 2008, Josh Fisher wrote:

  I'm running spooling on a 4 drive software raid0 quite happily on a 4Gb
  3GHz P4D machine. The limiting factors are disk head seek time(*) when
  running concurrent backups to 2 LTO2 drives and available SATA ports.
  Because of that I'm considering dropping in solid state disks.
 

 I've been wondering about those, but was thinking it would be better used as
 DB storage for Mysql, where I/Os per second is going to be more important than
 throughput. I see your point, though, where concurrent jobs are running.

There's no way in the world that I'd put any Database on RAID0, for
pretty obvious reasons.


In the case of Mysql, the biggest win will be simply having enough memory
to prevent the system from paging and having the mysql system tuned
appropriately to use it.

Past that, good fast arrays will help, but not as much as having
sufficient memory in the first place (where sufficient may be 16Gb or
more for large databases)

AB




--
SF.Net email is Sponsored by MIX09, March 18-20, 2009 in Las Vegas, Nevada.
The future of the web can't happen without you.  Join us at MIX09 to help
pave the way to the Next Web now. Learn more and register at
http://ad.doubleclick.net/clk;208669438;13503038;i?http://2009.visitmix.com/
___
Bacula-users mailing list
Bacula-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bacula-users


Re: [Bacula-users] Time for change

2008-12-17 Thread Jesper Krogh
Alan Brown wrote:
 On Tue, 9 Dec 2008, Brian Debelius wrote:
 
 John Drescher wrote:
 In linux, I find this to be completely wrong. I have 15TB of software
 raid 6 and the most load that it puts on the cpu is around 7% and
 these are raid arrays that net over 200MB/s writes on single core
 systems that are 3 or so years old.
 
 So what do you think a reasonable cpu for bacula would be?
 
 I'm running spooling on a 4 drive software raid0 quite happily on a 4Gb
 3GHz P4D machine. The limiting factors are disk head seek time(*) when
 running concurrent backups to 2 LTO2 drives and available SATA ports.
 Because of that I'm considering dropping in solid state disks.

I still have got to see a reasonable priced SSD' disk that can deliver 
around 100MB/s both ways at the same time.

http://www.slashgear.com/samsung-64gb-ssd-performance-benchmarks-278717/

I have beefed up my director with sufficient amount of memory and 
mounted it as a ramdisk for spooling. That doesn't impose any 
limitations on the 2 LTO3 drives attached.

And spooling doesnt need any form for persistence, so its fine that its 
gone after reboot.

-- 
Jesper

--
SF.Net email is Sponsored by MIX09, March 18-20, 2009 in Las Vegas, Nevada.
The future of the web can't happen without you.  Join us at MIX09 to help
pave the way to the Next Web now. Learn more and register at
http://ad.doubleclick.net/clk;208669438;13503038;i?http://2009.visitmix.com/
___
Bacula-users mailing list
Bacula-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bacula-users


Re: [Bacula-users] Time for change

2008-12-17 Thread Alan Brown
Jesper Krogh wrote:
 I'm running spooling on a 4 drive software raid0 quite happily on a 4Gb
 3GHz P4D machine. The limiting factors are disk head seek time(*) when
 running concurrent backups to 2 LTO2 drives and available SATA ports.
 Because of that I'm considering dropping in solid state disks.
 

 I still have got to see a reasonable priced SSD' disk that can deliver 
 around 100MB/s both ways at the same time.
   
There aren't any mechanical disks which can do it either.

Which is why I'm not trying to do that - replacing 4 RAID0 mechanical 
disks with 4 SSDs will provide similar sustained throughput to the 
mechanical RAID0, but provide _much_ better performance for anything 
where the mechanical disks had head seeking involved - such as multiple 
simultaneous input/output streams to LTO drives.

 http://www.slashgear.com/samsung-64gb-ssd-performance-benchmarks-278717/

   

Make sure you compare apples with something remotely looking like 
apples. The ONLY SSds which are suitable fo this kind of use are SLCs, 
not MLCs

 I have beefed up my director with sufficient amount of memory and 
 mounted it as a ramdisk for spooling. That doesn't impose any 
 limitations on the 2 LTO3 drives attached.
   
How much do you regard as sufficient?

100-200Gb ram and systems capable of addressing that amount of memory 
are still far more expensive than a stack of flash drives, else I'd use 
them.

My concern isn't just backup run time.

Restore times are also important and having a tape read back 1Gb, then 
seek, then pull back another 1Gb (or even 10Gb) is a significant 
penalty  over reading larger blocks when worst-case 75Tb+ restores are 
considered (25-60 days on 2 drive LTO2, dpeending on the directory 
structures being restored.)

 And spooling doesnt need any form for persistence, so its fine that its 
 gone after reboot
Indeed.

If it was practical I'd use ramdisks. Right now it's not. Apart from the 
cost factor there is very little hardware which can address more than 
128Gb of Dram. There are RAM arrays which are setup to operate as F/O 
scsi devices, but these are currently silly money as they're marketed 
at the world of high end, high cost databases.

In 12 months time that may change, Ram is always falling in price - but 
Flash drive pricing is falling faster,  performance/durability is rising 
at the same time and there isn't the same issue with massive address 
ranges as it just looks like more disk, vs having to change out entire 
servers at $20k a time if RAM limits are reached.

I'm not just looking at the issue of my current setup. Projects are 
already pencilled onsite which will increase storage demands by a factor 
of 20 from the current size within 12 months and I have to try and be 
ready to back that data up.





--
SF.Net email is Sponsored by MIX09, March 18-20, 2009 in Las Vegas, Nevada.
The future of the web can't happen without you.  Join us at MIX09 to help
pave the way to the Next Web now. Learn more and register at
http://ad.doubleclick.net/clk;208669438;13503038;i?http://2009.visitmix.com/
___
Bacula-users mailing list
Bacula-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bacula-users


Re: [Bacula-users] Time for change

2008-12-17 Thread Jesper Krogh
Alan Brown wrote:
 Jesper Krogh wrote:
 I'm running spooling on a 4 drive software raid0 quite happily on a 4Gb
 3GHz P4D machine. The limiting factors are disk head seek time(*) when
 running concurrent backups to 2 LTO2 drives and available SATA ports.
 Because of that I'm considering dropping in solid state disks.
 
 I still have got to see a reasonable priced SSD' disk that can deliver 
 around 100MB/s both ways at the same time.
   
 There aren't any mechanical disks which can do it either.
 
 Which is why I'm not trying to do that - replacing 4 RAID0 mechanical 
 disks with 4 SSDs will provide similar sustained throughput to the 
 mechanical RAID0, but provide _much_ better performance for anything 
 where the mechanical disks had head seeking involved - such as multiple 
 simultaneous input/output streams to LTO drives.
 
 http://www.slashgear.com/samsung-64gb-ssd-performance-benchmarks-278717/

   
 
 Make sure you compare apples with something remotely looking like 
 apples. The ONLY SSds which are suitable fo this kind of use are SLCs, 
 not MLCs

Ok. I havent spend enough time in that area.

 I have beefed up my director with sufficient amount of memory and 
 mounted it as a ramdisk for spooling. That doesn't impose any 
 limitations on the 2 LTO3 drives attached.
   
 How much do you regard as sufficient?
 
 100-200Gb ram and systems capable of addressing that amount of memory 
 are still far more expensive than a stack of flash drives, else I'd use 
 them.

But do you need to spool a complete tape? In order to avoid doing evil 
stuff to you tape drive, much less is sufficient.

 My concern isn't just backup run time.

So you'd like to spool a complete Job? Whats you average job-size? (mine 
is less than 8GB) if its larger, we just need a period of despooling 
(I'd love to have concurrent spooling/despooling in bacula). Currenly I 
use at most 32GB for spooling area, with a Job Concurrency at 4 and 2 
tape drives. When doing large backups(full+archive) I mostly have one
(or two) drives in action at the same time while spooling to disk with 
2(or 3) threads at the same time. The LTO3 drives far outperform our 
network speed (1gbit). Transfer to tapes are in the range from 60MB/s to 
100MB/s (and I unfortunately have no idea why they spread that much).

In total numbers we're around 25TB to disk/month with monthly full and
daily incrementals.

Concerned about job run time, its my impression that spool space only 
speeds up incremental/differential.

 Restore times are also important and having a tape read back 1Gb, then 
 seek, then pull back another 1Gb (or even 10Gb) is a significant 
 penalty  over reading larger blocks when worst-case 75Tb+ restores are 
 considered (25-60 days on 2 drive LTO2, dpeending on the directory 
 structures being restored.)

Whats the time consuming part in this? Seeking on tapes? Neither SSD's 
or memory will change that. AFAIK the spooling area is only used when 
going TO tape, not FROM tape.

 And spooling doesnt need any form for persistence, so its fine that its 
 gone after reboot
 Indeed.
 
 If it was practical I'd use ramdisks. Right now it's not. Apart from the 
 cost factor there is very little hardware which can address more than 
 128Gb of Dram. There are RAM arrays which are setup to operate as F/O 
 scsi devices, but these are currently silly money as they're marketed 
 at the world of high end, high cost databases.

Again, I assume we're talking about spooling space, then try to think 
about if you need that much.

 In 12 months time that may change, Ram is always falling in price - but 
 Flash drive pricing is falling faster,  performance/durability is rising 
 at the same time and there isn't the same issue with massive address 
 ranges as it just looks like more disk, vs having to change out entire 
 servers at $20k a time if RAM limits are reached.
 
 I'm not just looking at the issue of my current setup. Projects are 
 already pencilled onsite which will increase storage demands by a factor 
 of 20 from the current size within 12 months and I have to try and be 
 ready to back that data up.

Can you give some numbers, so we have a feeling about the sizes you talk 
about?


-- 
Jesper

--
SF.Net email is Sponsored by MIX09, March 18-20, 2009 in Las Vegas, Nevada.
The future of the web can't happen without you.  Join us at MIX09 to help
pave the way to the Next Web now. Learn more and register at
http://ad.doubleclick.net/clk;208669438;13503038;i?http://2009.visitmix.com/
___
Bacula-users mailing list
Bacula-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bacula-users


Re: [Bacula-users] Time for change

2008-12-10 Thread Alan Brown
On Tue, 9 Dec 2008, Jeff Kalchik wrote:

 *NEVER* use  software RAID if you can avoid it.  Software RAID puts a
 pretty good hit right on your CPU.

That hasn't been true in Linux for a number of years. Given a modern
machine (less than 2-5 years old) _and sufficient ram_, Linux software
raid is a great deal faster than most cheap dedicated hardware controllers
without having any noticable CPU load.


HOWEVER: as noted by others, Windows software raid performance is very
poor

AB




--
SF.Net email is Sponsored by MIX09, March 18-20, 2009 in Las Vegas, Nevada.
The future of the web can't happen without you.  Join us at MIX09 to help
pave the way to the Next Web now. Learn more and register at
http://ad.doubleclick.net/clk;208669438;13503038;i?http://2009.visitmix.com/
___
Bacula-users mailing list
Bacula-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bacula-users


Re: [Bacula-users] Time for change

2008-12-10 Thread Alan Brown
On Tue, 9 Dec 2008, Lukasz Szybalski wrote:

 I'm not sure about ubuntu, but if you are installing debian you can
 setup raid right there, during installation. It was fairly easy. After
 that bacula setup and you are ready to go.

Ubuntu _is_ Debian, more or less.

There's plenty of configurability in the drive setup and I setup raid1 on
my personal machine in about 5 minutes during the installation procedure
(it took longer to navigate the menus than to configure the raid array)

 Do people have recommendation on that? I actually would like to add a
 5th 500gb drive to my raid but I have no space for it.

If you have a PCIe x1 connector then there are plenty of dumb dual SATA
adaptors available. Quad-SATA can be found with some difficulty (Promise
make one, but it can be hard to obtain)

If you have PCIe x4, then smarter cards are available, up to 16 SATA/SAS
ports.

Alternatively (or as well as...), something like 1 or more LaCie Biggest
or Quadra external eSATA arrays may be worth looking at.

http://www.lacie.com/uk/products/product.htm?pid=5
http://www.lacie.com/uk/products/product.htm?pid=10458

AB




--
SF.Net email is Sponsored by MIX09, March 18-20, 2009 in Las Vegas, Nevada.
The future of the web can't happen without you.  Join us at MIX09 to help
pave the way to the Next Web now. Learn more and register at
http://ad.doubleclick.net/clk;208669438;13503038;i?http://2009.visitmix.com/
___
Bacula-users mailing list
Bacula-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bacula-users


Re: [Bacula-users] Time for change

2008-12-10 Thread Alan Brown
On Tue, 9 Dec 2008, Brian Debelius wrote:

 John Drescher wrote:
  In linux, I find this to be completely wrong. I have 15TB of software
  raid 6 and the most load that it puts on the cpu is around 7% and
  these are raid arrays that net over 200MB/s writes on single core
  systems that are 3 or so years old.

 So what do you think a reasonable cpu for bacula would be?

I'm running spooling on a 4 drive software raid0 quite happily on a 4Gb
3GHz P4D machine. The limiting factors are disk head seek time(*) when
running concurrent backups to 2 LTO2 drives and available SATA ports.
Because of that I'm considering dropping in solid state disks.

(*) The SD array is more than fast enough to drive a single LTO4 drive for
one backup, but when driving 2 LTO2 drives and running up to 6
simultaneous backups, the heads thrash quite hard and throughput slows by
50%. Installing 15krpm drives over the existing 7500rpm ones won't help as
that only reduces seek time by about 10-15%.

This machine also has the Bacula director and Mysql on it, Mysql is the
main memory hog, with the director coming in as the CPU hog. The biggest
single problem is memory - due to the database size (40Gb bacula dumps)
and the existing desktop-class motherboard is maxed out for that.

AB



--
SF.Net email is Sponsored by MIX09, March 18-20, 2009 in Las Vegas, Nevada.
The future of the web can't happen without you.  Join us at MIX09 to help
pave the way to the Next Web now. Learn more and register at
http://ad.doubleclick.net/clk;208669438;13503038;i?http://2009.visitmix.com/
___
Bacula-users mailing list
Bacula-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bacula-users


Re: [Bacula-users] Time for change

2008-12-10 Thread Brian Debelius
Well right now I have an old Asus k8v-se which has an Athlon 64 
processor.  So from the conversation, it seems that it should be enough 
for software raid. But the basboard has only 2 sata ports, and 2 raid 
ports, and 5 PCI slots.  If I wanted more disks I would have to add a 
controller, or two.  Would the PCI bus be a bottleneck?


--
SF.Net email is Sponsored by MIX09, March 18-20, 2009 in Las Vegas, Nevada.
The future of the web can't happen without you.  Join us at MIX09 to help
pave the way to the Next Web now. Learn more and register at
http://ad.doubleclick.net/clk;208669438;13503038;i?http://2009.visitmix.com/
___
Bacula-users mailing list
Bacula-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bacula-users


Re: [Bacula-users] Time for change

2008-12-10 Thread John Drescher
On Wed, Dec 10, 2008 at 10:57 AM, Brian Debelius
[EMAIL PROTECTED] wrote:
 Well right now I have an old Asus k8v-se which has an Athlon 64 processor.

Some of my raid older servers are very similar to this motherboard
with Athlon64 3000 chips.

  So from the conversation, it seems that it should be enough for software
 raid. But the basboard has only 2 sata ports, and 2 raid ports, and 5 PCI
 slots.  If I wanted more disks I would have to add a controller, or two.

Use the raid ports in sata (probably called jbod) mode.

  Would the PCI bus be a bottleneck?

From my testing with a similar using the PCI bus will limit the total
bandwidth to the drives that are hung off that to around 100MB/s. So
for a 6 drive array using 4 mobo ports I was getting around 200MB/s
reads. I am not sure about writes. I think I have the same box still
available, so I probably can run some numbers but that will have to be
later as I have a meeting...

John

--
SF.Net email is Sponsored by MIX09, March 18-20, 2009 in Las Vegas, Nevada.
The future of the web can't happen without you.  Join us at MIX09 to help
pave the way to the Next Web now. Learn more and register at
http://ad.doubleclick.net/clk;208669438;13503038;i?http://2009.visitmix.com/
___
Bacula-users mailing list
Bacula-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bacula-users


Re: [Bacula-users] Time for change

2008-12-10 Thread thing

Lukasz Szybalski wrote:

On Tue, Dec 9, 2008 at 1:04 PM, John Drescher [EMAIL PROTECTED] wrote:
  

So what do you think a reasonable cpu for bacula would be?

  

Depends on what level of performance you are looking for. My director
is a 2 processor 2GHz opteron machine (circa 2003) with 4 GB of memory
and 18 or so  x 250 GB SATA 1 drives in raid 6. My main storage daemon
is on a second machine with the same cpus and ram but only 5 x 250 GB
SATA drives and the storage is a 2 drive x 24 slot LTO 2 autochanger.
Right now the bacula database is on that machine but it has been on
yet another machine. On top of that a typical SATA fileserver will be
a desktop class 2.5GHz Athlon X2  with 4 or 8GB of memory and 5 to 10
750 GB Segate SATA2 drives in raid 6 using mostly the motherboard sata
ports. This type of machine will net 300MB/s writes and even higher
reads in xfs. At 300 MB/s writes the cpu usage never goes above 7% for
the raid under normal conditions. Replacing a drive will make it go to
around 20% of 1 cpu until the drive is re synced.




I would go with software raid, no need to spend extra money unless you
already have one, then go ahead.

See/compare some performance statistics and functionality like swap drives:
http://lucasmanual.com/mywiki/Bacula#head-3ad60a172345da090bfbaa9488f8f9e6b88722b8

Also if you are going with hardware raid will you be using SAS which
supports (SAS + SATA connectors) or just SATA?

I'm not sure about ubuntu, but if you are installing debian you can
setup raid right there, during installation. It was fairly easy. After
that bacula setup and you are ready to go.
It would seem to me anything over 2004 pentium 4 should be enough. The
biggest question is: does the pc you want to purchase has enough space
to fit let say 4 hdd and does the motherboard supports these many hard
drives connectors.

Do people have recommendation on that? I actually would like to add a
5th 500gb drive to my raid but I have no space for it.

Thanks,
Lucas
  


I use an Antec 900 case as has 3 x 5.25inch triple bays, so thats 9 
disks plus then 3 single bays, so a dvd bay and 2 boot disk bays in 
which I have placed  a  dual scsi bay unit...(There is now an antec 1200 
case which has an extra triple bay I think). My Motherboard is an ASUS 
p35 with 6 sata plus 2 esataI run the dvd off the ide channel..one 
triple bay was removed and a 5 bay scsi tray unit installed, so I have 2 
scsi disks as the OS raid1 (in the dual scsi bay) and three scsi R5 as 
the primary data (so I can add two more but scsi are too 
expensive)...then another scsi raid card powers the 6 bay external scsi 
disk pack, that is a R5(4+1) of 146s for more data the 6th is my restore 
disk..(146gb scsi).  I plan to have 2 x 1tB for data as expansion as 
needed on the dual esata and  the other 6 internal internal could be (2 
at the moment) 1Tb disks mounted one at a time to backup to...Tape back 
up is to a scsi powered 2 x DLTiv 75 unit...I dont use it much though 
too manual...considering using single external 1tb fwire attached disks 
instead.


There is also a Corsair 3 x 5.25inch bay unit that holds 4 disk per 
unit, so I could run 13 internal disks, 2 external disks and 2 for the OS...


:)

On the Antec, each bay has a 120mm fan in front of it and the case has a 
120mm fan at the back plus a 200mm fan at the top rearI run them all 
at slow speed and there is little (fan) noise, the scsi disks whine 
noticably though (10,000rpm). At the moment I have a 520HX Corsair PSU, 
high efficiency and a 5 year warrantee...it sits at the bottom of the 
case so air gets drawn over the pci[e] slots, it seems to run at min fan 
speed so cant be stretched the raid cards have the spare ASUS chipset 
fans over them attached to the motherboard...looking at the 600HX unit 
when I add more disks though (the psu will then also be 3 years old)...I 
run a pci graphics card so the 2 x 16 pci-e are free for sata/sas 
controllers...


The Antec 900 I got on special...pleased with it...should last years 
with hardware refreshes every so often...oh the CPU is a dual core 
cerelon at 1.6Ghz and 2.5gb ram about to become 4, I can fit 8 but 2.5 
seems enough, I just happen to have 2 more sitting around so it goes in 
when I next add more disks.


regards

Steven




--
SF.Net email is Sponsored by MIX09, March 18-20, 2009 in Las Vegas, Nevada.
The future of the web can't happen without you.  Join us at MIX09 to help
pave the way to the Next Web now. Learn more and register at
http://ad.doubleclick.net/clk;208669438;13503038;i?http://2009.visitmix.com/___
Bacula-users mailing list
Bacula-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bacula-users


Re: [Bacula-users] Time for change

2008-12-10 Thread Josh Fisher

Alan Brown wrote:
 On Tue, 9 Dec 2008, Brian Debelius wrote:

   
 John Drescher wrote:
 
 In linux, I find this to be completely wrong. I have 15TB of software
 raid 6 and the most load that it puts on the cpu is around 7% and
 these are raid arrays that net over 200MB/s writes on single core
 systems that are 3 or so years old.
   

   
 So what do you think a reasonable cpu for bacula would be?
 

 I'm running spooling on a 4 drive software raid0 quite happily on a 4Gb
 3GHz P4D machine. The limiting factors are disk head seek time(*) when
 running concurrent backups to 2 LTO2 drives and available SATA ports.
 Because of that I'm considering dropping in solid state disks.
   

I've been wondering about those, but was thinking it would be better 
used as DB storage for Mysql, where I/Os per second is going to be more 
important than throughput. I see your point, though, where concurrent 
jobs are running.

 (*) The SD array is more than fast enough to drive a single LTO4 drive for
 one backup, but when driving 2 LTO2 drives and running up to 6
 simultaneous backups, the heads thrash quite hard and throughput slows by
 50%. Installing 15krpm drives over the existing 7500rpm ones won't help as
 that only reduces seek time by about 10-15%.

 This machine also has the Bacula director and Mysql on it, Mysql is the
 main memory hog, with the director coming in as the CPU hog. The biggest
 single problem is memory - due to the database size (40Gb bacula dumps)
 and the existing desktop-class motherboard is maxed out for that.

 AB



 --
 SF.Net email is Sponsored by MIX09, March 18-20, 2009 in Las Vegas, Nevada.
 The future of the web can't happen without you.  Join us at MIX09 to help
 pave the way to the Next Web now. Learn more and register at
 http://ad.doubleclick.net/clk;208669438;13503038;i?http://2009.visitmix.com/
 ___
 Bacula-users mailing list
 Bacula-users@lists.sourceforge.net
 https://lists.sourceforge.net/lists/listinfo/bacula-users
   

--
SF.Net email is Sponsored by MIX09, March 18-20, 2009 in Las Vegas, Nevada.
The future of the web can't happen without you.  Join us at MIX09 to help
pave the way to the Next Web now. Learn more and register at
http://ad.doubleclick.net/clk;208669438;13503038;i?http://2009.visitmix.com/
___
Bacula-users mailing list
Bacula-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bacula-users


Re: [Bacula-users] Time for change

2008-12-09 Thread Jeff Kalchik
 Hello,

   I am currently running Ubuntu.  The bacula director and sd are on this
 box.  I have bacula configured to spool to this box, and then it goes
 directly to tape. I want to change how I am backing up. I would like to
 have some period of time of disk backups, and have them
 moved/migrated/copied to tape.

 Should I consider linux software raid. or use a hardware raid adapter?
 If software does the cpu choice have much of an impact?

 I am thinking of configuring the disk as follows:
   One OS and database disk, maybe a mirror
   Two disks in raid 0 for spooling, and recovering
   Three or more disks in raid 5 for the sd storage.

 Thoughts or comments on this?

*NEVER* use  software RAID if you can avoid it.  Software RAID puts a
pretty good hit right on your CPU.

Jeff Kalchik


--
SF.Net email is Sponsored by MIX09, March 18-20, 2009 in Las Vegas, Nevada.
The future of the web can't happen without you.  Join us at MIX09 to help
pave the way to the Next Web now. Learn more and register at
http://ad.doubleclick.net/clk;208669438;13503038;i?http://2009.visitmix.com/
___
Bacula-users mailing list
Bacula-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bacula-users


Re: [Bacula-users] Time for change

2008-12-09 Thread John Drescher
 *NEVER* use  software RAID if you can avoid it.  Software RAID puts a
 pretty good hit right on your CPU.

In linux, I find this to be completely wrong. I have 15TB of software
raid 6 and the most load that it puts on the cpu is around 7% and
these are raid arrays that net over 200MB/s writes on single core
systems that are 3 or so years old.

In windows software raid performance sucks so you need a hardware card there.

John

--
SF.Net email is Sponsored by MIX09, March 18-20, 2009 in Las Vegas, Nevada.
The future of the web can't happen without you.  Join us at MIX09 to help
pave the way to the Next Web now. Learn more and register at
http://ad.doubleclick.net/clk;208669438;13503038;i?http://2009.visitmix.com/
___
Bacula-users mailing list
Bacula-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bacula-users


Re: [Bacula-users] Time for change

2008-12-09 Thread thing

Normally I would say,

1)  Performance generally ie cpu hit is not a huge issue IMHO, 
especially with todays dual and quad core cpus...and of course ram is 
dirt cheap.
2)  For me the big issues is Ive lost software raid sets from power 
failures for a backup partition that's probably no biggeeunless 
of course your data partition and your backup partition are both 
software raided...and you loose both at once...


From whats posted the backup system is seperate from the production 
servers, in which case software raid would be fine IMHO. If its lost, 
the data should be available on tape and on the production 
serversthe proviso is that you backup to disk overnight and then 
immediately spool to tape..which could be during the day.


regards

Steven



Jeff Kalchik wrote:

Hello,

  I am currently running Ubuntu.  The bacula director and sd are on this
box.  I have bacula configured to spool to this box, and then it goes
directly to tape. I want to change how I am backing up. I would like to
have some period of time of disk backups, and have them
moved/migrated/copied to tape.

Should I consider linux software raid. or use a hardware raid adapter?
If software does the cpu choice have much of an impact?

I am thinking of configuring the disk as follows:
  One OS and database disk, maybe a mirror
  Two disks in raid 0 for spooling, and recovering
  Three or more disks in raid 5 for the sd storage.

Thoughts or comments on this?



*NEVER* use  software RAID if you can avoid it.  Software RAID puts a
pretty good hit right on your CPU.

Jeff Kalchik


--
SF.Net email is Sponsored by MIX09, March 18-20, 2009 in Las Vegas, Nevada.
The future of the web can't happen without you.  Join us at MIX09 to help
pave the way to the Next Web now. Learn more and register at
http://ad.doubleclick.net/clk;208669438;13503038;i?http://2009.visitmix.com/
___
Bacula-users mailing list
Bacula-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bacula-users
  


--
SF.Net email is Sponsored by MIX09, March 18-20, 2009 in Las Vegas, Nevada.
The future of the web can't happen without you.  Join us at MIX09 to help
pave the way to the Next Web now. Learn more and register at
http://ad.doubleclick.net/clk;208669438;13503038;i?http://2009.visitmix.com/___
Bacula-users mailing list
Bacula-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bacula-users


Re: [Bacula-users] Time for change

2008-12-09 Thread John Drescher
 So what do you think a reasonable cpu for bacula would be?

Depends on what level of performance you are looking for. My director
is a 2 processor 2GHz opteron machine (circa 2003) with 4 GB of memory
and 18 or so  x 250 GB SATA 1 drives in raid 6. My main storage daemon
is on a second machine with the same cpus and ram but only 5 x 250 GB
SATA drives and the storage is a 2 drive x 24 slot LTO 2 autochanger.
Right now the bacula database is on that machine but it has been on
yet another machine. On top of that a typical SATA fileserver will be
a desktop class 2.5GHz Athlon X2  with 4 or 8GB of memory and 5 to 10
750 GB Segate SATA2 drives in raid 6 using mostly the motherboard sata
ports. This type of machine will net 300MB/s writes and even higher
reads in xfs. At 300 MB/s writes the cpu usage never goes above 7% for
the raid under normal conditions. Replacing a drive will make it go to
around 20% of 1 cpu until the drive is re synced.

-- 
John M. Drescher

--
SF.Net email is Sponsored by MIX09, March 18-20, 2009 in Las Vegas, Nevada.
The future of the web can't happen without you.  Join us at MIX09 to help
pave the way to the Next Web now. Learn more and register at
http://ad.doubleclick.net/clk;208669438;13503038;i?http://2009.visitmix.com/
___
Bacula-users mailing list
Bacula-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bacula-users


Re: [Bacula-users] Time for change

2008-12-09 Thread Brian Debelius
John Drescher wrote:
 In linux, I find this to be completely wrong. I have 15TB of software
 raid 6 and the most load that it puts on the cpu is around 7% and
 these are raid arrays that net over 200MB/s writes on single core
 systems that are 3 or so years old.
So what do you think a reasonable cpu for bacula would be?

--
SF.Net email is Sponsored by MIX09, March 18-20, 2009 in Las Vegas, Nevada.
The future of the web can't happen without you.  Join us at MIX09 to help
pave the way to the Next Web now. Learn more and register at
http://ad.doubleclick.net/clk;208669438;13503038;i?http://2009.visitmix.com/
___
Bacula-users mailing list
Bacula-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bacula-users


Re: [Bacula-users] Time for change

2008-12-09 Thread Wolfgang Denk
Dear Jeff Kalchik,

In message [EMAIL PROTECTED] you wrote:

 *NEVER* use  software RAID if you can avoid it.  Software RAID puts a
 pretty good hit right on your CPU.

He. So what does me a h/w RAID controller good  when  I  find  myself
having  the  system  in 90% I/O wait? On a file server (that does not
perform other services which would utilize the CPU)  s/w  RAID  gives
much  better  performance  -  at  least in the usage szenarios we see
(which means serving a *big* number of relatively small files).

Best regards,

Wolfgang Denk

-- 
DENX Software Engineering GmbH, MD: Wolfgang Denk  Detlev Zundel
HRB 165235 Munich, Office: Kirchenstr.5, D-82194 Groebenzell, Germany
Phone: (+49)-8142-66989-10 Fax: (+49)-8142-66989-80 Email: [EMAIL PROTECTED]
Generally speaking, there are other ways to accomplish whatever it is
that you think you need ...   - Doug Gwyn

--
SF.Net email is Sponsored by MIX09, March 18-20, 2009 in Las Vegas, Nevada.
The future of the web can't happen without you.  Join us at MIX09 to help
pave the way to the Next Web now. Learn more and register at
http://ad.doubleclick.net/clk;208669438;13503038;i?http://2009.visitmix.com/
___
Bacula-users mailing list
Bacula-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bacula-users


Re: [Bacula-users] Time for change

2008-12-09 Thread Steven Jones
Brian Debelius wrote:
 John Drescher wrote:
 In linux, I find this to be completely wrong. I have 15TB of software
 raid 6 and the most load that it puts on the cpu is around 7% and
 these are raid arrays that net over 200MB/s writes on single core
 systems that are 3 or so years old.
 So what do you think a reasonable cpu for bacula would be?

depends on what you are backing up. If its network backup your network 
is likely to be the first limit...

I have a dual core 1.6Ghz cerelon with 2.5gb of ram, my bottleneck is my 
scsi disks.so if you have money to spend I'd suggest the disks and 
disk controllers are the priority.

regards

Steven

--
SF.Net email is Sponsored by MIX09, March 18-20, 2009 in Las Vegas, Nevada.
The future of the web can't happen without you.  Join us at MIX09 to help
pave the way to the Next Web now. Learn more and register at
http://ad.doubleclick.net/clk;208669438;13503038;i?http://2009.visitmix.com/
___
Bacula-users mailing list
Bacula-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bacula-users


Re: [Bacula-users] Time for change

2008-12-09 Thread Dan Langille
Jeff Kalchik wrote:
 Hello,

   I am currently running Ubuntu.  The bacula director and sd are on this
 box.  I have bacula configured to spool to this box, and then it goes
 directly to tape. I want to change how I am backing up. I would like to
 have some period of time of disk backups, and have them
 moved/migrated/copied to tape.

 Should I consider linux software raid. or use a hardware raid adapter?
 If software does the cpu choice have much of an impact?

 I am thinking of configuring the disk as follows:
   One OS and database disk, maybe a mirror
   Two disks in raid 0 for spooling, and recovering
   Three or more disks in raid 5 for the sd storage.

 Thoughts or comments on this?
 
 *NEVER* use  software RAID if you can avoid it.  Software RAID puts a
 pretty good hit right on your CPU.

Perhaps this applies to some versions of RAID and some implementations 
of software RAID.   It certainly does not apply to simple RAID (such as 
RAID-1) and fine implementations of software such as FreeBSD's gmirror.

I use gmirror on all my machines at home.

That said, I also have hardware RAID-1.

--
SF.Net email is Sponsored by MIX09, March 18-20, 2009 in Las Vegas, Nevada.
The future of the web can't happen without you.  Join us at MIX09 to help
pave the way to the Next Web now. Learn more and register at
http://ad.doubleclick.net/clk;208669438;13503038;i?http://2009.visitmix.com/
___
Bacula-users mailing list
Bacula-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bacula-users