Re: SAS 6/iR as SAS/SATA only controller

2010-12-07 Thread Eric Rostetter
Quoting Nathan Hruby nhr...@gmail.com:

 Is this configuration supported?

No idea.  But try turning of SMART if you have it on.  Also you might
want to check for firmware upgrades.

We've had some problems where SMART checks were tripping SAS drives off-line.
Doing a /sbin/chkconfig smartd off fixed the issue.  I _think_ this might
be fixed in newer firmware?

-- 
Eric Rostetter
The Department of Physics
The University of Texas at Austin

Go Longhorns!

___
Linux-PowerEdge mailing list
Linux-PowerEdge@dell.com
https://lists.us.dell.com/mailman/listinfo/linux-poweredge
Please read the FAQ at http://lists.us.dell.com/faq


Re: IPMI

2010-08-05 Thread Eric Rostetter
Quoting Jason Edgecombe ja...@rampaginggeek.com:

 Would you please list a couple of these nagios-based systems? Preferably
 open-source.

Groundwork OpenSource (GWOS) Community Edition
Opsview
Centreon

Others I can't remember off the top of my head...

 Thanks,
 Jason

-- 
Eric Rostetter
The Department of Physics
The University of Texas at Austin

Go Longhorns!

___
Linux-PowerEdge mailing list
Linux-PowerEdge@dell.com
https://lists.us.dell.com/mailman/listinfo/linux-poweredge
Please read the FAQ at http://lists.us.dell.com/faq


Re: dsm_om_shrsvc start failes

2010-05-25 Thread Eric Rostetter
Quoting Support @ Technologist.si supp...@technologist.si:

 Executing /etc/rc.d/init.d/dsm_om_shrsvc start ..

 Starting DSM SA Shared Services: terminate called after throwing an  
 instance of 'std::runtime_error'
   what():  locale::facet::_S_create_c_locale name not valid
 /bin/bash: line 1: 27610 Terminated   
 /opt/dell/srvadmin/sbin/dsm_om_shrsvcd
 [FAILED]

See if typing:

export LC_ALL=C

before you start it fixes it...  It fixed a problem for me on one of my
systems (an rpath linux install).  Not seen it elsewhere (CentOS/RHEL
installs).

-- 
Eric Rostetter
The Department of Physics
The University of Texas at Austin

Go Longhorns!

___
Linux-PowerEdge mailing list
Linux-PowerEdge@dell.com
https://lists.us.dell.com/mailman/listinfo/linux-poweredge
Please read the FAQ at http://lists.us.dell.com/faq


Re: RAID-5 and database servers

2010-03-11 Thread Eric Rostetter
Quoting J. Epperson d...@epperson.homelinux.net:

 On Thu, March 11, 2010 11:17, Dan Pritts wrote:
 On Tue, Mar 09, 2010 at 04:54:44PM -0600, John G. Heim wrote:
 Has anyone configured a database server with RAID-5? Is it really a bad
 idea

 http://www.orafaq.com/wiki/RAID


 Which says that unless money is no object, go with RAID 5.

I'd say the page is somewhat outdated.  If your disks are large,
and most disks are today, RAID 5 should be replaced by RAID 6 or better.
RAID 5 is risky if your disks are large...  The larger the disk, the better
the chance of a second failure during a RAID 5 rebuild (causing a total lose
of data).

Also, while it does indeed say go with RAID 5 if you can't afford RAID 10,
it also says:

 use where availability is important, AND 'read' will be the majority of I/O's

If your database is mostly write, RAID 5 would not be a great idea...

Fortunately most databases are either mostly read, or mixed read-write.
But there are some mostly-write databases, and these would be a bad fit
for RAID 5 (or RAID 6).

Again, it depends on your environment and your needs... It is possible RAID 5
is perfect for your needs, but terrible for my needs...

If you don't need fast access, then it doesn't matter... Some people have
databases, and it takes many hours to generate a report, and they are okay
with that.  Others can't bear it if the report takes more than 30 seconds...
If your database use is interactive and response time is important, you
likely need a different setup than if your database is mostly batch
oriented and response time isn't as important...

-- 
Eric Rostetter
The Department of Physics
The University of Texas at Austin

Go Longhorns!

___
Linux-PowerEdge mailing list
Linux-PowerEdge@dell.com
https://lists.us.dell.com/mailman/listinfo/linux-poweredge
Please read the FAQ at http://lists.us.dell.com/faq


Re: RAID-5 and database servers

2010-03-11 Thread Eric Rostetter
Quoting Jefferson Ogata powere...@antibozo.net:

 I've got several hundred disks running on RAID 5 and I've had one actual
 full RAID failure in 10 years, and that was my fault.

You've been lucky! :)

In 10 years, I've think I've had 3 RAID 5 failures (all rebuilt without
problems).

 In terms of performance, depending on the workload, RAID 5 can
 outperform RAID 10.

Very true.

 Furthermore Oracle's recommendations are based on
 what appears to be 5-10-year-old data

I agree, it appears outdated to me also.

 Bear in mind
 also that now that Oracle is a hardware company, they'd just love you to
 buy almost twice as much disk (from them).

I doubt that is a driving factor here...

 *Again*, this is why if you have particular performance requirements,
 you should consult with your database vendor to determine what bandwidth
 and IOPS you need, and benchmark your gear using different RAID configs.

Or at a minimum, you need to define what your performance requirements are.
If you can't quantify your performance requirements, you're just guessing
and taking a shot in the dark.

 You may find that RAID 5 is just fine performance-wise, and you can get
 around 1.7 times the storage capacity with the same rack space, heat,
 and power load over RAID 10. Asking here you're just going to get people
 parroting Oracle's stale recommendations and speculating wildly without
 knowing anything about your workload.

Well, the advise has been slightly better than that, but yes, we're all
speculating without knowing anything about the workload.  And I at least
have stated that in my posts/replies...

If a serious answer is needed, the OP needs to post the workload and
performance expectations at a minimum...

-- 
Eric Rostetter
The Department of Physics
The University of Texas at Austin

Go Longhorns!

___
Linux-PowerEdge mailing list
Linux-PowerEdge@dell.com
https://lists.us.dell.com/mailman/listinfo/linux-poweredge
Please read the FAQ at http://lists.us.dell.com/faq


Re: RAID-5 and database servers

2010-03-11 Thread Eric Rostetter
Quoting Jefferson Ogata powere...@antibozo.net:

 That's not what I mean by a full RAID failure.

My mistake; I just glossed right over the word full as if it wasn't
there...   Sorry about that...  Brain fart I guess

-- 
Eric Rostetter
The Department of Physics
The University of Texas at Austin

Go Longhorns!

___
Linux-PowerEdge mailing list
Linux-PowerEdge@dell.com
https://lists.us.dell.com/mailman/listinfo/linux-poweredge
Please read the FAQ at http://lists.us.dell.com/faq


Re: How to saw Hardware Raid

2010-02-19 Thread Eric Rostetter
Quoting Adnan Rizvi sari...@buffalo.edu:

 When I give this command. It gives me no output.

 ./MegaCli64 -LdPdInfo -aAll

The commands I use (via cron) are:

MEGACLI=/opt/MegaRAID/MegaCli/MegaCli

$MEGACLI -AdpAllInfo -aALL  /tmp/AdpAllInfo.txt
a=`grep Degraded /tmp/AdpAllInfo.txt | grep -o [0-9]`
b=`grep Failed Disks /tmp/AdpAllInfo.txt | grep -o [0-9]`
if [ $a != 0 -o $b != 0 ] ; then
$MEGACLI -LdPdInfo -aALL  /tmp/ldpd.txt
mailx -s Check RAID on `hostname` root  /tmp/ldpd.txt
else
cat /dev/null  $MEGAPATH/MegaSAS.log
fi

But, maybe if you don't get output from -LdPdInfo you won't get it any
from -AdpAllInfo also?

OMSA would be the other option...

-- 
Eric Rostetter
The Department of Physics
The University of Texas at Austin

Go Longhorns!

___
Linux-PowerEdge mailing list
Linux-PowerEdge@dell.com
https://lists.us.dell.com/mailman/listinfo/linux-poweredge
Please read the FAQ at http://lists.us.dell.com/faq


Re: AHCI permitted on some Gen 11 servers (was Third-party drives not permitted on Gen 11 servers)

2010-02-11 Thread Eric Rostetter
Quoting Tim Small t...@seoss.co.uk:

 Isn't this another reason to ditch hardware RAID controller cards
 entirely?  To be honest - whenever possible, I use Linux's built-in md
 RAID1/5/6/10 instead of proprietary RAID solutions, and use
 Enterprise-grade SATA drives.  This gives me:

Depends on your needs (or clients)...

I have a 2-node cluster of Dell 2900 machines which are configured
as a SAN/NAS, and I'd never do anything but hardware raid on these.

I had a 2-node cluster of Dell 2900 machines which are configured as
a mail cluster (SMTP, POP3/IMAP, antivirus, spam filter, etc), and
I'd never do anything but hardware raid on these.

I've also got got a bunch more Dell servers, such as one that is my
backup SMTP (MX) host, one that does my webmail, one that is for
computation jobs, one that is a print server, and so on.  ALL of these
use software raid...

My web server has hardware raid now, but only because it is running on
the hardware which was my old NAS before I built the above SAN/NAS system.
Otherwise, it wouldn't have hardware raid, but rather software raid...

Your situations may vary, but there are some situations where hardware
raid is still the best solution.  However, those situations are fewer
and fewer each year.  I used to order ALL my PE servers with HW RAID...
No I order maybe 10% with HW RAID.

 So... looking on the bright side, some Gen 11 do support SATA and AHCI,
 so why not just ditch this proprietary stuff altogether?

Well, some people work for clients, and may not be able to.  Some people
might just want the HW RAID for certain applications...  But in many
cases, SW RAID is a better choice today.  That didn't used to be the case
10 years ago, but the times have drastically changed since then...

 Tim.

-- 
Eric Rostetter
The Department of Physics
The University of Texas at Austin

Go Longhorns!

___
Linux-PowerEdge mailing list
Linux-PowerEdge@dell.com
https://lists.us.dell.com/mailman/listinfo/linux-poweredge
Please read the FAQ at http://lists.us.dell.com/faq


RE: Third-party drives not permitted on Gen 11 servers

2010-02-10 Thread Eric Rostetter
Quoting Ronan Mullally ro...@iol.ie:

 Dell is no NetApp.  Having used solutions from both, there's a world of
 a difference.  You are kidding yourself if you think you're on a nearby
 practice ground, let alone in the same ballpark.

Dell does sell enterprise storage solutions like NetApp (some their
own, some co-branded).  Think about their SAN and DAS boxes...

I'd have little troble with such a policy on their SAN's for example.

But, a Poweredge Server is no NetApp.  That is for sure...  I find it
funny that they use the phrase enterprise storage solution when
refering to a Poweredge Server...

Many of my PE servers store no data except the OS.  I still want to
mirror the OS in case of failure, but the storage is on a SAN/NAS
elsewhere, not in the PE server...  So trying to compare a PE Server
used this way (not for storage) to a NetApp is apples to oranges.
This is really true for PE 1U servers with only a small number of bays...

 -Ronan

-- 
Eric Rostetter
The Department of Physics
The University of Texas at Austin

Go Longhorns!

___
Linux-PowerEdge mailing list
Linux-PowerEdge@dell.com
https://lists.us.dell.com/mailman/listinfo/linux-poweredge
Please read the FAQ at http://lists.us.dell.com/faq


RE: Third-party drives not permitted on Gen 11 servers

2010-02-10 Thread Eric Rostetter
Quoting Mirosław Jaworski m...@ikp.pl:

 We anxiously wait for further lock-ins.

I had a slippery slope rant in my last email, but I decided to remove it
before sending...  See you think a bit alike...

The funny thing is, when we ran DEC and SUN stuff, all Dell ever told us
was why we should switch to their Industry Standard hardware; how great
it would be to use Industry Standard equipment because we could buy
stuff off-the-shelve and not have to buy from a specific vendor; how using
their Industry Standard computers would remove software incompatibilities
and such...

So much for Dell's Industry Standard hardware, huh?  Wonder what their
new sales pitch will be now?

Maybe they can steal a line from the diaper commercial (yeah, think about
it...)...  Dell -- We're a big kid now...

Anyway, again, I don't much care, I'll just buy anything new with the
older PERC 6 cards for now.  I know the bandwidth is better in the new
PERC cards, but I can deal with the current speed...

If/When Dell only offers lock-in PERC's, well, I'll figure out what to
do then...

-- 
Eric Rostetter
The Department of Physics
The University of Texas at Austin

Go Longhorns!

___
Linux-PowerEdge mailing list
Linux-PowerEdge@dell.com
https://lists.us.dell.com/mailman/listinfo/linux-poweredge
Please read the FAQ at http://lists.us.dell.com/faq

Re: Third-party drives not permitted on Gen 11 servers

2010-02-10 Thread Eric Rostetter
Quoting s.mishima s.mish...@gmail.com:

 I have PowerEdge Server with DELL certified Seagate harddisk.
 I bought it from DELL.

 The DELL certified Seagate harddisk has lock problem 1/320 probability
 every power-on spinup.

Yeah, I bought two of those systems too. ;) Again, thought about writing
this in my post, but decided against it...

 But DELL support says,
 Out of Warranty.

Yeah, fortunately it is a firmware upgrade you can do without warranty.
In fact, Dell wouldn't replace them at all, only provide the firmware
update.

 What is DELL certified ?
 What is DELL warranty ?

The proper question here is, why didn't Dell's extensive testing for
certification help catch this issue?

The warranty doesn't really apply, since it is just a firmware upgrade,
and the firmware is free...

-- 
Eric Rostetter
The Department of Physics
The University of Texas at Austin

Go Longhorns!

___
Linux-PowerEdge mailing list
Linux-PowerEdge@dell.com
https://lists.us.dell.com/mailman/listinfo/linux-poweredge
Please read the FAQ at http://lists.us.dell.com/faq


Re: help configuring a db server

2010-02-10 Thread Eric Rostetter
Quoting John G. Heim jh...@math.wisc.edu:

 to go) and mysql. We have databases  for spamassassin bayesian rules,
 drupal, moodle, imp (webmail), and our own private data.  Our private data
 is pretty small with the biggest table containing under 10,000 records. We
 do have a table with about 4,00 PDF docs in a blob field.

 I think we can get a machine with a quad core, 32 Gb of RAM, and 300 Gb disk

Sounds fine to me.  I recommend, when possible, a minimum of 1 CPU and 1 GB
memory per database.  Since you mention 5 databases, that means 5 CPUs (cores)
and 5 GB memory minimum.  Since you can't buy 5 cores, a single quad core
would be okay (a dual quad core even better).  But your spec sounds fine
to me.

 for under $6000. But I'm confused about disk. I would think disk pspeed
 would be fairly important. How can I configure a machine with a fast disk?
 What are my options from Dell in that regard?

Get the fastest RAID controller you can and are comfortable with, and as many
of the fastest disks you can afford as you can fit in the machine.

Since you want to stay under $6000, using SSD or FusionIO is probably out.
I would think (haven't verified) that you could buy 4 (or 5) 15K RPM SAS
drives in your price range, setup as a RAID 10.  If not, try 10K RPM SAS
drives.  If that fails, well, there is always SATA...

I personally like to buy 7 drives: 2 RAID-1 for the OS, 4 RAID-10 for the
MySQL databases, and 1 hotspare.  But, if you need to keep costs down, you
can dump the RAID 1 and just do 5 drives (RAID 10 plus hot-spare).

You'd have to spec all that one Dell's web site to see if it comes in
under $6K.  If not, just start dropping down until you hit your mark
(from 7 drives to 5, from 15K SAS to 10K SAS to SATA).

-- 
Eric Rostetter
The Department of Physics
The University of Texas at Austin

Go Longhorns!

___
Linux-PowerEdge mailing list
Linux-PowerEdge@dell.com
https://lists.us.dell.com/mailman/listinfo/linux-poweredge
Please read the FAQ at http://lists.us.dell.com/faq


Re: RAID battery-backed cache - necessary?

2010-02-10 Thread Eric Rostetter
Quoting Adam Nielsen adam.niel...@uq.edu.au:

 Ahh, okay, so it's not there for pure write speed as such, it's there so
 that software can be told yes, the data you just wrote is now on disk
 no matter what even though the actual disk write may happen at a later
 time.

Yes, all true.  But it is also used by some to do write combining or write
ordering or such (try a google search like raid5 small write problem for
info).  And it is also there so that it can be used for parity calculations
on raid sets that use parity.  And it is also there so that if you lose
power while writing out a parity stripe, such that the data is written but
the parity isn't written when the power goes out, it doesn't create a
corrupted stripe that can't be recovered on a disk failure (try a google
search on raid write hole  for info).

And I'd bet there are other reasons too.

 I guess although a UPS would stop the OS cache from going away, it can't
 help if the kernel crashes.

Or when the UPS stops on a prolonged power outage.  RAID batteries will
generally go for weeks or months without power...  UPS batteries usually
go for minutes or maybe a few hours without power...  Sometimes power is
out for a while...

 Well that answers my question - thanks :-)

At least partially... :)  There's usually more to these things than
meets the eyes...

 Cheers,
 Adam.

-- 
Eric Rostetter
The Department of Physics
The University of Texas at Austin

Go Longhorns!

___
Linux-PowerEdge mailing list
Linux-PowerEdge@dell.com
https://lists.us.dell.com/mailman/listinfo/linux-poweredge
Please read the FAQ at http://lists.us.dell.com/faq


Re: OMSA 6.2 install issues...

2010-02-09 Thread Eric Rostetter
Quoting John Oliver joli...@john-oliver.net:

 So what's the correct order? :-)

Not sure. :)  The following is one ordering which works, but I bet there
are at least several that work.

libsmbios smbios-utils-bin srvadmin-cm srvadmin-xmlsup srvadmin-omacore
srvadmin-omilcore srvadmin-deng srvadmin-hapi srvadmin-isvc srvadmin-omcommon
srvadmin-smcommon srvadmin-racdrsc5 srvadmin-racadm5 srvadmin-rac5-components
srvadmin-fsa srvadmin-megalib srvadmin-storage srvadmin-sysfsutils
srvadmin-storage-populator srvadmin-storelib-sysfs
srvadmin-storelib srvadmin-storelib-libpci

One thing I remember about the order that didn't work is that I installed
srvadmin-{deng,hapi,isvc} before I installed srvadmin-omilcore and maybe
some others...  Installing them too soon might be the problem?  Not sure
though, and I unfortunately didn't keep the order that didn't work...

-- 
Eric Rostetter
The Department of Physics
The University of Texas at Austin

Go Longhorns!

___
Linux-PowerEdge mailing list
Linux-PowerEdge@dell.com
https://lists.us.dell.com/mailman/listinfo/linux-poweredge
Please read the FAQ at http://lists.us.dell.com/faq


Re: OMSA help in unsupported install...

2010-02-06 Thread Eric Rostetter
Quoting Vanush Misha Paturyan mi...@cs.nuim.ie:

 but can you double-check following:

 1. your snmpd.conf file does have smuxpeer .1.3.6.1.4.1.674.10892.1
 2. your snmpd is started WITHOUT -I -smux (this is default on Debian
 and breaks smuxpeer)(ps ax | grep snmpd shows the parameters for snmpd)
 (according to your log files smuxpeer is running so I think answers to
 those two are yes, but just to make sure )

Yes, already checked those (the first from OMSA docs, the second from
a google search).  Both were correct.

 following is based on OMSA 5.4, not 5.1 as in your case, but here's
 how snmpd and omsa communicate (at least I think that's how)
 1. you need dsm_sa_snmp32d, dsm_sa_datamgr32d (and possibly
 dsm_sa_eventmgr32d) to be running (ps aux should show them).

Yes, have all that.

 2. once you get the PID of dsm_sa_snmp32d run lsof -p PID and confirm
 that it does have an established TCP IPv4 connection to localhost:smux
 (or localhost:199).

Ah, cool... Thanks for that one.

 To debug snmpd stop your running instance and start it from command
 prompt as /usr/sbin/snmpd -Dsmux -f -Lo -Lf /var/log/snmpd.log -u
 root (this will write output on both screen and in /var/log/snmpd.log
 file). You can specify -DALL, see what it generates, and then filter
 out things you want to analyse.

Way cool!  Thanks a million.

 And another thing worth checking (if you haven't done it yet): run ldd
 sdm_sa_smnp32d to see it has all the libraries it needs. You haven't
 told yet if your Linux is 32 or 64 bit. OMSA is 32bit so on 64bit
 versions it needs bunch of 32bit libraries.

Yea, I spent a day just getting this worked out (because yes, I'm running
a 64-bit distro, and lots of libraries were missing...)

Thanks to your help and my persistence, I now have it working...  Your
debugging tips (above) were a great help.

I actually was able to upgrade it, and I'm now running a current OMSA
(OMI-SrvAdmin-Dell-Web-LX-620-677.rhel4.tar.gz) on OpenFiler 2.3 with
the SNMP working on two (identical) machines.  Only had a few minor
issues getting the 6.2 to work...

 Misha.

 --
 Vanush Misha Paturyan
 Senior Technical Officer
 Computer Science Department
 NUI Maynooth

-- 
Eric Rostetter
The Department of Physics
The University of Texas at Austin

Go Longhorns!

___
Linux-PowerEdge mailing list
Linux-PowerEdge@dell.com
https://lists.us.dell.com/mailman/listinfo/linux-poweredge
Please read the FAQ at http://lists.us.dell.com/faq


OMSA help in unsupported install...

2010-02-03 Thread Eric Rostetter

I'm trying to install what is probably an unsupported OMSA (5.1.0) on
a very much unsupported OS (rpath linux, specifically OpenFiler NAS/SAN
appliance 2.3).  And I'm hoping despite that maybe someone on this list
might be able to help me. :)

I have OMSA installed and running.  omreport works and reports everything
except storage (because OMSA 5.1.0 pre-dates the perc 6/i in the machine,
so this is excepted to fail).  So everything looks good locally...  I can
see everything I've tried (fans, volts, temps, system summary, etc) other
than storage as noted above.

In /var/log/messages, I can see that OMSA is trying to talk to snmpd, ala:

Feb  3 14:01:53 filer2 snmpd: snmpd startup succeeded
Feb  3 14:01:53 filer2 snmpd[24188]: NET-SNMP version 5.4.1
Feb  3 14:01:54 filer2 snmpd[24188]: [smux_accept] accepted fd 9 from  
127.0.0.1:60802
Feb  3 14:01:54 filer2 snmpd[24188]: accepted smux peer: oid  
SNMPv2-SMI::enterprises.674.10892.1, descr Systems Management SNMP MIB  
Plug-in Manager

But when I remotely snmpwalk it, it returns no values.  It is connecting,
again via /var/log/messages:

Feb  3 14:02:48 filer2 snmpd[24188]: Connection from UDP:  
[172.16.111.73]:56624
Feb  3 14:02:48 filer2 snmpd[24188]: Received SNMP packet(s) from UDP:  
[172.16.111.73]:56624
Feb  3 14:02:48 filer2 snmpd[24188]: Connection from UDP:  
[172.16.111.73]:56624

So, the question I have is: how would I go about trying to diagnose why I
can't remotely query the machine via snmp?  Any tips at all (local checks,
snmpd config checks, OMSA config checks, etc) appreciated, since I'm rather
a novice at setting up snmpd as well as unsupported OMSA setups. :)

Thanks in advance for any help that might come my way.

-- 
Eric Rostetter
The Department of Physics
The University of Texas at Austin

Go Longhorns!

___
Linux-PowerEdge mailing list
Linux-PowerEdge@dell.com
https://lists.us.dell.com/mailman/listinfo/linux-poweredge
Please read the FAQ at http://lists.us.dell.com/faq