Re: [BULK] RE: R910/Linux CPU Heat Problems?

2010-12-08 Thread Eberhard Moenkeberg
Hi, On Wed, 8 Dec 2010, Jefferson Ogata wrote: > On 2010-12-08 23:31, Bond Masuda wrote: >> yeah, looks like the R910 has 4 PSUs definitely something off with one >> of them. I'd consider taking a physical look at it who knows? maybe one >> PSU is failing and generating a lot of heat? BT

Re: [BULK] RE: R910/Linux CPU Heat Problems?

2010-12-08 Thread Ryan Cox
I don't know much about that to be honest. I don't think I would worry too much about it. All of our rack mounts appear to have the same behavior. They are above the "Lower critical", however. I think at this point these are the steps I would try: * Reset the iDRAC/BMC. Sometimes this fixes i

RE: [BULK] RE: R910/Linux CPU Heat Problems?

2010-12-08 Thread Bond Masuda
That's a good theory, but if you saw the youtube video i posted earlier, the PSUs are not really positioned to do much cooling, they are stacked up vertically on the left side and all 4 of them occupy maybe only 20% or less of the total cross-section of the air flow. I would think that a response t

Re: [BULK] RE: R910/Linux CPU Heat Problems?

2010-12-08 Thread Erich Weiler
Hmm... [r...@server ~]# ipmitool -v sdr type Fan Sensor ID : FAN 1 RPM (0x30) Entity ID : 7.1 (System Board) Sensor Type (Analog) : Fan Sensor Reading: 1320 (+/- 120) RPM Status: ok Nominal Reading : 10080.000 Normal Minimum

Re: [BULK] RE: R910/Linux CPU Heat Problems?

2010-12-08 Thread Jefferson Ogata
On 2010-12-08 23:31, Bond Masuda wrote: > yeah, looks like the R910 has 4 PSUs definitely something off with one > of them. I'd consider taking a physical look at it who knows? maybe one > PSU is failing and generating a lot of heat? Or perhaps the high fan speed in one PSU is part of a s

RE: [BULK] RE: R910/Linux CPU Heat Problems?

2010-12-08 Thread Bond Masuda
yeah, looks like the R910 has 4 PSUs definitely something off with one of them. I'd consider taking a physical look at it who knows? maybe one PSU is failing and generating a lot of heat? -Original Message- From: Ryan Cox [mailto:ryan_...@byu.edu] Sent: Wednesday, December 08, 20

Re: [BULK] RE: R910/Linux CPU Heat Problems?

2010-12-08 Thread Ryan Cox
That is interesting. That appears to be showing two sets of fans (the 7.1s and 10.*). Do you have 4 power supplies in those? I don't know off the top of my head how many PSUs an R910 takes. 10.* in ipmitool is for power supplies and 7 is for the system board (see "ipmitool sdr entity help")

Re: [BULK] RE: R910/Linux CPU Heat Problems?

2010-12-08 Thread Eberhard Moenkeberg
Hi, On Wed, 8 Dec 2010, Erich Weiler wrote: > Very useful: > > [r...@server ~]# ipmitool sdr type Fan > FAN 1 RPM| 30h | ok | 7.1 | 1320 RPM > FAN 2 RPM| 31h | ok | 7.1 | 1320 RPM > FAN 3 RPM| 32h | ok | 7.1 | 1440 RPM > FAN 4 RPM| 33h | ok | 7.1 | 1680 RPM

Re: [BULK] RE: R910/Linux CPU Heat Problems?

2010-12-08 Thread Ryan Cox
We don't use OMSA here but do use ipmitool extensively. This may get you what you need. Load the following kernel modules first: ipmi_si, ipmi_devintf, ipmi_msghandler Give it a few seconds and then run: ipmitool sdr type Fan It can also be run remotely against an iDRAC (or BMC). We have had

Re: [BULK] RE: R910/Linux CPU Heat Problems?

2010-12-08 Thread Erich Weiler
Very useful: [r...@server ~]# ipmitool sdr type Fan FAN 1 RPM| 30h | ok | 7.1 | 1320 RPM FAN 2 RPM| 31h | ok | 7.1 | 1320 RPM FAN 3 RPM| 32h | ok | 7.1 | 1440 RPM FAN 4 RPM| 33h | ok | 7.1 | 1680 RPM FAN 5 RPM| 34h | ok | 7.1 | 1560 RPM FAN 6 RPM

Re: [BULK] RE: R910/Linux CPU Heat Problems?

2010-12-08 Thread Andrew Lyon
On Wed, Dec 8, 2010 at 9:49 PM, Erich Weiler wrote: > Yeah, I tried OMSA, but for the life of me could not get it to read > anything from the IPMI/BIOS interfaces.  No idea why.  I may just have > to reboot and go into the BIOS manually and see what I can see there. > I ran into the same problem

Re: [BULK] RE: R910/Linux CPU Heat Problems?

2010-12-08 Thread Erich Weiler
Yeah, I tried OMSA, but for the life of me could not get it to read anything from the IPMI/BIOS interfaces. No idea why. I may just have to reboot and go into the BIOS manually and see what I can see there. Thanks! On 12/08/10 13:49, Bond Masuda wrote: > Have you checked the fan speeds? Are t

Re: R910/Linux CPU Heat Problems?

2010-12-08 Thread Erich Weiler
> Just so you know, the kernel is merely responding to interrupts from the > processor cores themselves saying they are over temperature. The cores > have their thresholds set and the kernel can't and doesn't mess with > them. If the kernel reports the processors are hot, the processors are >

RE: R910/Linux CPU Heat Problems?

2010-12-08 Thread Bond Masuda
Have you checked the fan speeds? Are they at full throttle? I don't know much about the R910, but usually you can get fan speed readings from OMSA. -Original Message- From: linux-poweredge-boun...@dell.com [mailto:linux-poweredge-boun...@dell.com] On Behalf Of Erich Weiler Sent: Wednesday,

Re: biosdevname v0.3.2

2010-12-08 Thread Matt Domsch
On Wed, Dec 08, 2010 at 09:34:42AM -0600, Matt Domsch wrote: > Can you verify for me that you're running the lastest BIOS for these > please? I expected PE2950 BIOS 2.6.1 has SMBIOS 2.6, but I don't have > one handy to test with. I have a PE1955 handy, and indeed, latest BIOS 1.4.4 doesn't have S

RE: R910/Linux CPU Heat Problems?

2010-12-08 Thread Bond Masuda
I guess all this engineering isn't working for you?: http://www.youtube.com/watch?v=oulHU7hGRDM -Original Message- From: linux-poweredge-boun...@dell.com [mailto:linux-poweredge-boun...@dell.com] On Behalf Of Erich Weiler Sent: Wednesday, December 08, 2010 1:28 PM To: Ryan Cox Cc: linux

Re: R910/Linux CPU Heat Problems?

2010-12-08 Thread Erich Weiler
Thanks Ryan! So far so good. The server is under load now, as I run the script: [r...@server msr-tools-1.2]# ./showtemps | sort 0: 67 1: 70 2: 63 3: 77 4: 65 5: 71 6: 65 7: 77 8: 67 9: 72 10: 64 11: 77 12: 66 13: 73 14: 65 15: 76 16: 69 17: 76 18: 64 19: 78 20: 70 21: 75 22:

Re: R910/Linux CPU Heat Problems?

2010-12-08 Thread Ryan Cox
A few additions: I would highly recommend checking thumbscrews on the heat sink if the CPUs are legitimately hot. Thermal paste distribution on the CPUs may cause issues too. Also, "rdmsr -f 23:16 -d 0x1a2" will return the temperature threshold in degrees C. If you hit that temperature the c

Re: R910/Linux CPU Heat Problems?

2010-12-08 Thread Ryan Cox
Try running the following code. Load the "msr" kernel module and me sure rdmsr is installed. It's available from http://www.kernel.org/pub/linux/utils/cpu/msr-tools/ and is simple to compile. for a in /dev/cpu/[0-9]* do cpu=$(basename $a) printf "%2d: " $cpu echo $(($(rdmsr -f 23

R910/Linux CPU Heat Problems?

2010-12-08 Thread Erich Weiler
Hi All, We're running CentOS 5.5 (kernel 2.6.18-194.3.1.el5) on two Dell R910 servers. We're periodically getting CPU overheating messages spit out from syslogd: Message from syslogd@ at Fri Dec 3 12:06:56 2010 ... server kernel: CPU60: Temperature above threshold, cpu clock throttled Messag

Re: disk caddies

2010-12-08 Thread Philip Tait
On Wed, Dec 8, 2010 at 08:17, Michael Tiernan wrote: > On 12/8/10 12:05 PM, paul_m...@dell.com wrote: > > The second link is for a 2950/SAS and the first was for 2850/SCSI. Not a > valid comparison. > > > My apologies. I went too quickly. > > ___ > Linu

Re: disk caddy's

2010-12-08 Thread Michael Tiernan
On 12/8/10 12:05 PM, paul_m...@dell.com wrote: > The second link is for a 2950/SAS and the first was for 2850/SCSI. Not a > valid comparison. > My apologies. I went too quickly. ___ Linux-PowerEdge mailing list Linux-PowerEdge@dell.com https://lists

RE: disk caddy's

2010-12-08 Thread Paul_Maia
The second link is for a 2950/SAS and the first was for 2850/SCSI. Not a valid comparison. -Original Message- From: linux-poweredge-bounces-Lists On Behalf Of Michael Tiernan Sent: Wednesday, December 08, 2010 10:31 AM To: Ted Taichuan Lu Cc: linux-poweredge-Lists Subject: Re: disk caddy

Re: disk caddy's

2010-12-08 Thread J. Epperson
On Wed, December 8, 2010 11:30, Michael Tiernan wrote: > On 12/8/10 11:23 AM, Ted Taichuan Lu wrote: >> We bought these hot-swappable trays, which worked fine: >> http://www.provantage.com/datastor-dlpwrsata~7DTTS03K.htm >> > > That seems extreme when you can get them for less than half that here:

Re: Drivers and Downloads problems

2010-12-08 Thread Sean Dilda
I was having problems on Monday. Almost all of the BIOS/firmware updates I downloaded ended up downloading only the first 64k of the file. I had to switch to a windows box and use the java download application in order to get the full downloads. On 12/8/10 10:42 AM, Ryan Cox wrote: > Is anyon

Re: disk caddy's

2010-12-08 Thread Ted Taichuan Lu
We bought these hot-swappable trays, which worked fine: http://www.provantage.com/datastor-dlpwrsata~7DTTS03K.htm On 12/8/2010 11:18 AM, Denis Galvin wrote: > I was looking to add some third party hard drives to a 2950. Is there > anywhere that I can get disk caddy's? Dell only sells them with dr

disk caddy's

2010-12-08 Thread Denis Galvin
I was looking to add some third party hard drives to a 2950. Is there anywhere that I can get disk caddy's? Dell only sells them with drives. Denis Denis Galvin ILS Supervisor Rice University 1-713-348-3634 ___ Linux-PowerEdge mailing list Linux-PowerE

Re: Drivers and Downloads problems

2010-12-08 Thread cupertino
Yes, there is/was a problem during the day (European time). but by now it should be fixed. I download a 11G BIOS a few minutes ago. Original-Nachricht > Datum: Wed, 08 Dec 2010 08:42:25 -0700 > Von: Ryan Cox > An: linux-poweredge@dell.com > Betreff: *** GMX Spamverdacht *** Dr

Drivers and Downloads problems

2010-12-08 Thread Ryan Cox
Is anyone else having trouble downloading from support.dell.com? I've tried downloading the 2.2.9 BIOS for M610

Re: biosdevname v0.3.2

2010-12-08 Thread Matt Domsch
On Wed, Dec 08, 2010 at 01:10:05PM +, Andrew Lyon wrote: > On Mon, Dec 6, 2010 at 2:10 PM, Matt Domsch wrote: > > FYI, I know people are interested in the progress of this feature. > > -Matt > > > > - Forwarded message from Matt Domsch - > > > > Date: ? Mon, 6 Dec 2010 08:06:50 -0600

Re: biosdevname v0.3.2

2010-12-08 Thread Andrew Lyon
On Mon, Dec 6, 2010 at 2:10 PM, Matt Domsch wrote: > FYI, I know people are interested in the progress of this feature. > -Matt > > - Forwarded message from Matt Domsch - > > Date:   Mon, 6 Dec 2010 08:06:50 -0600 > From: Matt Domsch > To: linux-hotp...@vger.kernel.org, net...@vger.kerne