Marcy,

I'll ask the vendor on the memory question...that's what I keep thinking
also is that something must be running that spikes the memory and kills the
machine before it has any time to recapture memory.

Kraig and Aaron,

I'm also going to pass on the hardware information and see if that can lead
the vendor down the right path as well.

Thanks!

Joell Chockley
System Capacity/Performance Specialist
Blue Cross Blue Shield of KS
1133 Topeka Blvd
Topeka, KS  66629-0001
Work (785)291-7837



  From:       Marcy Cortes <marcy.d.cor...@wellsfargo.com>

  To:         LINUX-390@vm.marist.edu

  Date:       12/07/2010 02:01 PM

  Subject:    Re: Kernel panic - not syncing error on Red Hat Linux






Well, that microcode fun was ours and what Linux got out of that was a "low
address protection" error.

This might be a clue:
"Out of memory and no killable processes error"

Do you need more memory?  Got a memory leak??




Marcy


-----Original Message-----
From: Linux on 390 Port [mailto:linux-...@vm.marist.edu] On Behalf Of
Graves, Aaron
Sent: Tuesday, December 07, 2010 11:55 AM
To: LINUX-390@vm.marist.edu
Subject: Re: [LINUX-390] Kernel panic - not syncing error on Red Hat Linux

Take a look at your hardware microcode level.  This may be your problem.

 060310 HIPER MCL's for 2097, 2098

Document number:        060310
Machine type:   2097, 2098
Models affected:        All Models
Abstract:       HIPER MCLs have been released for machines at Driver 76 and
Driver 79




Description


HIPER MCLs have been released to address an issue reported in the field.
This issue has only been seen in the Linux on System z operating system
running as guest under zVM and has resulted in a Linux abend. This issue
occurred when the physical CPU transitioned from handling one guest
configuration to another. A very small window exists where the update of
the hardware controls related to multiprocessor address translation were
not handled properly. There are no methods to avoid exposure to this issue
other than installing the specified HIPER MCL's.

This problem has only been reported when running Linux on System z as a
guest under zVM. However, IBM believes it could also occur when running
other operating systems. Although an undetected data corruption has never
been reported as a result of this issue, there is a very small chance that
this could occur.

The following HIPER MCLs have been released to eliminate the exposure:

Driver 73 -- Driver-73 entered "End of MCL Support" August 2009 and a fix
for this issue will not be released for this Driver. Product Engineering
strongly recommends upgrading to Driver-79 to obtain this fix.
Driver 76 -- MCL's N10967.101 and N10967.102 of the I390ML stream released
to RETAIN on June 1st, 2010 in MCL bundle #47.
Driver 79 -- MCL's N24406.032 and N24406.033 of the I390ML stream released
to RETAIN on June 1st, 2010 in MCL bundle #31.

Recommended action


Product Engineering recommends the installation of HIPER MCLs . These MCLs
can be installed concurrent with system operation.

Note- Driver-76 is currently in "Limited MCL Support" mode and will enter
"End of MCL Support" on June 30, 2010. Product Engineering strongly
recommends upgrading to Driver-79 during you next microcode maintenance
window. Driver-79 was released in November 2009 and is running on 40% of
the z10 machines in the field. Driver-79 is also the last planned driver
for z10.

EDM Considerations: The installation of these HIPER MCL's at Driver-76 will
exceed the current EDM min/max definition for concurrently upgrading to
Driver-79. A new EDM sync point is being developed and is targeted to be
released mid to late June, 2010. The following options are available:

1) Upgrade to Driver-79 and install the latest HIPER MCL's
2) Install the latest Driver-76 HIPER MCL's and wait for the next EDM sync
point to be released.

Note: Since these Driver-76 HIPER MCL's exceed the current EDM min/max
level you need to take additional actions to ensure the MCL install and
activate process overrides the current min/max level. To do this you need
to insure that you select the 'check box' "Include internal code changes
which will inhibit Concurrent Upgrade Engineering Changes (EC) task from
being used to apply the next Licensed Internal Code (EC) level." before
installing the Driver-76 HIPER MCL's.

Regards,
Aaron

-----Original Message-----
From: Linux on 390 Port [mailto:linux-...@vm.marist.edu] On Behalf Of Joell
Chockley
Sent: Tuesday, December 07, 2010 2:08 PM
To: LINUX-390@VM.MARIST.EDU
Subject: Kernel panic - not syncing error on Red Hat Linux

Has anyone had any experience with the following error in Red Hat Linux?

Kernel panic - not syncing: Out of memory and no killable processes error
and system is frozen and has to be hard rebooted.

cat /etc/redhat-release output is:
Red Hat Enterprise Linux ES release 4 (Nahant Update 4).

uname -a output is:
Linux timmon1.corp.bcbsks.com 2.6.9-42.ELsmp #1 SMP Wed Jul 12 23:27:17 EDT
2006 i686 i686 i386 GNU/Linux

I have a screen print of the console when it's frozen, but didn't know
about attaching a graphic to a question.  I also have a meminfo.out file
and sysreport output that I can send if anyone thinks that would be
helpful.  We've changed the setting for lower_zone_protection as
recommended from Red Hat and it's not having any impact on this problem
occuring.

Setting change from Red Hat:
echo 100 > /proc/sys/vm/lower_zone_protection

We've had this problem occur off and on with both of our Red Hat systems,
but this one in particular has been having a major problem the last week
and has been locking up about 3 times a day.  We use the Linux systems with
our CA Wily product for monitoring on the network.

The top command right now shows this output:
top - 12:44:49 up  2:16,  1 user,  load average: 0.42, 0.21, 0.17
Tasks:  73 total,   1 running,  72 sleeping,   0 stopped,   0 zombie
Cpu(s): 23.2% us,  0.3% sy,  0.0% ni, 64.6% id, 11.8% wa,  0.0% hi,  0.0%
si
Mem:   4144696k total,  1137308k used,  3007388k free,      448k buffers%
Swap:  4088532k total,        0k used,  4088532k free,    41672k cached

  PID USER      PR  NI  VIRT  RES  SHR S %CPU %MEM    TIME+  COMMAND
 4256 root      16   0  470m 204m 9204 S   94  5.1  36:01.64 tim
 3271 apache    16   0 13164 7068 3052 S    0  0.2   0:01.41 httpd

And the system just locked up on me again...3rd time today.

Any thoughts or ideas of where to look would be appreciated.

Thanks!

Joell Chockley
System Capacity/Performance Specialist
Blue Cross Blue Shield of KS
1133 Topeka Blvd
Topeka, KS  66629-0001
Work (785)291-7837



CONFIDENTIALITY NOTICE: This email message and any attachments are for the
sole use of the intended recipient(s) and may contain proprietary,
confidential, trade secret or privileged information.  Any unauthorized
review use, disclosure or distribution is prohibited and may be a violation
of law.  If you are not the intended recipient or a person responsible for
delivering this message to an intended recipient, please contact the sender
by reply email and destroy all copies of the original message.

----------------------------------------------------------------------
For LINUX-390 subscribe / signoff / archive access instructions,
send email to lists...@vm.marist.edu with the message: INFO LINUX-390 or
visit
http://www.marist.edu/htbin/wlvindex?LINUX-390
----------------------------------------------------------------------
For more information on Linux on System z, visit
http://wiki.linuxvm.org/

----------------------------------------------------------------------
For LINUX-390 subscribe / signoff / archive access instructions,
send email to lists...@vm.marist.edu with the message: INFO LINUX-390 or
visit
http://www.marist.edu/htbin/wlvindex?LINUX-390
----------------------------------------------------------------------
For more information on Linux on System z, visit
http://wiki.linuxvm.org/

----------------------------------------------------------------------
For LINUX-390 subscribe / signoff / archive access instructions,
send email to lists...@vm.marist.edu with the message: INFO LINUX-390 or
visit
http://www.marist.edu/htbin/wlvindex?LINUX-390
----------------------------------------------------------------------
For more information on Linux on System z, visit
http://wiki.linuxvm.org/





CONFIDENTIALITY NOTICE: This email message and any attachments are for the sole 
use of the intended recipient(s) and may contain proprietary, confidential, 
trade secret or privileged information.  Any unauthorized review use, 
disclosure or distribution is prohibited and may be a violation of law.  If you 
are not the intended recipient or a person responsible for delivering this 
message to an intended recipient, please contact the sender by reply email and 
destroy all copies of the original message.

----------------------------------------------------------------------
For LINUX-390 subscribe / signoff / archive access instructions,
send email to lists...@vm.marist.edu with the message: INFO LINUX-390 or visit
http://www.marist.edu/htbin/wlvindex?LINUX-390
----------------------------------------------------------------------
For more information on Linux on System z, visit
http://wiki.linuxvm.org/

Reply via email to