https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=288607

            Bug ID: 288607
           Summary: MCA kernel panic on non-existent BANK 8
           Product: Base System
           Version: 14.3-RELEASE
          Hardware: amd64
                OS: Any
            Status: New
          Severity: Affects Only Me
          Priority: ---
         Component: kern
          Assignee: [email protected]
          Reporter: [email protected]

Fresh 14.3 machine on hardware that has been stable past 5 years; regular
reboots after about 5 hours of uptime.

Message just before reboot shows an MCA error in Bank 8.

uname: FreeBSD jade.hostname 14.3-RELEASE FreeBSD 14.3-RELEASE
releng/14.3-n271432-8c9ce319fef7 GENERIC amd64

dmidecode -t memory | grep BANK
        Bank Locator: BANK 0
        Bank Locator: BANK 1
        Bank Locator: BANK 2
        Bank Locator: BANK 3
(full dmidecode at the end)


Aug  2 06:25:37 hostname syslogd: kernel boot file is /boot/kernel/kernel
Aug  2 06:25:37 hostname kernel: MCA: Bank 8, Status 0xbe20000000011152
Aug  2 06:25:37 hostname syslogd: last message repeated 4 times
Aug  2 06:25:37 hostname kernel: MCA: Global Cap 0x0000000000000c09, Status
0x0000000000000005
Aug  2 06:25:37 hostname syslogd: last message repeated 1 times
Aug  2 06:25:37 hostname kernel: MCA: Bank 8, Status 0xbe20000000011152
Aug  2 06:25:37 hostname kernel: MCA: Vendor "GenuineIntel", ID 0x306a9, APIC
ID 3
Aug  2 06:25:37 hostname kernel: MCA: CPU 3 UNCOR EN MCA: Bank 8, Status
0xbe20000000011152
Aug  2 06:25:37 hostname kernel: MCA: Global Cap 0x0000000000000c09, Status
0x0000000000000005
Aug  2 06:25:37 hostname syslogd: last message repeated 1 times
Aug  2 06:25:37 hostname kernel: MCA: Bank 8, Status 0xbe20000000011152
Aug  2 06:25:37 hostname kernel: MCA: Vendor "GenuineIntel", ID 0x306a9, APIC
ID 4
Aug  2 06:25:37 hostname kernel: MCA: Global Cap 0x0000000000000c09, Status
0x0000000000000005
Aug  2 06:25:37 hostname kernel: MCA: Vendor "GenuineIntel", ID 0x306a9, APIC
ID 7
Aug  2 06:25:37 hostname kernel: MCA: Global Cap 0x0000000000000c09, Status
0x0000000000000005
Aug  2 06:25:37 hostname kernel: MCA: Vendor "GenuineIntel", ID 0x306a9, APIC
ID 1
Aug  2 06:25:37 hostname kernel: MCA: CPU 1 UNCOR EN PCC ICACHE L2 IRD error
Aug  2 06:25:37 hostname kernel: MCA: Address 0xae5b40MCA: Vendor
"GenuineIntel", ID 0x306a9, APIC ID 5
Aug  2 06:25:37 hostname kernel: MCA: Global Cap 0x0000000000000c09, Status
0x0000000000000005
Aug  2 06:25:37 hostname kernel: MCA: CPU 5 MCA: Vendor "GenuineIntel", ID
0x306a9, APIC ID 6
Aug  2 06:25:37 hostname kernel: MCA: CPU 7 UNCOR EN PCC ICACHE L2 IRD
errorMCA: CPU 6 UNCOR EN PCC ICACHE L2 IRD error
Aug  2 06:25:37 hostname kernel: MCA: Address 0xae5b40MCA: Vendor
"GenuineIntel", ID 0x306a9, APIC ID 2
Aug  2 06:25:37 hostname kernel: MCA: Global Cap 0x0000000000000c09, Status
0x0000000000000005
Aug  2 06:25:37 hostname kernel: MCA: Vendor "GenuineIntel", ID 0x306a9, APIC
ID 0
Aug  2 06:25:37 hostname kernel: UNCOR MCA: CPU 0 UNCOR EN PCC ICACHE L2 IRD
error
Aug  2 06:25:37 hostname kernel: MCA: Address 0xae5b40
Aug  2 06:25:37 hostname kernel: MCA: Misc 0x702203c086
Aug  2 06:25:37 hostname kernel: panic: Unrecoverable machine check exception
Aug  2 06:25:37 hostname kernel: cpuid = 0
Aug  2 06:25:37 hostname kernel: time = 1754115805
Aug  2 06:25:37 hostname kernel: KDB: stack backtrace:
Aug  2 06:25:37 hostname kernel: Uptime: 14h26m11s


And at this point

---<<BOOT>>---


# dmidecode -t memory 
# dmidecode 3.6
Scanning /dev/mem for entry point.
SMBIOS 2.7 present.

Handle 0x005D, DMI type 17, 34 bytes
Memory Device
        Array Handle: 0x005E
        Error Information Handle: 0x0062
        Total Width: 64 bits
        Data Width: 64 bits
        Size: 8 GB
        Form Factor: DIMM
        Set: None
        Locator: ChannelA-DIMM0
        Bank Locator: BANK 0
        Type: DDR3
        Type Detail: Synchronous
        Speed: 1600 MT/s
        Manufacturer: 859B
        Serial Number: A10FD65E
        Asset Tag: 9876543210
        Part Number: CT102464BA160B.C16
        Rank: 2
        Configured Memory Speed: 1600 MT/s

Handle 0x005E, DMI type 16, 23 bytes
Physical Memory Array
        Location: System Board Or Motherboard
        Use: System Memory
        Error Correction Type: None
        Maximum Capacity: 32 GB
        Error Information Handle: 0x005F
        Number Of Devices: 4

Handle 0x0061, DMI type 17, 34 bytes
Memory Device
        Array Handle: 0x005E
        Error Information Handle: No Error
        Total Width: 64 bits
        Data Width: 64 bits
        Size: 8 GB
        Form Factor: DIMM
        Set: None
        Locator: ChannelA-DIMM1
        Bank Locator: BANK 1
        Type: DDR3
        Type Detail: Synchronous
        Speed: 1600 MT/s
        Manufacturer: 859B
        Serial Number: A10FD67B
        Asset Tag: 9876543210
        Part Number: CT102464BA160B.C16
        Rank: 2
        Configured Memory Speed: 1600 MT/s

Handle 0x0064, DMI type 17, 34 bytes
Memory Device
        Array Handle: 0x005E
        Error Information Handle: 0x0065
        Total Width: 64 bits
        Data Width: 64 bits
        Size: 8 GB
        Form Factor: DIMM
        Set: None
        Locator: ChannelB-DIMM0
        Bank Locator: BANK 2
        Type: DDR3
        Type Detail: Synchronous
        Speed: 1600 MT/s
        Manufacturer: 859B
        Serial Number: A3188697
        Asset Tag: 9876543210
        Part Number: CT102464BA160B.C16
        Rank: 2
        Configured Memory Speed: 1600 MT/s

Handle 0x0067, DMI type 17, 34 bytes
Memory Device
        Array Handle: 0x005E
        Error Information Handle: No Error
        Total Width: 64 bits
        Data Width: 64 bits
        Size: 8 GB
        Form Factor: DIMM
        Set: None
        Locator: ChannelB-DIMM1
        Bank Locator: BANK 3
        Type: DDR3
        Type Detail: Synchronous
        Speed: 1600 MT/s
        Manufacturer: 859B
        Serial Number: A4115DDA
        Asset Tag: 9876543210
        Part Number: CT102464BA160B.C16
        Rank: 2
        Configured Memory Speed: 1600 MT/s


# dmesg
---<<BOOT>>---
Copyright (c) 1992-2023 The FreeBSD Project.
Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
        The Regents of the University of California. All rights reserved.
FreeBSD is a registered trademark of The FreeBSD Foundation.
FreeBSD 14.3-RELEASE releng/14.3-n271432-8c9ce319fef7 GENERIC amd64
FreeBSD clang version 19.1.7 (https://github.com/llvm/llvm-project.git
llvmorg-19.1.7-0-gcd708029e0b2)
VT(vga): resolution 640x480
CPU: Intel(R) Core(TM) i7-3770 CPU @ 3.40GHz (3400.13-MHz K8-class CPU)
  Origin="GenuineIntel"  Id=0x306a9  Family=0x6  Model=0x3a  Stepping=9
 
Features=0xbfebfbff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,CLFLUSH,DTS,ACPI,MMX,FXSR,SSE,SSE2,SS,HTT,TM,PBE>
 
Features2=0x7fbae3ff<SSE3,PCLMULQDQ,DTES64,MON,DS_CPL,VMX,SMX,EST,TM2,SSSE3,CX16,xTPR,PDCM,PCID,SSE4.1,SSE4.2,x2APIC,POPCNT,TSCDLT,AESNI,XSAVE,OSXSAVE,AVX,F16C,RDRAND>
  AMD Features=0x28100800<SYSCALL,NX,RDTSCP,LM>
  AMD Features2=0x1<LAHF>
  Structured Extended Features=0x281<FSGSBASE,SMEP,ERMS>
  Structured Extended Features3=0x9c000000<IBPB,STIBP,L1DFL,SSBD>
  XSAVE Features=0x1<XSAVEOPT>
  VT-x: PAT,HLT,MTF,PAUSE,EPT,UG,VPID
  TSC: P-state invariant, performance statistics
real memory  = 34359738368 (32768 MB)
avail memory = 33275420672 (31733 MB)
Event timer "LAPIC" quality 600
ACPI APIC Table: <ALASKA A M I>
FreeBSD/SMP: Multiprocessor System Detected: 8 CPUs
FreeBSD/SMP: 1 package(s) x 4 core(s) x 2 hardware threads
random: registering fast source Intel Secure Key RNG
random: fast provider: "Intel Secure Key RNG"
random: unblocking device.
ioapic0 <Version 2.0> irqs 0-23
Launching APs: 1 7 6 4 3 2 5
random: entropy device external interface
kbd1 at kbdmux0
vtvga0: <VT VGA driver>
smbios0: <System Management BIOS> at iomem 0xf04c0-0xf04de
smbios0: Entry point: v2.1 (32-bit), Version: 2.7, BCD Revision: 2.7
aesni0: <AES-CBC,AES-CCM,AES-GCM,AES-ICM,AES-XTS>
acpi0: <ALASKA A M I>
acpi0: Power Button (fixed)
cpu0: <ACPI CPU> on acpi0
hpet0: <High Precision Event Timer> iomem 0xfed00000-0xfed003ff on acpi0
Timecounter "HPET" frequency 14318180 Hz quality 950
Event timer "HPET" frequency 14318180 Hz quality 550
atrtc0: <AT realtime clock> port 0x70-0x77 irq 8 on acpi0
atrtc0: Warning: Couldn't map I/O.
atrtc0: registered as a time-of-day clock, resolution 1.000000s
Event timer "RTC" frequency 32768 Hz quality 0
attimer0: <AT timer> port 0x40-0x43,0x50-0x53 irq 0 on acpi0
Timecounter "i8254" frequency 1193182 Hz quality 0
Event timer "i8254" frequency 1193182 Hz quality 100
Timecounter "ACPI-fast" frequency 3579545 Hz quality 900
acpi_timer0: <24-bit timer at 3.579545MHz> port 0x408-0x40b on acpi0
pcib0: <ACPI Host-PCI bridge> port 0xcf8-0xcff on acpi0
pci0: <ACPI PCI bus> on pcib0
pcib1: <ACPI PCI-PCI bridge> irq 16 at device 1.0 on pci0
pci1: <ACPI PCI bus> on pcib1
vgapci0: <VGA-compatible display> port 0xf000-0xf03f mem
0xf7800000-0xf7bfffff,0xe0000000-0xefffffff irq 16 at device 2.0 on pci0
vgapci0: Boot video device
pci0: <simple comms> at device 22.0 (no driver attached)
ehci0: <Intel Panther Point USB 2.0 controller> mem 0xf7d08000-0xf7d083ff irq
23 at device 26.0 on pci0
usbus0: EHCI version 1.0
usbus0 on ehci0
usbus0: 480Mbps High Speed USB v2.0
hdac0: <Intel Panther Point HDA Controller> mem 0xf7d00000-0xf7d03fff irq 22 at
device 27.0 on pci0
pcib2: <ACPI PCI-PCI bridge> irq 16 at device 28.0 on pci0
pci2: <ACPI PCI bus> on pcib2
pcib3: <ACPI PCI-PCI bridge> irq 16 at device 28.4 on pci0
pci3: <ACPI PCI bus> on pcib3
re0: <RealTek 8168/8111 B/C/CP/D/DP/E/F/G PCIe Gigabit Ethernet> port
0xe000-0xe0ff mem 0xf0004000-0xf0004fff,0xf0000000-0xf0003fff irq 16 at device
0.0 on pci3
re0: Using 1 MSI-X message
re0: turning off MSI enable bit.
re0: Chip rev. 0x48000000
re0: MAC rev. 0x00000000
miibus0: <MII bus> on re0
rgephy0: <RTL8169S/8110S/8211 1000BASE-T media interface> PHY 1 on miibus0
rgephy0:  none, 10baseT, 10baseT-FDX, 10baseT-FDX-flow, 100baseTX,
100baseTX-FDX, 100baseTX-FDX-flow, 1000baseT-FDX, 1000baseT-FDX-master,
1000baseT-FDX-flow, 1000baseT-FDX-flow-master, auto, auto-flow
re0: Using defaults for TSO: 65518/35/2048
re0: Ethernet address: 50:46:5d:9f:65:f2
re0: netmap queues/slots: TX 1/256, RX 1/256
pcib4: <ACPI PCI-PCI bridge> irq 18 at device 28.6 on pci0
pci4: <ACPI PCI bus> on pcib4
ahci0: <Marvell 88SE9172 AHCI SATA controller> port
0xd040-0xd047,0xd030-0xd033,0xd020-0xd027,0xd010-0xd013,0xd000-0xd00f mem
0xf7c10000-0xf7c101ff irq 18 at device 0.0 on pci4
ahci0: AHCI v1.00 with 2 6Gbps ports, Port Multiplier supported with FBS
ahci0: quirks=0x1000000<IOMMU_BUSWIDE>
ahcich0: <AHCI channel> at channel 0 on ahci0
ahcich1: <AHCI channel> at channel 1 on ahci0
ehci1: <Intel Panther Point USB 2.0 controller> mem 0xf7d07000-0xf7d073ff irq
23 at device 29.0 on pci0
usbus1: EHCI version 1.0
usbus1 on ehci1
usbus1: 480Mbps High Speed USB v2.0
isab0: <PCI-ISA bridge> at device 31.0 on pci0
isa0: <ISA bus> on isab0
ahci1: <Intel Panther Point AHCI SATA controller> port
0xf0b0-0xf0b7,0xf0a0-0xf0a3,0xf090-0xf097,0xf080-0xf083,0xf060-0xf07f mem
0xf7d06000-0xf7d067ff irq 19 at device 31.2 on pci0
ahci1: AHCI v1.30 with 6 6Gbps ports, Port Multiplier not supported
ahcich2: <AHCI channel> at channel 0 on ahci1
ahcich3: <AHCI channel> at channel 1 on ahci1
ahcich4: <AHCI channel> at channel 2 on ahci1
ahcich5: <AHCI channel> at channel 3 on ahci1
ahcich6: <AHCI channel> at channel 4 on ahci1
ahcich7: <AHCI channel> at channel 5 on ahci1
ahciem0: <AHCI enclosure management bridge> on ahci1
acpi_button0: <Power Button> on acpi0
acpi_tz0: <Thermal Zone> on acpi0
acpi_tz1: <Thermal Zone> on acpi0
atkbdc0: <Keyboard controller (i8042)> port 0x60,0x64 irq 1 on acpi0
atkbd0: <AT Keyboard> irq 1 on atkbdc0
kbd0 at atkbd0
atkbd0: [GIANT-LOCKED]
orm0: <ISA Option ROMs> at iomem
0xc0000-0xce7ff,0xce800-0xd17ff,0xd1800-0xd27ff pnpid ORM0000 on isa0
est0: <Enhanced SpeedStep Frequency Control> on cpu0
cpufreq0: <CPU frequency control> on cpu0
cpufreq1: <CPU frequency control> on cpu1
cpufreq2: <CPU frequency control> on cpu2
cpufreq3: <CPU frequency control> on cpu3
cpufreq4: <CPU frequency control> on cpu4
cpufreq5: <CPU frequency control> on cpu5
cpufreq6: <CPU frequency control> on cpu6
cpufreq7: <CPU frequency control> on cpu7
Timecounter "TSC-low" frequency 1700000097 Hz quality 1000
Timecounters tick every 1.000 msec
ugen1.1: <Intel EHCI root HUB> at usbus1
ugen0.1: <Intel EHCI root HUB> at usbus0
uhub0 on usbus1
uhub0: <Intel EHCI root HUB, class 9/0, rev 2.00/1.00, addr 1> on usbus1
ZFS filesystem version: 5
ZFS storage pool version: features support (5000)
uhub1 on usbus0
uhub1: <Intel EHCI root HUB, class 9/0, rev 2.00/1.00, addr 1> on usbus0
hdacc0: <Realtek ALC892 HDA CODEC> at cad 0 on hdac0
hdaa0: <Realtek ALC892 Audio Function Group> at nid 1 on hdacc0
pcm0: <Realtek ALC892 (Rear Analog 7.1/2.0)> at nid 20,22,21,23 and 24,26 on
hdaa0
pcm1: <Realtek ALC892 (Front Analog)> at nid 27 and 25 on hdaa0
pcm2: <Realtek ALC892 (Rear Digital)> at nid 30 on hdaa0
pcm3: <Realtek ALC892 (Onboard Digital)> at nid 17 on hdaa0
hdacc1: <Intel Panther Point HDA CODEC> at cad 3 on hdac0
hdaa1: <Intel Panther Point Audio Function Group> at nid 1 on hdacc1
pcm4: <Intel Panther Point (HDMI/DP 8ch)> at nid 5 on hdaa1
pcm5: <Intel Panther Point (HDMI/DP 8ch)> at nid 7 on hdaa1
Trying to mount root from zfs:zroot/ROOT/default []...
ada0 at ahcich1 bus 0 scbus1 target 0 lun 0
ada0: <HGST HMS5C4040BLE640 MPAOA5D0> ATA8-ACS SATA 3.x device
ada0: Serial Number PL2331LAGUP9BJ
ada0: 600.000MB/s transfers (SATA 3.x, UDMA6, PIO 8192bytes)
ada0: Command Queueing enabled
ada0: 3815447MB (7814037168 512 byte sectors)
ada1 at ahcich2 bus 0 scbus2 target 0 lun 0
ada1: <HGST HMS5C4040BLE640 MPAOA5D0> ATA8-ACS SATA 3.x device
ada1: Serial Number PL2331LAGUSZYJ
ada1: 600.000MB/s transfers (SATA 3.x, UDMA6, PIO 8192bytes)
ada1: Command Queueing enabled
ada1: 3815447MB (7814037168 512 byte sectors)
ada2 at ahcich3 bus 0 scbus3 target 0 lun 0
ada2: <HGST HMS5C4040BLE640 MPAOA5D0> ATA8-ACS SATA 3.x device
ada2: Serial Number PL2331LAGUT00J
ada2: 600.000MB/s transfers (SATA 3.x, UDMA6, PIO 8192bytes)
ada2: Command Queueing enabled
ada2: 3815447MB (7814037168 512 byte sectors)
ada3 at ahcich4 bus 0 scbus4 target 0 lun 0
ada3: <HGST HMS5C4040BLE640 MPAOA5D0> ATA8-ACS SATA 3.x device
ada3: Serial Number PL2331LAGUT6MJ
ada3: 300.000MB/s transfers (SATA 2.x, UDMA6, PIO 8192bytes)
ada3: Command Queueing enabled
ada3: 3815447MB (7814037168 512 byte sectors)
ada4 at ahcich5 bus 0 scbus5 target 0 lun 0
ada4: <HGST HMS5C4040BLE640 MPAOA5D0> ATA8-ACS SATA 3.x device
ada4: Serial Number PL2331LAGUT3XJ
ada4: 300.000MB/s transfers (SATA 2.x, UDMA6, PIO 8192bytes)
ada4: Command Queueing enabled
ada4: 3815447MB (7814037168 512 byte sectors)
ada5 at ahcich6 bus 0 scbus6 target 0 lun 0
ada5: <HGST HMS5C4040BLE640 MPAOA5D0> ATA8-ACS SATA 3.x device
ada5: Serial Number PL2331LAGUSZAJ
ada5: 300.000MB/s transfers (SATA 2.x, UDMA6, PIO 8192bytes)
ada5: Command Queueing enabled
ada5: 3815447MB (7814037168 512 byte sectors)
ada6 at ahcich7 bus 0 scbus7 target 0 lun 0
ada6: <HGST HMS5C4040BLE640 MPAOA5D0> ATA8-ACS SATA 3.x device
ada6: Serial Number PL2331LAGUT0YJ
ada6: 300.000MB/s transfers (SATA 2.x, UDMA6, PIO 8192bytes)
ada6: Command Queueing enabled
ada6: 3815447MB (7814037168 512 byte sectors)
ses0 at ahciem0 bus 0 scbus8 target 0 lun 0
ses0: <AHCI SGPIO Enclosure 2.00 0001> SEMB S-E-S 2.00 device
ses0: SEMB SES Device
ses0: ada1,pass1 in 'Slot 00', SATA Slot: scbus2 target 0
ses0: ada2,pass2 in 'Slot 01', SATA Slot: scbus3 target 0
ses0: ada3,pass3 in 'Slot 02', SATA Slot: scbus4 target 0
ses0: ada4,pass4 in 'Slot 03', SATA Slot: scbus5 target 0
ses0: ada5,pass5 in 'Slot 04', SATA Slot: scbus6 target 0
ses0: ada6,pass6 in 'Slot 05', SATA Slot: scbus7 target 0
uhub0: 2 ports with 2 removable, self powered
uhub1: 2 ports with 2 removable, self powered
GEOM_MIRROR: Device mirror/swap launched (7/7).
Root mount waiting for: usbus0 usbus1
ugen1.2: <vendor 0x8087 product 0x0024> at usbus1
uhub2 on uhub0
uhub2: <vendor 0x8087 product 0x0024, class 9/0, rev 2.00/0.00, addr 2> on
usbus1
ugen0.2: <vendor 0x8087 product 0x0024> at usbus0
uhub3 on uhub1
uhub3: <vendor 0x8087 product 0x0024, class 9/0, rev 2.00/0.00, addr 2> on
usbus0
Root mount waiting for: usbus0 usbus1
uhub3: 6 ports with 6 removable, self powered
uhub2: 8 ports with 8 removable, self powered
GEOM_ELI: Device mirror/swap.eli created.
GEOM_ELI: Encryption: AES-XTS 128
GEOM_ELI:     Crypto: accelerated software
ichsmb0: <Intel Panther Point SMBus controller> port 0xf040-0xf05f mem
0xf7d05000-0xf7d050ff irq 18 at device 31.3 on pci0
smbus0: <System Management Bus> on ichsmb0
acpi_wmi0: <ACPI-WMI mapping> on acpi0
acpi_wmi0: cannot find EC device
acpi_wmi0: Embedded MOF found
ACPI: \134AMW0.WQMO: 1 arguments were passed to a non-method ACPI object
(Buffer) (20221020/nsarguments-361)
re0: link state changed to UP
lo0: link state changed to UP
re0: link state changed to DOWN
re0: link state changed to UP
pflog0: promiscuous mode enabled
Security policy loaded: MAC/ntpd (mac_ntpd)
ovpn0: changing name to 'tun0'
tun0: link state changed to UP
wg0: link state changed to UP

-- 
You are receiving this mail because:
You are the assignee for the bug.

Reply via email to