Hi Seth,

I had set the wrong information type and have now corrected it to
Public. Sorry for the mistake.

I switched to rasdaemon and edac-utils and ran the test again. It now
catches the MCE error log after the PFA test. The test results are
below. I think we can close this bug. Thank you.
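
For anyone who wants to script the pass criterion used here (MCE records
showing up in the ras-mc-ctl summary after the injection), a minimal sketch
follows. The function name has_mce_records is hypothetical and not part of
rasdaemon; it only looks for the "MCE records summary:" header that appears
in the output further down, so treat it as an illustration rather than a
supported interface.

```shell
#!/bin/sh
# Hypothetical helper (not shipped with rasdaemon).
# Reads `ras-mc-ctl --summary` output on stdin and succeeds only if it
# contains an "MCE records summary:" header, which is what the summary
# shows in this report once MCE events have been recorded. An empty MCE
# table is reported as "No MCE errors." instead.
has_mce_records() {
    grep -q '^MCE records summary:'
}
```

Assumed usage after the injection step:
ras-mc-ctl --summary | has_mce_records && echo "MCE errors were logged"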

Steps:
root@conie:/home/conie# sudo rasdaemon --enable
rasdaemon: ras:mc_event event enabled
rasdaemon: ras:aer_event event enabled
rasdaemon: mce:mce_record event enabled
rasdaemon: ras:extlog_mem_event event enabled
root@conie:/home/conie# sudo systemctl enable rasdaemon
Synchronizing state of rasdaemon.service with SysV service script with 
/lib/systemd/systemd-sysv-install.
Executing: /lib/systemd/systemd-sysv-install enable rasdaemon
root@conie:/home/conie#
root@conie:/home/conie# sudo systemctl start rasdaemon
root@conie:/home/conie# sudo systemctl status rasdaemon
● rasdaemon.service - RAS daemon to log the RAS events
     Loaded: loaded (/lib/systemd/system/rasdaemon.service; enabled; vendor 
preset: enabled)
     Active: active (running) since Tue 2022-05-10 08:28:19 UTC; 2min 32s ago
   Main PID: 1692 (rasdaemon)
      Tasks: 1 (limit: 308967)
     Memory: 13.6M
        CPU: 171ms
     CGroup: /system.slice/rasdaemon.service
             └─1692 /usr/sbin/rasdaemon -f -r

May 10 08:28:18 conie rasdaemon[1692]: rasdaemon: Enabled event 
ras:extlog_mem_event
May 10 08:28:18 conie rasdaemon[1692]: Enabled event mce:mce_record
May 10 08:28:18 conie rasdaemon[1692]: ras:extlog_mem_event event enabled
May 10 08:28:18 conie rasdaemon[1692]: Enabled event ras:extlog_mem_event
May 10 08:28:18 conie rasdaemon[1692]: rasdaemon: Listening to events for cpus 
0 to 71
May 10 08:28:19 conie systemd[1]: Started RAS daemon to log the RAS events.
May 10 08:28:19 conie rasdaemon[1692]: rasdaemon: Recording mc_event events
May 10 08:28:19 conie rasdaemon[1692]: rasdaemon: Recording aer_event events
May 10 08:28:19 conie rasdaemon[1692]: rasdaemon: Recording extlog_event events
May 10 08:28:19 conie rasdaemon[1692]: rasdaemon: Recording mce_record events
root@conie:/home/conie# ras-mc-ctl --summary
No Memory errors.

No PCIe AER errors.

No Extlog errors.

No MCE errors.
root@conie:/home/conie# ras-mc-ctl --errors
No Memory errors.

No PCIe AER errors.

No Extlog errors.

No MCE errors.

root@conie:/home/conie# cd ras-tool-master
root@conie:/home/conie/ras-tool-master# ./Init.sh
./Init.sh: line 1: mcelog: command not found
mount: /sys/kernel/debug: none already mounted on 
/run/credentials/systemd-sysusers.service.
root@conie:/home/conie/ras-tool-master# chmod -R 777 injection_error.sh
root@conie:/home/conie/ras-tool-master# ./mca-recover
flags for page 15e98f: uptodate mmap anon swapbacked
vtop(7f9749724000) = 15e98f000
Hit any key to access: ^Z
[1]+  Stopped                 ./mca-recover
root@conie:/home/conie/ras-tool-master# ./injection_error.sh 0x8 0x15e98f000 0xfffffffffffff000 10
0x00000008      Memory Correctable
0x00000010      Memory Uncorrectable non-fatal
0x00000020      Memory Uncorrectable fatal
Injecting Correctable Memory Error
Injecting 10 errors at address 0x15e98f000.
System performance will be affected while errors are being injected.
inject times: 1
inject times: 2
inject times: 3
inject times: 4
inject times: 5
inject times: 6
inject times: 7
inject times: 8
inject times: 9
inject times: 10
Injection Complete
root@conie:/home/conie/ras-tool-master# dmesg|tail
[  357.937520] {1}[Hardware Error]: event severity: corrected
[  357.937522] {1}[Hardware Error]:  Error 0, type: corrected
[  357.937523] {1}[Hardware Error]:   section_type: memory error
[  357.937524] {1}[Hardware Error]:   error_status: 0x0000000000000400
[  357.937525] {1}[Hardware Error]:   physical_address: 0x000000015e98f000
[  357.937527] {1}[Hardware Error]:   physical_address_mask: 0x00003fffffffffc0
[  357.937529] {1}[Hardware Error]:   node: 0 card: 0 module: 0 rank: 1 bank: 8 
device: 1 row: 5068 column: 448
[  357.937529] {1}[Hardware Error]:   error_type: 2, single-bit ECC
[  357.937531] {1}[Hardware Error]:   DIMM location: CPU 1 DIMM 8
[  357.941117] mce: [Hardware Error]: Machine check events logged
root@conie:/home/conie/ras-tool-master# ras-mc-ctl --summary
No Memory errors.

No PCIe AER errors.

No Extlog errors.

MCE records summary:
        6 MEMORY CONTROLLER RD_CHANNELunspecified_ERR Transaction: Memory read 
error errors
root@conie:/home/conie/ras-tool-master# ras-mc-ctl --errors
No Memory errors.

No PCIe AER errors.

No Extlog errors.

MCE events:
1 2022-05-10 08:33:53 +0000 error: MEMORY CONTROLLER RD_CHANNELunspecified_ERR 
Transaction: Memory read error, mcg mcgstatus=0, mci Corrected_error 
Error_enabled, n_errors=0, mcgcap=0x0f000c17, status=0x9c0000000000009f, 
addr=0x15e98f000, misc=0x0000008c, walltime=0x627a2373, cpuid=0x0005065b, 
bank=0x000000ff
2 2022-05-10 08:34:08 +0000 error: MEMORY CONTROLLER RD_CHANNELunspecified_ERR 
Transaction: Memory read error, mcg mcgstatus=0, mci Corrected_error 
Error_enabled, n_errors=0, mcgcap=0x0f000c17, status=0x9c0000000000009f, 
addr=0x15e98f000, misc=0x0000008c, walltime=0x627a2383, cpu=0x00000024, 
cpuid=0x0005065b, apicid=0x00000001, bank=0x000000ff
3 2022-05-10 08:34:08 +0000 error: MEMORY CONTROLLER RD_CHANNELunspecified_ERR 
Transaction: Memory read error, mcg mcgstatus=0, mci Corrected_error 
Error_enabled, n_errors=0, mcgcap=0x0f000c17, status=0x9c0000000000009f, 
addr=0x15e98f000, misc=0x0000008c, walltime=0x627a2383, cpu=0x00000024, 
cpuid=0x0005065b, apicid=0x00000001, bank=0x000000ff
4 2022-05-10 08:34:08 +0000 error: MEMORY CONTROLLER RD_CHANNELunspecified_ERR 
Transaction: Memory read error, mcg mcgstatus=0, mci Corrected_error 
Error_enabled, n_errors=0, mcgcap=0x0f000c17, status=0x9c0000000000009f, 
addr=0x15e98f000, misc=0x0000008c, walltime=0x627a2383, cpu=0x00000024, 
cpuid=0x0005065b, apicid=0x00000001, bank=0x000000ff
5 2022-05-10 08:34:08 +0000 error: MEMORY CONTROLLER RD_CHANNELunspecified_ERR 
Transaction: Memory read error, mcg mcgstatus=0, mci Corrected_error 
Error_enabled, n_errors=0, mcgcap=0x0f000c17, status=0x9c0000000000009f, 
addr=0x15e98f000, misc=0x0000008c, walltime=0x627a2383, cpu=0x00000024, 
cpuid=0x0005065b, apicid=0x00000001, bank=0x000000ff
6 2022-05-10 08:34:08 +0000 error: MEMORY CONTROLLER RD_CHANNELunspecified_ERR 
Transaction: Memory read error, mcg mcgstatus=0, mci Corrected_error 
Error_enabled, n_errors=0, mcgcap=0x0f000c17, status=0x9c0000000000009f, 
addr=0x15e98f000, misc=0x0000008c, walltime=0x627a2383, cpu=0x00000024, 
cpuid=0x0005065b, apicid=0x00000001, bank=0x000000ff

root@conie:/home/conie/ras-tool-master#
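
As a cross-check of the decoded text above, the status value
0x9c0000000000009f can be split into its architectural fields by hand. The
sketch below assumes the IA32_MCi_STATUS layout from the Intel SDM (VAL bit
63, OVER 62, UC 61, EN 60, MISCV 59, ADDRV 58, PCC 57, MCA error code in
bits 15:0) and the memory-controller error code form 0000 0000 1MMM CCCC.
decode_mci_status is a hypothetical helper name, not a rasdaemon tool.

```shell
#!/bin/sh
# Hypothetical helper: print selected IA32_MCi_STATUS fields for a hex value.
decode_mci_status() {
    hex=${1#0x}
    # Left-pad to 16 hex digits, then split into two 32-bit halves so the
    # arithmetic stays within signed 64-bit shell integers.
    hex=$(printf '%16s' "$hex" | tr ' ' '0')
    hi=$((0x$(printf '%s' "$hex" | cut -c1-8)))    # bits 63:32
    lo=$((0x$(printf '%s' "$hex" | cut -c9-16)))   # bits 31:0

    echo "VAL=$(( (hi >> 31) & 1 ))"     # bit 63: register contents valid
    echo "OVER=$(( (hi >> 30) & 1 ))"    # bit 62: error overflow
    echo "UC=$(( (hi >> 29) & 1 ))"      # bit 61: uncorrected error
    echo "EN=$(( (hi >> 28) & 1 ))"      # bit 60: error reporting enabled
    echo "MISCV=$(( (hi >> 27) & 1 ))"   # bit 59: MISC register valid
    echo "ADDRV=$(( (hi >> 26) & 1 ))"   # bit 58: ADDR register valid
    echo "PCC=$(( (hi >> 25) & 1 ))"     # bit 57: processor context corrupt

    mca=$(( lo & 0xffff ))               # bits 15:0: MCA error code
    printf 'MCACOD=0x%04x\n' "$mca"

    # Memory-controller errors are encoded as 0000 0000 1MMM CCCC.
    if [ $(( mca & 0xff80 )) -eq $(( 0x0080 )) ]; then
        echo "MMM=$(( (mca >> 4) & 7 ))"   # 1 = memory read
        echo "CCCC=$(( mca & 15 ))"        # 15 = channel not specified
    fi
}
```

Running decode_mci_status 0x9c0000000000009f gives UC=0 and EN=1, which
matches "Corrected_error Error_enabled", and MMM=1 with CCCC=15, which
matches "RD_CHANNELunspecified_ERR ... Memory read error".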

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1972149

Title:
  "mcelog --client" cannot ouput after performing PFA test on Ubunt22.04
  and SR850v2

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/mcelog/+bug/1972149/+subscriptions


-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs
