[Bug 1972149] Re: "mcelog --client" cannot ouput after performing PFA test on Ubunt22.04 and SR850v2

2022-05-16 Thread Jeff Lane 
** Changed in: mcelog (Ubuntu)
   Status: Incomplete => Invalid

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1972149

Title:
  "mcelog --client" cannot ouput after performing PFA test on Ubunt22.04
  and SR850v2

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/mcelog/+bug/1972149/+subscriptions


-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs

[Bug 1972149] Re: "mcelog --client" cannot ouput after performing PFA test on Ubunt22.04 and SR850v2

2022-05-10 Thread conie chang
Hi Seth,

I set wrong information type and I correct the type to Public. Sorry for
the wrong setting.

I tried to replace rasdaemon and edac-utils, and run again the test. And
it can catch the mce error log after PFA test. The test result is below.
I think we can close this bug. Thank you.

Steps:
root@conie:/home/conie# sudo rasdaemon --enable
rasdaemon: ras:mc_event event enabled
rasdaemon: ras:aer_event event enabled
rasdaemon: mce:mce_record event enabled
rasdaemon: ras:extlog_mem_event event enabled
root@conie:/home/conie# sudo systemctl enable rasdaemon
Synchronizing state of rasdaemon.service with SysV service script with 
/lib/systemd/systemd-sysv-install.
Executing: /lib/systemd/systemd-sysv-install enable rasdaemon
root@conie:/home/conie#
root@conie:/home/conie# sudo systemctl start rasdaemon
root@conie:/home/conie# sudo systemctl status rasdaemon
● rasdaemon.service - RAS daemon to log the RAS events
 Loaded: loaded (/lib/systemd/system/rasdaemon.service; enabled; vendor 
preset: enabled)
 Active: active (running) since Tue 2022-05-10 08:28:19 UTC; 2min 32s ago
   Main PID: 1692 (rasdaemon)
  Tasks: 1 (limit: 308967)
 Memory: 13.6M
CPU: 171ms
 CGroup: /system.slice/rasdaemon.service
 └─1692 /usr/sbin/rasdaemon -f -r

May 10 08:28:18 conie rasdaemon[1692]: rasdaemon: Enabled event 
ras:extlog_mem_event
May 10 08:28:18 conie rasdaemon[1692]: Enabled event mce:mce_record
May 10 08:28:18 conie rasdaemon[1692]: ras:extlog_mem_event event enabled
May 10 08:28:18 conie rasdaemon[1692]: Enabled event ras:extlog_mem_event
May 10 08:28:18 conie rasdaemon[1692]: rasdaemon: Listening to events for cpus 
0 to 71
May 10 08:28:19 conie systemd[1]: Started RAS daemon to log the RAS events.
May 10 08:28:19 conie rasdaemon[1692]: rasdaemon: Recording mc_event events
May 10 08:28:19 conie rasdaemon[1692]: rasdaemon: Recording aer_event events
May 10 08:28:19 conie rasdaemon[1692]: rasdaemon: Recording extlog_event events
May 10 08:28:19 conie rasdaemon[1692]: rasdaemon: Recording mce_record events
root@conie:/home/conie# ras-mc-ctl --summary
No Memory errors.

No PCIe AER errors.

No Extlog errors.

No MCE errors.
root@conie:/home/conie# ras-mc-ctl --errors
No Memory errors.

No PCIe AER errors.

No Extlog errors.

No MCE errors.

root@conie:/home/conie# cd ras-tool-master
root@conie:/home/conie/ras-tool-master# ./Init.sh
./Init.sh: line 1: mcelog: command not found
mount: /sys/kernel/debug: none already mounted on 
/run/credentials/systemd-sysusers.service.
root@conie:/home/conie/ras-tool-master# chmod -R 777 injection_error.sh
root@conie:/home/conie/ras-tool-master# ./mca-recover
flags for page 15e98f: uptodate mmap anon swapbacked
vtop(7f9749724000) = 15e98f000
Hit any key to access: ^Z
[1]+  Stopped ./mca-recover
root@conie:/home/conie/ras-tool-master# ./injection_error.sh 0x8 0x15e98f000 
0xf000 10
0x0008  Memory Correctable
0x0010  Memory Uncorrectable non-fatal
0x0020  Memory Uncorrectable fatal
Injecting Correctable Memory Error
Injecting 10 errors at address 0x15e98f000.
System performance will be affected while errors are being injected.
inject times: 1
inject times: 2
inject times: 3
inject times: 4
inject times: 5
inject times: 6
inject times: 7
inject times: 8
inject times: 9
inject times: 10
Injection Complete
root@conie:/home/conie/ras-tool-master# dmesg|tail
[  357.937520] {1}[Hardware Error]: event severity: corrected
[  357.937522] {1}[Hardware Error]:  Error 0, type: corrected
[  357.937523] {1}[Hardware Error]:   section_type: memory error
[  357.937524] {1}[Hardware Error]:   error_status: 0x0400
[  357.937525] {1}[Hardware Error]:   physical_address: 0x00015e98f000
[  357.937527] {1}[Hardware Error]:   physical_address_mask: 0x3fc0
[  357.937529] {1}[Hardware Error]:   node: 0 card: 0 module: 0 rank: 1 bank: 8 
device: 1 row: 5068 column: 448
[  357.937529] {1}[Hardware Error]:   error_type: 2, single-bit ECC
[  357.937531] {1}[Hardware Error]:   DIMM location: CPU 1 DIMM 8
[  357.941117] mce: [Hardware Error]: Machine check events logged
root@conie:/home/conie/ras-tool-master# ras-mc-ctl --summary
No Memory errors.

No PCIe AER errors.

No Extlog errors.

MCE records summary:
6 MEMORY CONTROLLER RD_CHANNELunspecified_ERR Transaction: Memory read 
error errors
root@conie:/home/conie/ras-tool-master# ras-mc-ctl --errors
No Memory errors.

No PCIe AER errors.

No Extlog errors.

MCE events:
1 2022-05-10 08:33:53 + error: MEMORY CONTROLLER RD_CHANNELunspecified_ERR 
Transaction: Memory read error, mcg mcgstatus=0, mci Corrected_error 
Error_enabled, n_errors=0, mcgcap=0x0f000c17, status=0x9c9f, 
addr=0x15e98f000, misc=0x008c, walltime=0x627a2373, cpuid=0x0005065b, 
bank=0x00ff
2 2022-05-10 08:34:08 + error: MEMORY CONTROLLER RD_CHANNELunspecified_ERR 
Transaction: Memory read error, mcg mcgstatus=0, mci Corrected_error 
Error_enabled, n_error

[Bug 1972149] Re: "mcelog --client" cannot ouput after performing PFA test on Ubunt22.04 and SR850v2

2022-05-09 Thread conie chang
** Information type changed from Private Security to Public Security

** Information type changed from Public Security to Public

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1972149

Title:
  "mcelog --client" cannot ouput after performing PFA test on Ubunt22.04
  and SR850v2

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/mcelog/+bug/1972149/+subscriptions


-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs

[Bug 1972149] Re: "mcelog --client" cannot ouput after performing PFA test on Ubunt22.04 and SR850v2

2022-05-09 Thread conie chang
** Attachment added: "syslog"
   
https://bugs.launchpad.net/ubuntu/+source/mcelog/+bug/1972149/+attachment/5587841/+files/syslog

** Information type changed from Public to Public Security

** Information type changed from Public Security to Private Security

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1972149

Title:
  "mcelog --client" cannot ouput after performing PFA test on Ubunt22.04
  and SR850v2

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/mcelog/+bug/1972149/+subscriptions


-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs

[Bug 1972149] Re: "mcelog --client" cannot ouput after performing PFA test on Ubunt22.04 and SR850v2

2022-05-09 Thread conie chang
** Attachment added: "mcelog configuration file"
   
https://bugs.launchpad.net/ubuntu/+source/mcelog/+bug/1972149/+attachment/5587839/+files/mcelog.conf

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1972149

Title:
  "mcelog --client" cannot ouput after performing PFA test on Ubunt22.04
  and SR850v2

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/mcelog/+bug/1972149/+subscriptions


-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs

[Bug 1972149] Re: "mcelog --client" cannot ouput after performing PFA test on Ubunt22.04 and SR850v2

2022-05-09 Thread conie chang
mcelog version: v181 
#git clone https://kernel.googlesource.com/pub/scm/utils/cpu/mce/mcelog.git

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1972149

Title:
  "mcelog --client" cannot ouput after performing PFA test on Ubunt22.04
  and SR850v2

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/mcelog/+bug/1972149/+subscriptions


-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs