Hi, Hugo,

On 1/8/26 17:53, Hugo Villeneuve wrote:
Hi Claudiu,

On Thu, 8 Jan 2026 11:12:54 +0200
Claudiu Beznea <[email protected]> wrote:

Hi, Hugo,

On 1/7/26 23:48, Hugo Villeneuve wrote:
Hi,
when issuing a reboot command, I encounter the following kernel panic:

[   36.183478] SError Interrupt on CPU1, code 0x00000000be000011 -- SError
[   36.183492] CPU: 1 UID: 0 PID: 1 Comm: systemd-shutdow Tainted: G   M        
        6.19.0-rc4-arm64-renesas-00019-g067a81578add #62 NONE
[   36.183504] Tainted: [M]=MACHINE_CHECK
[   36.183507] Hardware name: Gecko ECO2 nxtpad (DT)
[   36.183512] pstate: 80400005 (Nzcv daif +PAN -UAO -TCO -DIT -SSBS BTYPE=--)
[   36.183519] pc : rzg2l_mipi_dsi_host_transfer+0x114/0x458
[   36.183538] lr : rzg2l_mipi_dsi_host_transfer+0x98/0x458
[   36.183547] sp : ffff8000813db860
[   36.183550] x29: ffff8000813db890 x28: ffff800080c602c0 x27: ffff000009dd7450
[   36.183563] x26: ffff800080c5fcc0 x25: ffff000009dd7450 x24: ffff800080e1f7a8
[   36.183573] x23: ffff000009dd7400 x22: 0000000000000000 x21: ffff000009dd7430
[   36.183582] x20: ffff8000813db8e8 x19: 0000000002050028 x18: 00000000ffffffff
[   36.183592] x17: 0000000000000000 x16: 0000000000000000 x15: ffff8000813db220
[   36.183602] x14: 0000000000000000 x13: ffff800081255bc0 x12: 00000000000009a2
[   36.183611] x11: 0000000000000336 x10: ffff8000812b28d0 x9 : ffff800081255bc0
[   36.183621] x8 : ffff800081399000 x7 : ffff00000a042600 x6 : 0000000000000000
[   36.183631] x5 : 0000000000000805 x4 : 0000000002000000 x3 : 0000000000000028
[   36.183640] x2 : 0000000049627000 x1 : ffff800080c60b40 x0 : ffff800081780000
[   36.183652] Kernel panic - not syncing: Asynchronous SError Interrupt
[   36.183657] CPU: 1 UID: 0 PID: 1 Comm: systemd-shutdow Tainted: G   M        
        6.19.0-rc4-arm64-renesas-00019-g067a81578add #62 NONE
[   36.183665] Tainted: [M]=MACHINE_CHECK
[   36.183668] Hardware name: devboard1 (DT)
[   36.183672] Call trace:
[   36.183675]  show_stack+0x18/0x24 (C)
[   36.183692]  dump_stack_lvl+0x34/0x8c
[   36.183702]  dump_stack+0x18/0x24
[   36.183708]  vpanic+0x314/0x35c
[   36.183716]  nmi_panic+0x0/0x64
[   36.183722]  add_taint+0x0/0xbc
[   36.183728]  arm64_serror_panic+0x70/0x80
[   36.183735]  do_serror+0x28/0x68
[   36.183742]  el1h_64_error_handler+0x34/0x50
[   36.183751]  el1h_64_error+0x6c/0x70
[   36.183758]  rzg2l_mipi_dsi_host_transfer+0x114/0x458 (P)
[   36.183770]  mipi_dsi_device_transfer+0x44/0x58
[   36.183781]  mipi_dsi_dcs_set_display_off_multi+0x9c/0xc4
[   36.183792]  ili9881c_unprepare+0x38/0x88
[   36.183802]  drm_panel_unprepare+0xbc/0x108
[   36.183814]  panel_bridge_atomic_post_disable+0x50/0x60
[   36.183823]  drm_atomic_bridge_call_post_disable+0x24/0x4c
[   36.183835]  drm_atomic_bridge_chain_post_disable+0xa8/0x100
[   36.183845]  drm_atomic_helper_commit_modeset_disables+0x2fc/0x5f8
[   36.183856]  drm_atomic_helper_commit_tail_rpm+0x24/0x7c
[   36.183865]  commit_tail+0xa4/0x18c
[   36.183874]  drm_atomic_helper_commit+0x17c/0x194
[   36.183884]  drm_atomic_commit+0x8c/0xcc
[   36.183892]  drm_atomic_helper_disable_all+0x200/0x210
[   36.183901]  drm_atomic_helper_shutdown+0xa8/0x150
[   36.183911]  rzg2l_du_shutdown+0x18/0x24
[   36.183920]  platform_shutdown+0x24/0x34
[   36.183931]  device_shutdown+0x128/0x284
[   36.183938]  kernel_restart+0x44/0xa4
[   36.183950]  __do_sys_reboot+0x178/0x270
[   36.183959]  __arm64_sys_reboot+0x24/0x30
[   36.183968]  invoke_syscall.constprop.0+0x50/0xe4
[   36.183979]  do_el0_svc+0x40/0xc0
[   36.183988]  el0_svc+0x3c/0x164
[   36.183995]  el0t_64_sync_handler+0xa0/0xe4
[   36.184002]  el0t_64_sync+0x198/0x19c
[   36.184020] Kernel Offset: disabled
[   36.184022] CPU features: 0x200000,00020001,4000c501,0400720b
[   36.184028] Memory Limit: none
[   36.495305] ---[ end Kernel panic - not syncing: Asynchronous SError 
Interrupt ]---

The problem is present since linux-6.18-rc1, but not in linux-6.17. I also 
confirm the bug is present in linux-6.19-rc4.

The bug seems to be happening in rzg2l_mipi_dsi_host_transfer().

After bisecting, here is the first bad commit:

      commit 56de5e305d4b ("clk: renesas: r9a07g044: Add MSTOP for RZ/G2L")

Reverting this change makes the bug disappear.

My limited understanding seems to indicate that the MIPI/DSI host may
no longer be available/on when the panel tries to send MIPI/DSI
commands in ili9881c_unprepare(), maybe because the MIPI/DSI clock has been 
stopped...

The exact same board with two other panels (jd9365da and st7703) doesn't have 
the bug.

Could you please provide the output of command:

cat /sys/kernel/debug/mstop

for both cases?

Here it is for the panel which has the bug:

----------------------------------
                            MSTOP
                      clk   -------------------------
clk_name             cnt   cnt   off   val    shared
--------             ----- ----- ----- ------ ------
gic                  1     1     0xb80 0x0
ia55_clk             2     2     0xb70 0x0    ia55_pclk ia55_clk
ia55_pclk            1     2     0xb70 0x0    ia55_pclk ia55_clk
dmac_aclk            2     1     0xb80 0x0
dmac_pclk            1     1     0xb80 0x0
ostm0_pclk           0     0     0xb7c 0x10
ostm1_pclk           1     1     0xb7c 0x0
ostm2_pclk           1     1     0xb7c 0x0
mtu_x_mck            0     0     0xb64 0x4
gpt_pclk             1     1     0xb64 0x0
poeg_a_clkp          0     0     0xb64 0x20
poeg_b_clkp          0     0     0xb64 0x40
poeg_c_clkp          0     0     0xb64 0x80
poeg_d_clkp          0     0     0xb64 0x100
wdt0_pclk            1     2     0xb7c 0x0    wdt0_pclk wdt0_clk
wdt0_clk             1     2     0xb7c 0x0    wdt0_pclk wdt0_clk
wdt1_pclk            0     0     0xb7c 0x8    wdt1_pclk wdt1_clk
wdt1_clk             0     0     0xb7c 0x8    wdt1_pclk wdt1_clk
spi_clk2             0     0     0xb64 0x2    spi_clk2 spi_clk
spi_clk              0     0     0xb64 0x2    spi_clk2 spi_clk
sdhi0_imclk          1     4     0xb6c 0x0    sdhi0_imclk sdhi0_imclk2 
sdhi0_clk_hs sdhi0_aclk
sdhi0_imclk2         2     4     0xb6c 0x0    sdhi0_imclk sdhi0_imclk2 
sdhi0_clk_hs sdhi0_aclk
sdhi0_clk_hs         1     4     0xb6c 0x0    sdhi0_imclk sdhi0_imclk2 
sdhi0_clk_hs sdhi0_aclk
sdhi0_aclk           1     4     0xb6c 0x0    sdhi0_imclk sdhi0_imclk2 
sdhi0_clk_hs sdhi0_aclk
sdhi1_imclk          0     0     0xb6c 0x2    sdhi1_imclk sdhi1_imclk2 
sdhi1_clk_hs sdhi1_aclk
sdhi1_imclk2         0     0     0xb6c 0x2    sdhi1_imclk sdhi1_imclk2 
sdhi1_clk_hs sdhi1_aclk
sdhi1_clk_hs         0     0     0xb6c 0x2    sdhi1_imclk sdhi1_imclk2 
sdhi1_clk_hs sdhi1_aclk
sdhi1_aclk           0     0     0xb6c 0x2    sdhi1_imclk sdhi1_imclk2 
sdhi1_clk_hs sdhi1_aclk
gpu_clk              1     1     0xb80 0x0
cru_sysclk           0     0     0xb78 0x8    cru_sysclk cru_vclk cru_pclk 
cru_aclk
cru_vclk             0     0     0xb78 0x8    cru_sysclk cru_vclk cru_pclk 
cru_aclk
cru_pclk             0     0     0xb78 0x8    cru_sysclk cru_vclk cru_pclk 
cru_aclk
cru_aclk             0     0     0xb78 0x8    cru_sysclk cru_vclk cru_pclk 
cru_aclk

dsi_pll_clk          1     6     0xb78 0x0    dsi_pll_clk dsi_sys_clk dsi_aclk 
dsi_pclk dsi_vclk dsi_lpclk
dsi_sys_clk          1     6     0xb78 0x0    dsi_pll_clk dsi_sys_clk dsi_aclk 
dsi_pclk dsi_vclk dsi_lpclk
dsi_aclk             1     6     0xb78 0x0    dsi_pll_clk dsi_sys_clk dsi_aclk 
dsi_pclk dsi_vclk dsi_lpclk
dsi_pclk             1     6     0xb78 0x0    dsi_pll_clk dsi_sys_clk dsi_aclk 
dsi_pclk dsi_vclk dsi_lpclk
dsi_vclk             1     6     0xb78 0x0    dsi_pll_clk dsi_sys_clk dsi_aclk 
dsi_pclk dsi_vclk dsi_lpclk
dsi_lpclk            1     6     0xb78 0x0    dsi_pll_clk dsi_sys_clk dsi_aclk 
dsi_pclk dsi_vclk dsi_lpclk

I was expected the MSTOP for these to be set to anythong other than 0x0 here. But I presume, they are somehow set in the reboot process before exectution reach rzg2l_mipi_dsi_host_transfer(). To be honest, I don't know the rz-du code.

[...]


I do not have acces to the other panels for the moment to run the same command.


Also, could you please check if the following diff solves your problem:

diff --git a/drivers/gpu/drm/renesas/rz-du/rzg2l_mipi_dsi.c
b/drivers/gpu/drm/renesas/rz-du/rzg2l_mipi_dsi.c
index 5edd45424562..62957632a96f 100644
--- a/drivers/gpu/drm/renesas/rz-du/rzg2l_mipi_dsi.c
+++ b/drivers/gpu/drm/renesas/rz-du/rzg2l_mipi_dsi.c
@@ -1282,6 +1282,10 @@ static ssize_t
rzg2l_mipi_dsi_host_transfer(struct mipi_dsi_host *host,
                  value |= SQCH0DSC0AR_FMT_SHORT;
          }

+       ret = pm_runtime_resume_and_get(dsi->dev);
+       if (ret)
+               return ret;
+
          rzg2l_mipi_dsi_link_write(dsi, SQCH0DSC0AR, value);

          /*
@@ -1322,6 +1326,8 @@ static ssize_t rzg2l_mipi_dsi_host_transfer(struct
mipi_dsi_host *host,
                          ret = packet.payload_length;
          }

+       pm_runtime_put(dsi->dev);
+
          return ret;
   }

I confirm that it fixes the bug, altought I assume this is just for testing and is not 
the "proper" fix.

To me, this, or something similar should be done anyway. Previously it used to work because there was no MSTOP support available.

The MSTOP is now set when all of the clocks sharing it are disabled. With the MSTOP set, any master accessing a HW block that has MSTOP set will get sync abort. That wasn't the case previously, when the HW registers could have been accessed w/o generating such exeception.

Thank you,
Claudiu

Reply via email to