ccing the iSER maintianer.

Hey Or,

Check out the line:
[ 1404.996346] fmr_pool: fmr_create failed for FMR 184
[ 1404.996533] iser: iser_create_ib_conn_res:unable to alloc mem or create resource, err -12
[ 1405.246314] iser: iscsi_iser_ep_poll:ib conn ffff8801ef31e210 rc = -1
[ 1408.522342] iser: iscsi_iser_ep_disconnect:ib conn ffff8801ef31e210 state 4 [ 1408.522347] iser: iser_conn_terminate:Failed to disconnect, conn: 0xffff8801ef31e210 err -22 [ 1408.522351] iser: iser_free_ib_conn_res:freeing conn ffff8801ef31e210 cma_id ffff8801ef1f3400 fmr pool fffffffffffffff4 qp (null) [ 1408.522377] BUG: unable to handle kernel NULL pointer dereference at 000000000000004c [ 1408.522387] IP: [<ffffffffa03b6714>] ib_destroy_fmr_pool+0x24/0x120 [ib_core]
[ 1408.522404] PGD 1f0fdd067 PUD 1fcc6a067 PMD 0

Off the top of my head it looks like we could not allocate something then when handling the error we might be trying to use the null pointer, tight?





On 03/24/2011 07:15 AM, Степан Фёдоров wrote:
Hello!

I got kernel bug during logging on iSER targets:

[ 1408.522377] BUG: unable to handle kernel NULL pointer dereference
at 000000000000004c
[ 1408.522387] IP: [<ffffffffa03b6714>] ib_destroy_fmr_pool+0x24/0x120 [ib_core]
[ 1408.522404] PGD 1f0fdd067 PUD 1fcc6a067 PMD 0
[ 1408.522410] Oops: 0000 [#1] SMP
[ 1408.522415] last sysfs file:
/sys/devices/pci0000:00/0000:00:07.0/0000:03:00.0/host103/session99/iscsi_session/session99/ifacename
[ 1408.522421] CPU 0
[ 1408.522423] Modules linked in: sd_mod crc_t10dif sg iscsi_tcp
libiscsi_tcp usbbk gntdev netbk blkbk blkback_pagemap blktap domctl
xenbus_be xt_multiport ipt_addrtype xt_tcpudp iptable_filter ip_tables
x_tables bridge stp llc bonding rdma_ucm ib_iser(F) libiscsi(F)
scsi_transport_iscsi(F) rdma_cm iw_cm ib_addr dm_multipath scsi_dh
ib_srp scsi_transport_srp scsi_tgt ib_ipoib ib_cm ib_sa ipv6 ib_uverbs
ib_umad mlx4_ib ib_mad ib_core loop ext2 mbcache xenblk cdrom xennet
dm_mod brd af_packet bnx2 joydev ide_pci_generic ide_core usbhid hid
ata_generic ata_piix uhci_hcd rtc_cmos rtc_core rtc_lib ahci igb
serio_raw 8250_pnp 8250 serial_core tpm_tis tpm tpm_bios mlx4_core
ehci_hcd libata mpt2sas scsi_transport_sas raid_class pcspkr scsi_mod
i2c_i801 i2c_core iTCO_wdt iTCO_vendor_support usbcore ioatdma dca
thermal button processor thermal_sys hwmon
[ 1408.522520] Supported: No, Unsupported modules are loaded
[ 1408.522525] Pid: 7854, comm: iscsid Tainted: GF
2.6.32.27-0.2-xen #1 B8DT6
[ 1408.522530] RIP: e030:[<ffffffffa03b6714>]  [<ffffffffa03b6714>]
ib_destroy_fmr_pool+0x24/0x120 [ib_core]
[ 1408.522541] RSP: e02b:ffff8801f88f39c8  EFLAGS: 00010292
[ 1408.522545] RAX: 000000000000008b RBX: ffff8801ef31e210 RCX: ffffffff806aac40
[ 1408.522549] RDX: 0000000000000000 RSI: 00000000a707a707 RDI: fffffffffffffff4
[ 1408.522553] RBP: ffff8801f88ee940 R08: 00000000ffffffff R09: 0000000000000000
[ 1408.522557] R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000000
[ 1408.522561] R13: ffff8801f88f39c8 R14: fffffffffffffff4 R15: 0000000000000014
[ 1408.522569] FS:  00007f15763e8700(0000) GS:ffff88000800d000(0000)
knlGS:0000000000000000
[ 1408.522574] CS:  e033 DS: 0000 ES: 0000 CR0: 000000008005003b
[ 1408.522578] CR2: 000000000000004c CR3: 00000001f88e4000 CR4: 0000000000002660
[ 1408.522583] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[ 1408.522587] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
[ 1408.522592] Process iscsid (pid: 7854, threadinfo ffff8801f88f2000,
task ffff8801f34e0280)
[ 1408.522596] Stack:
[ 1408.522598]  ffff8801f88f39c8 ffff8801f88f39c8 0000000000000020
ffff8801ef31e210
[ 1408.522604]<0>  ffff8801f88ee940 ffff8801ef2c4810 ffffffffa05318e0
ffff8801ef2c4800
[ 1408.522611]<0>  0000000000000048 ffffffffa052b5c1 00000000ffffffff
ffff8801ef31e210
[ 1408.522619] Call Trace:
[ 1408.522646]  [<ffffffffa052b5c1>] iser_conn_release+0xa1/0x120 [ib_iser]
[ 1408.522658]  [<ffffffffa052c168>] iser_conn_terminate+0xa8/0xe0 [ib_iser]
[ 1408.522670]  [<ffffffffa0503d19>] iscsi_if_transport_ep+0xd9/0x140
[scsi_transport_iscsi]
[ 1408.522684]  [<ffffffffa0505449>] iscsi_if_recv_msg+0x239/0x4e0
[scsi_transport_iscsi]
[ 1408.522699]  [<ffffffffa050584a>] iscsi_if_rx+0x15a/0x1d0
[scsi_transport_iscsi]
[ 1408.522714]  [<ffffffff802d3082>] netlink_unicast+0x292/0x2a0
[ 1408.522721]  [<ffffffff802d3c26>] netlink_sendmsg+0x1f6/0x2e0
[ 1408.522729]  [<ffffffff802a1e5c>] sock_sendmsg+0xfc/0x140
[ 1408.522735]  [<ffffffff802a2087>] sys_sendmsg+0x1e7/0x370
[ 1408.522744]  [<ffffffff80007458>] system_call_fastpath+0x16/0x1b
[ 1408.522753]  [<00007f1575f4b910>] 0x7f1575f4b910
[ 1408.522756] Code: df e9 47 ff ff ff 90 41 57 41 56 49 89 fe 4d 8d
7e 20 41 55 41 54 45 31 e4 55 53 48 83 ec 18 48 89 24 24 48 89 64 24
08 49 89 e5<48>  8b 7f 58 e8 f3 08 ca df 4c 89 f7 e8 eb fd ff ff 49 8b
46 20
[ 1408.522804] RIP  [<ffffffffa03b6714>]
ib_destroy_fmr_pool+0x24/0x120 [ib_core]
[ 1408.522814]  RSP<ffff8801f88f39c8>
[ 1408.522816] CR2: 000000000000004c
[ 1408.522820] ---[ end trace f4d760ae3a0cabb1 ]---


My system is SLES11 SP1 with 2.6.32.27-0.2-xen kernel, with 8Gb dom0
memory in first test, and 2Gb in second, and 10Gb in third. Changing
memory size of dom0 to 2Gb not affects initiator behaviour - in both -
first and second cases, error happens during logging on target 115.
After changing dom0 memory to 10Gb - error happens during logging on
target 123.

Steps to reproduce:
1. On iSER target node create more than 123 iSER targets with one LUN
per target.
2. On initiator side login to this iSER targets.

Full dmesg supplied in attachement to this message.

Thank you.


--
You received this message because you are subscribed to the Google Groups 
"open-iscsi" group.
To post to this group, send email to open-iscsi@googlegroups.com.
To unsubscribe from this group, send email to 
open-iscsi+unsubscr...@googlegroups.com.
For more options, visit this group at 
http://groups.google.com/group/open-iscsi?hl=en.

Reply via email to