On 27-Dec-06, at 5:00 PM, Bill Sommerfeld wrote:

On Wed, 2006-12-27 at 14:09 -0800, Siegfried Nikolaivich wrote:
I have been stuck trying to unzip a file over the network. Each time I try, the server machine slows down to a crawl until it is frozen, even the
ethernet links go down.

by any chance, is this an NFS-exported ZFS filesystem from a SPARC
system with ATA disks?    this may be a known bug:

This is indeed an NFS-exported ZFS filesystem, but it's on x86 with SATA disks using the marvell chipset. The problem also happens on UFS slices with CIFS.

Are there any DTrace scripts I can run to show me where the machine is
freezing?

lockstat's kernel profiling would be the first thing I'd use (this is in
some sense a "dtrace script" as lockstat is now built on top of dtrace
mechanisms)

Thanks, I've tried to analyze the results, but haven't been able to exactly find the culprit. It seems the system is mostly idle while it freezes. Could it be the underlying PCI or I/O subsystems? I did notice that the ethernet links go down/up while transferring the file (s).

In case anyone's interested, here is most of the output (shortened a bit):

Profiling interrupt: 3884 events in 20.024 seconds (194 events/sec)

------------------------------------------------------------------------ -------
Count indv cuml rcnt     nsec CPU+PIL                Caller
1935 50% 50% 0.00 9533201069173428 cpu[1] cpu_halt +0xb2

      nsec ------ Time Distribution ------ count     Stack
                                                     idle+0x89
                                                     thread_start+0x8
------------------------------------------------------------------------ -------
Count indv cuml rcnt     nsec CPU+PIL                Caller
1919  49%  99% 0.00     8968 cpu[0]                 cpu_halt+0xb2

      nsec ------ Time Distribution ------ count     Stack
      1024 |                               55        idle+0x89
      2048 |@@                             163       thread_start+0x8
      4096 |@@@@                           280
      8192 |@@@@@@@@                       515
     16384 |@@@@@@@@@@                     649
     32768 |@@@@                           257
------------------------------------------------------------------------ -------
Count indv cuml rcnt     nsec CPU+PIL                Caller
    2   0%  99% 0.00    13745 cpu[0]+6               gethrtime+0xb

      nsec ------ Time Distribution ------ count     Stack
16384 |@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@ 2 bge_chip_poll_engine+0x2e bge_chip_disable_engine+0x50
                                                     bge_chip_stop+0xca
bge_chip_factotum+0x108 av_dispatch_softvect+0x62
                                                     dosoftint+0x32
------------------------------------------------------------------------ -------
Count indv cuml rcnt     nsec CPU+PIL                Caller
    2   0%  99% 0.00    10151 cpu[0]+6               intr_restore+0x2

      nsec ------ Time Distribution ------ count     Stack
     16384 |@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@ 2         gethrtime+0xa
                                                     drv_usecwait+0x7a
bge_chip_poll_engine+0x2e bge_chip_disable_engine+0x50
                                                     bge_chip_stop+0xca
bge_chip_factotum+0x108 av_dispatch_softvect+0x62
                                                     dosoftint+0x32
------------------------------------------------------------------------ -------
Count indv cuml rcnt     nsec CPU+PIL                Caller
    1   0%  99% 0.00     6020 cpu[0]+11              apic_send_ipi+0x73

      nsec ------ Time Distribution ------ count     Stack
      8192 |@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@ 1         poke_cpu+0x1c
                                                     wakeup_cpu+0x77
                                                     setbackdq+0x1a9
sleepq_wakeone_chan+0x68
                                                     cv_signal+0xa6
taskq_bucket_dispatch+0x95 taskq_dispatch +0x262 qenable_locked +0xdf
                                                     backenable+0xac
------------------------------------------------------------------------ -------
Count indv cuml rcnt     nsec CPU+PIL                Caller
    1   0%  99% 0.00    17370 cpu[0]+6               copyb+0x72

      nsec ------ Time Distribution ------ count     Stack
     32768 |@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@ 1         copymsg+0x21
                                                     copymsgchain+0x28
i_dls_link_ether_rx+0x16b
                                                     mac_rx+0x46
                                                     aggr_recv_cb+0xe2
                                                     mac_rx+0x46
                                                     bge_receive+0x98
                                                     bge_intr+0xaf
av_dispatch_autovect+0x78 ------------------------------------------------------------------------ -------
Count indv cuml rcnt     nsec CPU+PIL                Caller
    1   0%  99% 0.00     9691 cpu[0]+11              disp+0x1ac

      nsec ------ Time Distribution ------ count     Stack
     16384 |@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@ 1         swtch+0xa3
                                                     cv_wait+0x68
                                                     taskq_thread+0x14f
                                                     thread_start+0x8
------------------------------------------------------------------------ -------
Count indv cuml rcnt     nsec CPU+PIL                Caller
    1   0%  99% 0.00     7722 cpu[0]+6               bcopy+0xa

      nsec ------ Time Distribution ------ count     Stack
8192 |@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@ 1 bge_receive_ring +0x62
                                                     bge_receive+0x71
                                                     bge_intr+0xaf
av_dispatch_autovect+0x78
                                                     intr_thread+0x50
------------------------------------------------------------------------ -------
Count indv cuml rcnt     nsec CPU+PIL                Caller
    1   0%  99% 0.00     2803 cpu[0]+6               mutex_enter+0x10

      nsec ------ Time Distribution ------ count     Stack
4096 |@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@ 1 dld_str_rx_fastpath+0x24 i_dls_link_ether_rx+0x183
                                                     mac_rx+0x46
                                                     aggr_recv_cb+0xe2
                                                     mac_rx+0x46
                                                     bge_receive+0x98
                                                     bge_intr+0xaf
av_dispatch_autovect+0x78
                                                     intr_thread+0x50
------------------------------------------------------------------------ -------
Count indv cuml rcnt     nsec CPU+PIL                Caller
1 0% 100% 0.00 1133 cpu[0]+5 mv_crpb_done_intr+0x150

      nsec ------ Time Distribution ------ count     Stack
      2048 |@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@ 1         mv_intr+0xca
av_dispatch_autovect+0x78
                                                     intr_thread+0x50
------------------------------------------------------------------------ -------
Count indv cuml rcnt     nsec CPU+PIL                Caller
1 0% 100% 0.00 13791 cpu[0]+5 mv_crpb_done_intr+0xc9

      nsec ------ Time Distribution ------ count     Stack
16384 |@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@ 1 mv_subctrl_intr +0xd9
                                                     mv_intr+0xca
av_dispatch_autovect+0x78
                                                     intr_thread+0x50
------------------------------------------------------------------------ -------
Count indv cuml rcnt     nsec CPU+PIL                Caller
    1   0% 100% 0.00     7105 cpu[0]+4               mul32+0x6

      nsec ------ Time Distribution ------ count     Stack
      8192 |@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@ 1         gethrtime+0xa
                                                     drv_usecwait+0x7a
bge_chip_poll_engine+0x2e bge_chip_disable_engine+0x50
                                                     bge_chip_stop+0xca
bge_chip_reset +0x2d4
                                                     bge_reset+0xba
                                                     bge_restart+0x4c
bge_chip_factotum+0x140 ------------------------------------------------------------------------ -------
Count indv cuml rcnt     nsec CPU+PIL                Caller
    1   0% 100% 0.00    21861 cpu[0]+6               bge_send+0x21b

      nsec ------ Time Distribution ------ count     Stack
     32768 |@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@ 1         bge_m_tx+0x6d
                                                     aggr_m_tx+0x65
                                                     dls_tx+0xe
str_mdata_fastpath_put+0x1f tcp_send_data +0x650 tcp_rput_data +0x1a95 squeue_enter_chain+0x16e
                                                     ip_input+0x18c
                                                     ip_rput+0xd9
------------------------------------------------------------------------ -------
Count indv cuml rcnt     nsec CPU+PIL                Caller
    1   0% 100% 0.00    12419 cpu[0]+6               pfrput+0x65

      nsec ------ Time Distribution ------ count     Stack
     16384 |@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@ 1         putnext+0x1f1
dld_str_rx_unitdata+0x2f i_dls_link_ether_rx+0x153
                                                     mac_rx+0x46
                                                     aggr_recv_cb+0xe2
                                                     mac_rx+0x46
                                                     bge_receive+0x98
                                                     bge_intr+0xaf
av_dispatch_autovect+0x78 ------------------------------------------------------------------------ -------
Count indv cuml rcnt     nsec CPU+PIL                Caller
1 0% 100% 0.00 12202 cpu[0]+6 tsc_gethrtime_delta+0x24

      nsec ------ Time Distribution ------ count     Stack
     16384 |@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@ 1         gethrtime+0xa
                                                     drv_usecwait+0x7a
bge_chip_poll_engine+0x2e bge_chip_disable_engine+0x50
                                                     bge_chip_stop+0xca
bge_chip_factotum+0x108 av_dispatch_softvect+0x62
                                                     dosoftint+0x32
------------------------------------------------------------------------ -------
Count indv cuml rcnt     nsec CPU+PIL                Caller
1 0% 100% 0.00 753 cpu[0]+6 tsc_gethrtime_delta+0x41

      nsec ------ Time Distribution ------ count     Stack
      1024 |@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@ 1         gethrtime+0xa
                                                     drv_usecwait+0x7a
bge_chip_poll_engine+0x2e bge_chip_disable_engine+0x50
                                                     bge_chip_stop+0xca
bge_chip_factotum+0x108 av_dispatch_softvect+0x62
                                                     dosoftint+0x32
------------------------------------------------------------------------ -------
Count indv cuml rcnt     nsec CPU+PIL                Caller
1 0% 100% 0.00 18446744073707043264 cpu[1] anon_decref+0x27

      nsec ------ Time Distribution ------ count     Stack
                                                     anon_free+0x6d
                                                     segvn_free+0x1c1
                                                     seg_free+0x3f
                                                     segvn_unmap+0x54b
                                                     as_free+0xac
                                                     relvm+0x1f7
                                                     exec_args+0x1c5
                                                     elf32exec+0x34c
                                                     gexec+0x208
------------------------------------------------------------------------ -------
Count indv cuml rcnt     nsec CPU+PIL                Caller
1 0% 100% 0.00 18446744073707042749 cpu[1] kmem_cache_alloc+0x44

      nsec ------ Time Distribution ------ count     Stack
                                                     kmem_zalloc+0x3b
sata_pkt_alloc +0x22 sata_scsi_init_pkt+0x143
                                                     scsi_init_pkt+0x44
sd_initpkt_for_uscsi+0x75
                                                     sd_start_cmds+0xa5
sd_core_iostart +0x87 sd_uscsi_strategy+0xd6 sd_send_scsi_SYNCHRONIZE_CACHE+0x10f ------------------------------------------------------------------------ -------
Count indv cuml rcnt     nsec CPU+PIL                Caller
1 0% 100% 0.00 18446744073707042811 cpu[1] rw_enter+0x2b

      nsec ------ Time Distribution ------ count     Stack
                                                     thread_start+0x8
------------------------------------------------------------------------ -------
Count indv cuml rcnt     nsec CPU+PIL                Caller
1 0% 100% 0.00 18446744073707043769 cpu[1] rw_write_held+0x3

      nsec ------ Time Distribution ------ count     Stack
                                                     dnode_hold+0x14
dmu_bonus_hold +0x2e
                                                     zfs_zget+0x54
                                                     zfs_vget+0x12c
                                                     fsop_vget+0xb
                                                     nfs3_fhtovp+0x69
                                                     rfs3_write+0x6e
common_dispatch +0x5b8
                                                     rfs_dispatch+0x21
------------------------------------------------------------------------ -------
Count indv cuml rcnt     nsec CPU+PIL                Caller
1 0% 100% 0.00 18446744073707044453 cpu[1] (usermode)

      nsec ------ Time Distribution ------ count     Stack
------------------------------------------------------------------------ -------
Count indv cuml rcnt     nsec CPU+PIL                Caller
1 0% 100% 0.00 18446744073707042731 cpu[1] kcopy+0x2c

      nsec ------ Time Distribution ------ count     Stack
                                                     uiomove+0x93
                                                     struiocopyout+0xa1
                                                     kstrgetmsg+0x4d6
sotpi_recvmsg +0x253
                                                     socktpi_read+0x84
                                                     fop_read+0x31
                                                     read+0x188
                                                     read32+0xe
sys_syscall32 +0x101 ------------------------------------------------------------------------ -------
Count indv cuml rcnt     nsec CPU+PIL                Caller
    1   0% 100% 0.00    10816 cpu[0]                 hat_unlock+0x9

      nsec ------ Time Distribution ------ count     Stack
16384 |@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@ 1 segmap_pageunlock+0xfa
                                                     wrip+0x44d
                                                     ufs_write+0x211
                                                     fop_write+0x31
                                                     pwrite64+0x198
sys_syscall32 +0x101 ------------------------------------------------------------------------ -------
Count indv cuml rcnt     nsec CPU+PIL                Caller
1 0% 100% 0.00 18446744073707042246 cpu[1]+11 lock_set+0x4

      nsec ------ Time Distribution ------ count     Stack
                                                     cv_block+0x96
                                                     cv_timedwait+0x7c
taskq_d_thread +0xc5
                                                     thread_start+0x8
------------------------------------------------------------------------ -------
Count indv cuml rcnt     nsec CPU+PIL                Caller
    1   0% 100% 0.00     1231 cpu[0]                 (usermode)

      nsec ------ Time Distribution ------ count     Stack
      2048 |@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@ 1
------------------------------------------------------------------------ -------
Count indv cuml rcnt     nsec CPU+PIL                Caller
    1   0% 100% 0.00     3528 cpu[0]                 deltamap_add+0x19f

      nsec ------ Time Distribution ------ count     Stack
      4096 |@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@ 1         top_delta+0x61
                                                     bmap_write+0xd88
                                                     wrip+0x759
                                                     ufs_write+0x211
                                                     fop_write+0x31
                                                     pwrite64+0x198
sys_syscall32 +0x101 ------------------------------------------------------------------------ -------
Count indv cuml rcnt     nsec CPU+PIL                Caller
1 0% 100% 0.00 13765 cpu[0]+4 tsc_gethrtime_delta+0x24

      nsec ------ Time Distribution ------ count     Stack
     16384 |@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@ 1         gethrtime+0xa
                                                     drv_usecwait+0x7a
bge_poll_firmware+0x32 bge_chip_reset +0x172
                                                     bge_reset+0xba
                                                     bge_restart+0x4c
bge_chip_factotum+0x140 av_dispatch_softvect+0x62
                                                     dosoftint+0x32
------------------------------------------------------------------------ -------
Count indv cuml rcnt     nsec CPU+PIL                Caller
    1   0% 100% 0.00    12988 cpu[0]+4               intr_restore+0x2

      nsec ------ Time Distribution ------ count     Stack
     16384 |@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@ 1         gethrtime+0xa
                                                     drv_usecwait+0x7a
bge_chip_poll_engine+0x2e bge_chip_disable_engine+0x50
                                                     bge_chip_stop+0xca
bge_chip_reset +0x2d4
                                                     bge_reset+0xba
                                                     bge_restart+0x4c
bge_chip_factotum+0x140 ------------------------------------------------------------------------ -------



Siegfried

_______________________________________________
networking-discuss mailing list
[email protected]

Reply via email to