G'day, I've got a OpenSolaris server n95, that I use for media, serving. It's uses a DQ35JOE motherboard, dual core, and I have my rpool mirrored on two IDE 40GB drives, and my media mirrored on 2 x 500GB SATA drives.
I've got a few CIFS shares on the media drive, and I'm using MediaTomb to stream to my PS3. No problems at all, until today. I was at work (obviously not working too hard :) ), when I thought that I really should scrub my pools, since I hasn't done it for awhile. So I SSHed into the box, and did a scrub on both pools. A few minutes later, I lost my SSH connection... uh oh, but not too worried, I thought that the ADSL must've gone down or something. Came home, and the server is in a reboot loop, kernel panic. Nuts... Booted into the LiveDVD of snv_95, no problem, set about scrubbing my rpool, everything is good, until I decide to import and start scrubbing my storage pool... kernel panic... Nuts... Removed the storage pool drives from the machine, no problem, boots up fine and starts scrubbing the rpool again. No problems. Decided to more the storage drives over to my desktop machine, try to import.... kernel panic... So, the trick is, how do I fix it? I've read a few posts, and I've seen other people with similar problems, but I have to admit I'm simply not smart enough to solve the problem, so, anyone got any ideas? Here's some info that I hope prove useful. [EMAIL PROTECTED]:~/Desktop$ pfexec zpool import pool: storage id: 6933883927787501942 state: ONLINE status: The pool is formatted using an older on-disk version. action: The pool can be imported using its name or numeric identifier, though some features will not be available without an explicit 'zpool upgrade'. config: storage ONLINE mirror ONLINE c3t3d0 ONLINE c3t2d0 ONLINE [EMAIL PROTECTED]:~/Desktop$ zdb -uuu -e storage Uberblock magic = 0000000000bab10c version = 10 txg = 3818020 guid_sum = 6700303293925244073 timestamp = 1220003402 UTC = Fri Aug 29 17:50:02 2008 rootbp = [L0 DMU objset] 400L/200P DVA[0]=<0:6a00058e00:200> DVA[1]=<0:20000a8600:200> DVA[2]=<0:3800050600:200> fletcher4 lzjb LE contiguous birth=3818020 fill=170 cksum=8b56cdef9:38379d3cd95:b809c1c9bb15:197649b024bfd1 [EMAIL PROTECTED]:~/Desktop$ zdb -e -bb storage Traversing all blocks to verify nothing leaked ... No leaks (block sum matches space maps exactly) bp count: 3736040 bp logical: 484538716672 avg: 129693 bp physical: 484064542720 avg: 129566 compression: 1.00 bp allocated: 484259193344 avg: 129618 compression: 1.00 SPA allocated: 484259193344 used: 97.20% Blocks LSIZE PSIZE ASIZE avg comp %Total Type 105 1.11M 339K 1017K 9.7K 3.35 0.00 deferred free 2 32K 4K 12.0K 6.00K 8.00 0.00 object directory 2 1K 1K 3.00K 1.50K 1.00 0.00 object array 1 16K 1.50K 4.50K 4.50K 10.67 0.00 packed nvlist - - - - - - - packed nvlist size 1 16K 3.00K 9.00K 9.00K 5.33 0.00 bplist - - - - - - - bplist header - - - - - - - SPA space map header 373 2.14M 801K 2.35M 6.44K 2.73 0.00 SPA space map 3 40.0K 40.0K 40.0K 13.3K 1.00 0.00 ZIL intent log 552 8.62M 2.40M 4.82M 8.94K 3.60 0.00 DMU dnode 8 8K 4K 8.50K 1.06K 2.00 0.00 DMU objset - - - - - - - DSL directory 8 4K 4K 12.0K 1.50K 1.00 0.00 DSL directory child map 7 3.50K 3.50K 10.5K 1.50K 1.00 0.00 DSL dataset snap map 15 225K 25.0K 75.0K 5.00K 8.98 0.00 DSL props - - - - - - - DSL dataset - - - - - - - ZFS znode - - - - - - - ZFS V0 ACL 3.56M 451G 451G 451G 127K 1.00 100.00 ZFS plain file 1.55K 9.9M 1.51M 3.03M 1.95K 6.55 0.00 ZFS directory 7 3.50K 3.50K 7.00K 1K 1.00 0.00 ZFS master node 40 550K 87.0K 174K 4.35K 6.32 0.00 ZFS delete queue - - - - - - - zvol object - - - - - - - zvol prop - - - - - - - other uint8[] - - - - - - - other uint64[] 1 512 512 1.50K 1.50K 1.00 0.00 other ZAP - - - - - - - persistent error log 1 128K 10.0K 30.0K 30.0K 12.80 0.00 SPA history - - - - - - - SPA history offsets - - - - - - - Pool properties - - - - - - - DSL permissions 107 53.5K 53.5K 107K 1K 1.00 0.00 ZFS ACL - - - - - - - ZFS SYSACL 4 64K 4K 8K 2K 16.00 0.00 FUID table - - - - - - - FUID table size - - - - - - - DSL dataset next clones - - - - - - - scrub work queue 3.56M 451G 451G 451G 127K 1.00 100.00 Total I've checked my /var/adm/messages file, and found the following: Aug 29 17:37:29 asmodeus unix: [ID 836849 kern.notice] Aug 29 17:37:29 asmodeus ^Mpanic[cpu2]/thread=ffffff00087f0c80: Aug 29 17:37:29 asmodeus genunix: [ID 335743 kern.notice] BAD TRAP: type=e (#pf Page fault) rp=ffffff00087effc0 addr=2a0 occ urred in module "unix" due to a NULL pointer dereference Aug 29 17:37:29 asmodeus unix: [ID 100000 kern.notice] Aug 29 17:37:29 asmodeus unix: [ID 839527 kern.notice] sched: Aug 29 17:37:29 asmodeus unix: [ID 753105 kern.notice] #pf Page fault Aug 29 17:37:29 asmodeus unix: [ID 532287 kern.notice] Bad kernel fault at addr=0x2a0 Aug 29 17:37:29 asmodeus unix: [ID 243837 kern.notice] pid=0, pc=0xfffffffffb842a1b, sp=0xffffff00087f00b8, eflags=0x10246 Aug 29 17:37:29 asmodeus unix: [ID 211416 kern.notice] cr0: 8005003b<pg,wp,ne,et,ts,mp,pe> cr4: 6f8<xmme,fxsr,pge,mce,pae,ps e,de> Aug 29 17:37:29 asmodeus unix: [ID 624947 kern.notice] cr2: 2a0 Aug 29 17:37:29 asmodeus unix: [ID 625075 kern.notice] cr3: 3400000 Aug 29 17:37:29 asmodeus unix: [ID 625715 kern.notice] cr8: c Aug 29 17:37:29 asmodeus unix: [ID 100000 kern.notice] Aug 29 17:37:29 asmodeus unix: [ID 592667 kern.notice] rdi: 2a0 rsi: 4 rdx: ffffff00087f0c80 Aug 29 17:37:29 asmodeus unix: [ID 592667 kern.notice] rcx: 2 r8: 1d0 r9: ff000000ff00 Aug 29 17:37:29 asmodeus unix: [ID 592667 kern.notice] rax: 0 rbx: 4 rbp: ffffff00087f0110 Aug 29 17:37:29 asmodeus unix: [ID 592667 kern.notice] r10: 43 r11: 1d0c0 r12: 2a0 Aug 29 17:37:29 asmodeus unix: [ID 592667 kern.notice] r13: 0 r14: 0 r15: ffffff01db281800 Aug 29 17:37:29 asmodeus unix: [ID 592667 kern.notice] fsb: 0 gsb: ffffff01caa58580 ds: 4b Aug 29 17:37:29 asmodeus unix: [ID 592667 kern.notice] es: 4b fs: 0 gs: 1c3 Aug 29 17:37:29 asmodeus unix: [ID 592667 kern.notice] trp: e err: 2 rip: fffffffffb842a1b Aug 29 17:37:29 asmodeus unix: [ID 592667 kern.notice] cs: 30 rfl: 10246 rsp: ffffff00087f00b8 Aug 29 17:37:29 asmodeus unix: [ID 266532 kern.notice] ss: 38 Aug 29 17:37:29 asmodeus unix: [ID 100000 kern.notice] Aug 29 17:37:29 asmodeus genunix: [ID 655072 kern.notice] ffffff00087efea0 unix:die+c8 () Aug 29 17:37:29 asmodeus genunix: [ID 655072 kern.notice] ffffff00087effb0 unix:trap+13b9 () Aug 29 17:37:29 asmodeus genunix: [ID 655072 kern.notice] ffffff00087effc0 unix:cmntrap+e9 () Aug 29 17:37:29 asmodeus genunix: [ID 655072 kern.notice] ffffff00087f0110 unix:mutex_enter+b () Aug 29 17:37:29 asmodeus genunix: [ID 655072 kern.notice] ffffff00087f0130 zfs:zio_buf_alloc+28 () Aug 29 17:37:29 asmodeus genunix: [ID 655072 kern.notice] ffffff00087f0170 zfs:zio_read_init+49 () Aug 29 17:37:29 asmodeus genunix: [ID 655072 kern.notice] ffffff00087f01a0 zfs:zio_execute+7f () Aug 29 17:37:29 asmodeus genunix: [ID 655072 kern.notice] ffffff00087f01e0 zfs:zio_wait+2e () Aug 29 17:37:29 asmodeus genunix: [ID 655072 kern.notice] ffffff00087f0290 zfs:arc_read_nolock+739 () Aug 29 17:37:29 asmodeus genunix: [ID 655072 kern.notice] ffffff00087f0330 zfs:arc_read+7d () Aug 29 17:37:29 asmodeus genunix: [ID 655072 kern.notice] ffffff00087f0460 zfs:scrub_visitbp+141 () Aug 29 17:37:29 asmodeus genunix: [ID 655072 kern.notice] ffffff00087f0570 zfs:scrub_visitbp+1bd () Aug 29 17:37:29 asmodeus genunix: [ID 655072 kern.notice] ffffff00087f0680 zfs:scrub_visitbp+42c () Aug 29 17:37:29 asmodeus genunix: [ID 655072 kern.notice] ffffff00087f0790 zfs:scrub_visitbp+1bd () Aug 29 17:37:29 asmodeus genunix: [ID 655072 kern.notice] ffffff00087f08a0 zfs:scrub_visitbp+2ea () Aug 29 17:37:29 asmodeus genunix: [ID 655072 kern.notice] ffffff00087f08f0 zfs:scrub_visit_rootbp+4e () Aug 29 17:37:29 asmodeus genunix: [ID 655072 kern.notice] ffffff00087f0aa0 zfs:dsl_pool_scrub_sync+12c () Aug 29 17:37:29 asmodeus genunix: [ID 655072 kern.notice] ffffff00087f0b10 zfs:dsl_pool_sync+158 () Aug 29 17:37:29 asmodeus genunix: [ID 655072 kern.notice] ffffff00087f0bb0 zfs:spa_sync+254 () Aug 29 17:37:29 asmodeus genunix: [ID 655072 kern.notice] ffffff00087f0c60 zfs:txg_sync_thread+226 () Aug 29 17:37:29 asmodeus genunix: [ID 655072 kern.notice] ffffff00087f0c70 unix:thread_start+8 () Aug 29 17:37:29 asmodeus unix: [ID 100000 kern.notice] Aug 29 17:37:29 asmodeus genunix: [ID 672855 kern.notice] syncing file systems... Aug 29 17:37:29 asmodeus genunix: [ID 904073 kern.notice] done Aug 29 17:37:30 asmodeus genunix: [ID 111219 kern.notice] dumping to /dev/dsk/c3t0d0s1, offset 429391872, content: kernel Aug 29 17:37:30 asmodeus ahci: [ID 405573 kern.info] NOTICE: ahci0: ahci_tran_reset_dport port 0 reset port Aug 29 17:37:33 asmodeus genunix: [ID 409368 kern.notice] ^M100% done: 120113 pages dumped, compression ratio 3.77, Aug 29 17:37:33 asmodeus genunix: [ID 851671 kern.notice] dump succeeded Any help would be appreciated. -- This message posted from opensolaris.org _______________________________________________ zfs-discuss mailing list zfs-discuss@opensolaris.org http://mail.opensolaris.org/mailman/listinfo/zfs-discuss