Thanks for responding!
# mdb 53
Loading modules: [ unix genunix specfs dtrace mac cpu.generic uppc pcplusmp
scsi_vhci zfs mpt sd ip hook neti sockfs arp usba stmf stmf_sbd lofs
mpt_sas random idm cpc crypto smbsrv nfs ufs nsmb ipc ]
> ::status
debugging crash dump vmcore.53 (64-bit) from OmniOS
operating system: 5.11 omnios-b281e50 (i86pc)
image uuid: ffb45a23-f199-e8d5-e3cb-ab9cf90bca68
panic message:
BAD TRAP: type=e (#pf Page fault) rp=ffffff001fadf970 addr=20 occurred in
module "unix" due to
a NULL pointer dereference
dump content: kernel pages only
> $C
ffffff001fadfb60 mutex_enter+0xb()
ffffff001fadfc20 taskq_thread+0x2d0(ffffff04ec9004b8)
ffffff001fadfc30 thread_start+8()
> $r
%rax = 0x0000000000000000 %r9 = 0x0000000000000078
%rbx = 0x0000000000000000 %r10 = 0xfffffffffbcf7910
sleepq_head+0x4e10
%rcx = 0x0000000000000000 %r11 = 0xffffff001fadfc40
%rdx = 0xffffff001fadfc40 %r12 = 0x0000000000000000
%rsi = 0xffffff04ea7e1588 %r13 = 0x0000000000000001
%rdi = 0x0000000000000020 %r14 = 0x0000000000000020
%r8 = 0xffffff04e760d8c0 %r15 = 0x0000000000000000
%rip = 0xfffffffffb85ec8b mutex_enter+0xb
%rbp = 0xffffff001fadfb60
%rsp = 0xffffff001fadfa68
%rflags = 0x00010246
id=0 vip=0 vif=0 ac=0 vm=0 rf=1 nt=0 iopl=0x0
status=<of,df,IF,tf,sf,ZF,af,PF,cf>
%cs = 0x0030 %ds = 0x004b %es = 0x004b
%trapno = 0xe %fs = 0x0000 %gs = 0x01c3
%err = 0x2
On Mon, Oct 7, 2013 at 1:43 PM, Robert Mustacchi <[email protected]> wrote:
> On 10/7/13 13:25 , Travis LaDuke wrote:
> > Hi,
> > Looking at crash dumps on unix is new territory for me. I have this
> > OmniOS-stable VM on esxi, and it's having kernel panics and rebooting 2
> - 3
> > times a day. I'm guessing it's bad RAM, but I haven't been able to bring
> > the machine down to test yet.
> > Sorry if I'm posting on the wrong list. I'm mostly curious what other
> > info/output it would be useful to look at and if this output below says
> > anything obvious about what the problem is. Or what else I can do to
> test.
>
> Hey Travis,
>
> So it's a bit hard to say from just that what it could be. That stack
> trace doesn't immediately stand out to me. What would help here is to
> open up the dump in mdb and git a little bit more information such as
> the arguments to the functions and the taskq in question.
>
> To start off what I might run is something like this:
>
> cd /var/crash/volatile
>
> In there you should see files with the name of vmdump.%d. So if you have
> a vmdump.0, you'd run `savecore -fv vmdump.0 .` and then `mdb 0`. If the
> trailing digit is different, just replace that everywhere. Once you do
> that you should have a prompt for mdb. To start with run the following:
>
> > ::status
> ... output ...
> > $C
> ... output ...
> > $r
> ... output ...
> > $q
>
> That last one will cause you to exit.
>
> Robert
>
> > # cat fmdump.txt
> > TIME UUID
> > SUNW-MSG-ID
> > Oct 04 2013 17:21:54.666705000 622961c5-bb0f-6890-929d-cbad7b19385b
> > SUNOS-8000-KL
> >
> > TIME CLASS ENA
> > Oct 04 17:21:54.6631 ireport.os.sunos.panic.dump_available
> > 0x0000000000000000
> > Oct 04 17:21:50.6898 ireport.os.sunos.panic.dump_pending_on_device
> > 0x0000000000000000
> >
> > nvlist version: 0
> > version = 0x0
> > class = list.suspect
> > uuid = 622961c5-bb0f-6890-929d-cbad7b19385b
> > code = SUNOS-8000-KL
> > diag-time = 1380932514 663452
> > de = fmd:///module/software-diagnosis
> > fault-list-sz = 0x1
> > fault-list = (array of embedded nvlists)
> > (start fault-list[0])
> > nvlist version: 0
> > version = 0x0
> > class = defect.sunos.kernel.panic
> > certainty = 0x64
> > asru =
> > sw:///:path=/var/crash/unknown/.622961c5-bb0f-6890-929d-cbad7b19385b
> > resource =
> > sw:///:path=/var/crash/unknown/.622961c5-bb0f-6890-929d-cbad7b19385b
> > savecore-succcess = 1
> > dump-dir = /var/crash/unknown
> > dump-files = vmdump.45
> > os-instance-uuid = 622961c5-bb0f-6890-929d-cbad7b19385b
> > panicstr = BAD TRAP: type=e (#pf Page fault)
> > rp=ffffff001fac9970 addr=20 occurred in module "unix" due to a NULL
> pointer
> > dereference
> > panicstack = unix:die+df () | unix:trap+db3 () |
> > unix:cmntrap+e6 () | unix:mutex_enter+b () | genunix:taskq_thread+2d0 ()
> |
> > unix:thread_start+8 () |
> > crashtime = 1380932428
> > panic-time = Fri Oct 4 17:20:28 2013 PDT
> > (end fault-list[0])
> >
> > fault-status = 0x1
> > severity = Major
> > __ttl = 0x1
> > __tod = 0x524f5ba2 0x27bd1c68
> >
> >
> >
> > -------------------------------------------
> > illumos-discuss
> > Archives: https://www.listbox.com/member/archive/182180/=now
> > RSS Feed:
> https://www.listbox.com/member/archive/rss/182180/21175748-a2cc1e82
> > Modify Your Subscription:
> https://www.listbox.com/member/?&
> > Powered by Listbox: http://www.listbox.com
> >
>
>
-------------------------------------------
illumos-discuss
Archives: https://www.listbox.com/member/archive/182180/=now
RSS Feed: https://www.listbox.com/member/archive/rss/182180/21175430-2e6923be
Modify Your Subscription:
https://www.listbox.com/member/?member_id=21175430&id_secret=21175430-6a77cda4
Powered by Listbox: http://www.listbox.com