Re: Panic in stable/11 (amd64) @r303903: page fault while in kernel mode

2016-08-10 Thread Bryan Drewery
On 8/10/16 9:54 AM, David Wolfskill wrote:
> Happened after a few iterations of {"pkill dhclient" followed by
> "dhclient wlan0"}.
> 
> Gory details (both "normal" and gzipped, and including the crash
> dump and crashinfo) are in
> .
> 
> Summary:
> Wed Aug 10 15:56:26 UTC 2016
> 
> FreeBSD  11.0-BETA4 FreeBSD 11.0-BETA4 #69  r303902M/303903:1100120: Wed Aug 
> 10 04:00:09 PDT 2016 
> r...@g1-252.catwhisker.org:/common/S3/obj/usr/src/sys/CANARY  amd64
> 
> panic: page fault
> 
> GNU gdb 6.1.1 [FreeBSD]
> Copyright 2004 Free Software Foundation, Inc.
> GDB is free software, covered by the GNU General Public License, and you are
> welcome to change it and/or distribute copies of it under certain conditions.
> Type "show copying" to see the conditions.
> There is absolutely no warranty for GDB.  Type "show warranty" for details.
> This GDB was configured as "amd64-marcel-freebsd"...
> 
> Unread portion of the kernel message buffer:
> 
> 
> Fatal trap 12: page fault while in kernel mode
> cpuid = 7; apic id = 07
> fault virtual address = 0x0
> fault code= supervisor read data, page not present
> instruction pointer   = 0x20:0x80bdaaa1
> stack pointer = 0x28:0xfe060bc956e0
> frame pointer = 0x28:0xfe060bc957b0
> code segment  = base 0x0, limit 0xf, type 0x1b
>   = DPL 0, pres 1, long 1, def32 0, gran 1
> processor eflags  = interrupt enabled, resume, IOPL = 0
> current process   = 20685 (wpa_supplicant)
> trap number   = 12
> panic: page fault
> cpuid = 7
> KDB: stack backtrace:
> #0 0x80add787 at kdb_backtrace+0x67
> #1 0x80a950e2 at vpanic+0x182
> #2 0x80a94f53 at panic+0x43
> #3 0x80eead51 at trap_fatal+0x351
> #4 0x80eeaf43 at trap_pfault+0x1e3
> #5 0x80eea4ec at trap+0x26c
> #6 0x80ece0d1 at calltrap+0x8
> #7 0x80b9811c at ifioctl+0x133c
> #8 0x80afc914 at kern_ioctl+0x2d4
> #9 0x80afc5d1 at sys_ioctl+0x171
> #10 0x80eeb6c9 at amd64_syscall+0x4e9
> #11 0x80ece3bb at Xfast_syscall+0xfb
> Uptime: 3h0m4s
> ...
> Reading symbols from /boot/kernel/linux64.ko...Reading symbols from 
> /usr/lib/debug//boot/kernel/linux64.ko.debug...done.
> done.
> Loaded symbols for /boot/kernel/linux64.ko
> #0  doadump (textdump=) at pcpu.h:221
> 221   pcpu.h: No such file or directory.
>   in pcpu.h
> (kgdb) #0  doadump (textdump=) at pcpu.h:221
> #1  0x80a94b69 in kern_reboot (howto=260)
> at /usr/src/sys/kern/kern_shutdown.c:366
> #2  0x80a9511b in vpanic (fmt=, 
> ap=) at /usr/src/sys/kern/kern_shutdown.c:759
> #3  0x80a94f53 in panic (fmt=0x0)
> at /usr/src/sys/kern/kern_shutdown.c:690
> #4  0x80eead51 in trap_fatal (frame=0xfe060bc95630, eva=0)
> at /usr/src/sys/amd64/amd64/trap.c:841
> #5  0x80eeaf43 in trap_pfault (frame=0xfe060bc95630, usermode=0)
> at /usr/src/sys/amd64/amd64/trap.c:691
> #6  0x80eea4ec in trap (frame=0xfe060bc95630)
> at /usr/src/sys/amd64/amd64/trap.c:442
> #7  0x80ece0d1 in calltrap ()
> at /usr/src/sys/amd64/amd64/exception.S:236
> #8  0x80bdaaa1 in ieee80211_ioctl (ifp=0xf80007991800, 
> cmd=, data=)
> at /usr/src/sys/net80211/ieee80211_ioctl.c:3398

The code crashing is quite recent:

> commit c6321695321bae43c0cd024db564c5207a7e8e31
> Author: avos 
> Date:   Mon May 2 20:46:05 2016 +
> 
> net80211: fix MAC address change via SIOCSIFLLADDR ioctl.
> 
> Recheck MAC address on SIOCSIFFLAGS; as a result,
> 'ifconfig wlan0 ether ' can be used after interface startup.
> 
> PR: 208933
> 
> 
> git-svn-id: svn+ssh://svn.freebsd.org/base/head@298941 
> ccf9f872-aa2e-dd11-9fc8-001c23d0bc1f
> 
> diff --git sys/net80211/ieee80211_ioctl.c sys/net80211/ieee80211_ioctl.c
> index c3b02e8..823906b 100644
> --- sys/net80211/ieee80211_ioctl.c
> +++ sys/net80211/ieee80211_ioctl.c
> @@ -3382,8 +3382,18 @@ ieee80211_ioctl(struct ifnet *ifp, u_long cmd, caddr_t 
> data)
> }
> IEEE80211_UNLOCK(ic);
> /* Wait for parent ioctl handler if it was queued */
> -   if (wait)
> +   if (wait) {
> ieee80211_waitfor_parent(ic);
> +
> +   /*
> +* Check if the MAC address was changed
> +* via SIOCSIFLLADDR ioctl.
> +*/
> +   if ((ifp->if_flags & IFF_UP) == 0 &&
> +   !IEEE80211_ADDR_EQ(vap->iv_myaddr, 
> IF_LLADDR(ifp)))
> +   IEEE80211_ADDR_COPY(vap->iv_myaddr,
> +   IF_LLADDR(ifp));
> +   }
> break;
> case SIOCADDMULTI:
> case SIOCDELMULTI:


> #9  0x80b9811c in ifioctl (so=, 
>

Panic in stable/11 (amd64) @r303903: page fault while in kernel mode

2016-08-10 Thread David Wolfskill
Happened after a few iterations of {"pkill dhclient" followed by
"dhclient wlan0"}.

Gory details (both "normal" and gzipped, and including the crash
dump and crashinfo) are in
.

Summary:
Wed Aug 10 15:56:26 UTC 2016

FreeBSD  11.0-BETA4 FreeBSD 11.0-BETA4 #69  r303902M/303903:1100120: Wed Aug 10 
04:00:09 PDT 2016 
r...@g1-252.catwhisker.org:/common/S3/obj/usr/src/sys/CANARY  amd64

panic: page fault

GNU gdb 6.1.1 [FreeBSD]
Copyright 2004 Free Software Foundation, Inc.
GDB is free software, covered by the GNU General Public License, and you are
welcome to change it and/or distribute copies of it under certain conditions.
Type "show copying" to see the conditions.
There is absolutely no warranty for GDB.  Type "show warranty" for details.
This GDB was configured as "amd64-marcel-freebsd"...

Unread portion of the kernel message buffer:


Fatal trap 12: page fault while in kernel mode
cpuid = 7; apic id = 07
fault virtual address   = 0x0
fault code  = supervisor read data, page not present
instruction pointer = 0x20:0x80bdaaa1
stack pointer   = 0x28:0xfe060bc956e0
frame pointer   = 0x28:0xfe060bc957b0
code segment= base 0x0, limit 0xf, type 0x1b
= DPL 0, pres 1, long 1, def32 0, gran 1
processor eflags= interrupt enabled, resume, IOPL = 0
current process = 20685 (wpa_supplicant)
trap number = 12
panic: page fault
cpuid = 7
KDB: stack backtrace:
#0 0x80add787 at kdb_backtrace+0x67
#1 0x80a950e2 at vpanic+0x182
#2 0x80a94f53 at panic+0x43
#3 0x80eead51 at trap_fatal+0x351
#4 0x80eeaf43 at trap_pfault+0x1e3
#5 0x80eea4ec at trap+0x26c
#6 0x80ece0d1 at calltrap+0x8
#7 0x80b9811c at ifioctl+0x133c
#8 0x80afc914 at kern_ioctl+0x2d4
#9 0x80afc5d1 at sys_ioctl+0x171
#10 0x80eeb6c9 at amd64_syscall+0x4e9
#11 0x80ece3bb at Xfast_syscall+0xfb
Uptime: 3h0m4s
...
Reading symbols from /boot/kernel/linux64.ko...Reading symbols from 
/usr/lib/debug//boot/kernel/linux64.ko.debug...done.
done.
Loaded symbols for /boot/kernel/linux64.ko
#0  doadump (textdump=) at pcpu.h:221
221 pcpu.h: No such file or directory.
in pcpu.h
(kgdb) #0  doadump (textdump=) at pcpu.h:221
#1  0x80a94b69 in kern_reboot (howto=260)
at /usr/src/sys/kern/kern_shutdown.c:366
#2  0x80a9511b in vpanic (fmt=, 
ap=) at /usr/src/sys/kern/kern_shutdown.c:759
#3  0x80a94f53 in panic (fmt=0x0)
at /usr/src/sys/kern/kern_shutdown.c:690
#4  0x80eead51 in trap_fatal (frame=0xfe060bc95630, eva=0)
at /usr/src/sys/amd64/amd64/trap.c:841
#5  0x80eeaf43 in trap_pfault (frame=0xfe060bc95630, usermode=0)
at /usr/src/sys/amd64/amd64/trap.c:691
#6  0x80eea4ec in trap (frame=0xfe060bc95630)
at /usr/src/sys/amd64/amd64/trap.c:442
#7  0x80ece0d1 in calltrap ()
at /usr/src/sys/amd64/amd64/exception.S:236
#8  0x80bdaaa1 in ieee80211_ioctl (ifp=0xf80007991800, 
cmd=, data=)
at /usr/src/sys/net80211/ieee80211_ioctl.c:3398
#9  0x80b9811c in ifioctl (so=, 
cmd=, data=, 
td=) at /usr/src/sys/net/if.c:2447
#10 0x80afc914 in kern_ioctl (td=, 
fd=, com=2149607696, data=0xfe060bc958e0 "wlan0")
at file.h:327
#11 0x80afc5d1 in sys_ioctl (td=, 
uap=0xfe060bc95a40) at /usr/src/sys/kern/sys_generic.c:743
#12 0x80eeb6c9 in amd64_syscall (td=, 
traced=) at subr_syscall.c:135
#13 0x80ece3bb in Xfast_syscall ()
at /usr/src/sys/amd64/amd64/exception.S:396
#14 0x0008015c448a in ?? ()
Previous frame inner to this frame (corrupt stack?)
Current language:  auto; currently minimal
(kgdb) 

This was on my laptop, which I'm actively using at work as I type
-- though it's now connected via wired NIC (em0).  I had experienced
no trouble with wlan0 at home (before coming in to work) or on the
bus (en route to work).  (I didn't attempt it while cycling to the
bus stop. :-})

Also, I had no issues running stable/11 (amd64) @303870 -- either
at home or at work -- yesterday.  On the other hand, this is (so
far) a one-off, so alleging a "pattern" at this point is not something
I'm willing to do.

Peace,
david
-- 
David H. Wolfskill  da...@catwhisker.org
Those who would murder in the name of God or prophet are blasphemous cowards.

See http://www.catwhisker.org/~david/publickey.gpg for my public key.


signature.asc
Description: PGP signature


Re: [Bug 211491] System hangs after "Uptime" on reboot with iSCSI, zfs, and altroot

2016-08-10 Thread Eric van Gyzen
On 08/10/16 10:19 AM, Kubilay Kocak wrote:
>> Furthermore, it's a new regression that will go into 11.0-RELEASE, so
>> getting some attention is a good thing.  I imagine this is why koobs@
>> CC'd stable@.
> 
> It was the original reporter CC'd, I added mfc-stable{10,11} flags in
> case the issue was in those branches.

Oh!  My apologies.  I misremembered.

(Hmmm...Bugzilla doesn't seem to record changes in the CC list).

> I'm very careful with cc'ing lists for src/base issues unless SNR is
> very high, high impact or it needs eyes now.
___
freebsd-stable@freebsd.org mailing list
https://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "freebsd-stable-unsubscr...@freebsd.org"


Re: [Bug 211491] System hangs after "Uptime" on reboot with iSCSI, zfs, and altroot

2016-08-10 Thread Eric van Gyzen
On 08/09/16 06:10 PM, bugzilla-nore...@freebsd.org wrote:
> https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=211491
> 
> --- Comment #13 from Ngie Cooper  ---
> Please don't add -current or -stable to bugs like this; it spams the list
> unnecessarily (this issue impacts users of iSCSI + ZFS -- which seems a bit
> niche right now)

I'm sorry that some people were annoyed, although I wouldn't call it
spam.  It's a discussion of a bug that affects -STABLE, so stable@ is
appropriate.

Furthermore, it's a new regression that will go into 11.0-RELEASE, so
getting some attention is a good thing.  I imagine this is why koobs@
CC'd stable@.

I thought it was limited to iSCSI, so I put that in the summary.
However, we now know that it is not limited to iSCSI.  I just updated
the summary accordingly.

Cheers,

Eric
___
freebsd-stable@freebsd.org mailing list
https://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "freebsd-stable-unsubscr...@freebsd.org"


Re: [Bug 211491] System hangs after "Uptime" on reboot with iSCSI, zfs, and altroot

2016-08-10 Thread Kubilay Kocak
On 11/08/2016 1:14 AM, Eric van Gyzen wrote:
> On 08/09/16 06:10 PM, bugzilla-nore...@freebsd.org wrote:
>> https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=211491
>>
>> --- Comment #13 from Ngie Cooper  ---
>> Please don't add -current or -stable to bugs like this; it spams the list
>> unnecessarily (this issue impacts users of iSCSI + ZFS -- which seems a bit
>> niche right now)
> 
> I'm sorry that some people were annoyed, although I wouldn't call it
> spam.  It's a discussion of a bug that affects -STABLE, so stable@ is
> appropriate.
> 
> Furthermore, it's a new regression that will go into 11.0-RELEASE, so
> getting some attention is a good thing.  I imagine this is why koobs@
> CC'd stable@.

It was the original reporter CC'd, I added mfc-stable{10,11} flags in
case the issue was in those branches.

I'm very careful with cc'ing lists for src/base issues unless SNR is
very high, high impact or it needs eyes now.

> I thought it was limited to iSCSI, so I put that in the summary.
> However, we now know that it is not limited to iSCSI.  I just updated
> the summary accordingly.
> 
> Cheers,
> 
> Eric

___
freebsd-stable@freebsd.org mailing list
https://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "freebsd-stable-unsubscr...@freebsd.org"


Re: fsck_ufs dumps core

2016-08-10 Thread Dmitry Sivachenko

> On 10 Aug 2016, at 17:55, Konstantin Belousov  wrote:
> 
> On Wed, Aug 10, 2016 at 05:29:31PM +0300, Dmitry Sivachenko wrote:
>> Hello,
>> 
>> I am running FreeBSD 10.3-STABLE #0 r299261M
>> 
>> After unclean reboot I am unable to fsck my UFS filesystem:
>> 
>> # fsck  /dev/mfid0p1
>> ** /dev/mfid0p1
>> ** Last Mounted on /opt
>> ** Phase 1 - Check Blocks and Sizes
>> fsck: /dev/mfid0p1: Segmentation fault
>> 
>> pid 482 (fsck_ufs), uid 0: exited on signal 11 (core dumped)
>> 
>> # gdb -c fsck_ufs.482 /sbin/fsck_ufs 
>> GNU gdb 6.1.1 [FreeBSD]
>> Copyright 2004 Free Software Foundation, Inc.
>> GDB is free software, covered by the GNU General Public License, and you are
>> welcome to change it and/or distribute copies of it under certain conditions.
>> Type "show copying" to see the conditions.
>> There is absolutely no warranty for GDB.  Type "show warranty" for details.
>> This GDB was configured as "amd64-marcel-freebsd"...
>> Core was generated by `fsck_ufs'.
>> Program terminated with signal 11, Segmentation fault.
>> Reading symbols from /lib/libufs.so.6...done.
>> Loaded symbols for /lib/libufs.so.6
>> Reading symbols from /lib/libc.so.7...done.
>> Loaded symbols for /lib/libc.so.7
>> Reading symbols from /libexec/ld-elf.so.1...done.
>> Loaded symbols for /libexec/ld-elf.so.1
>> #0  0x00409a8b in pass1 () at /place/WRK/src/sbin/fsck_ffs/pass1.c:83
>> 83  setbmap(i);
>> (gdb) bt
>> #0  0x00409a8b in pass1 () at /place/WRK/src/sbin/fsck_ffs/pass1.c:83
>> #1  0x00409050 in main (argc=, 
>>argv=) at /place/WRK/src/sbin/fsck_ffs/main.c:447
>> Current language:  auto; currently minimal
>> (gdb) 
>> 
> 
> Try to use alternative superblock (-b switch).  You can get the list of
> the possible values for -b by 'newfs -N' invocation, but you have to know
> the parameters which were used for formatting.


Yes, I tried several different backup superblocks, with the same result.  (I 
created this FS few years ago so I can't be 100% sure about the parameters, but 
I usually only use larger -i NN for big filesystems, and I can guess the exact 
value examining df -ik).


BTW I just noticed that when I use larger values for backup superblock, it 
reports an error which looks like overflow:

# fsck_ufs -b 7437746112 /dev/mfid0p1
Alternate super block location: -1152188480
** /dev/mfid0p1

CANNOT SEEK BLK: -1152188480
CONTINUE? [yn] 
___
freebsd-stable@freebsd.org mailing list
https://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "freebsd-stable-unsubscr...@freebsd.org"


Re: fsck_ufs dumps core

2016-08-10 Thread Konstantin Belousov
On Wed, Aug 10, 2016 at 05:29:31PM +0300, Dmitry Sivachenko wrote:
> Hello,
> 
> I am running FreeBSD 10.3-STABLE #0 r299261M
> 
> After unclean reboot I am unable to fsck my UFS filesystem:
> 
> # fsck  /dev/mfid0p1
> ** /dev/mfid0p1
> ** Last Mounted on /opt
> ** Phase 1 - Check Blocks and Sizes
> fsck: /dev/mfid0p1: Segmentation fault
> 
> pid 482 (fsck_ufs), uid 0: exited on signal 11 (core dumped)
> 
> # gdb -c fsck_ufs.482 /sbin/fsck_ufs 
> GNU gdb 6.1.1 [FreeBSD]
> Copyright 2004 Free Software Foundation, Inc.
> GDB is free software, covered by the GNU General Public License, and you are
> welcome to change it and/or distribute copies of it under certain conditions.
> Type "show copying" to see the conditions.
> There is absolutely no warranty for GDB.  Type "show warranty" for details.
> This GDB was configured as "amd64-marcel-freebsd"...
> Core was generated by `fsck_ufs'.
> Program terminated with signal 11, Segmentation fault.
> Reading symbols from /lib/libufs.so.6...done.
> Loaded symbols for /lib/libufs.so.6
> Reading symbols from /lib/libc.so.7...done.
> Loaded symbols for /lib/libc.so.7
> Reading symbols from /libexec/ld-elf.so.1...done.
> Loaded symbols for /libexec/ld-elf.so.1
> #0  0x00409a8b in pass1 () at /place/WRK/src/sbin/fsck_ffs/pass1.c:83
> 83  setbmap(i);
> (gdb) bt
> #0  0x00409a8b in pass1 () at /place/WRK/src/sbin/fsck_ffs/pass1.c:83
> #1  0x00409050 in main (argc=, 
> argv=) at /place/WRK/src/sbin/fsck_ffs/main.c:447
> Current language:  auto; currently minimal
> (gdb) 
> 

Try to use alternative superblock (-b switch).  You can get the list of
the possible values for -b by 'newfs -N' invocation, but you have to know
the parameters which were used for formatting.
___
freebsd-stable@freebsd.org mailing list
https://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "freebsd-stable-unsubscr...@freebsd.org"


fsck_ufs dumps core

2016-08-10 Thread Dmitry Sivachenko
Hello,

I am running FreeBSD 10.3-STABLE #0 r299261M

After unclean reboot I am unable to fsck my UFS filesystem:

# fsck  /dev/mfid0p1
** /dev/mfid0p1
** Last Mounted on /opt
** Phase 1 - Check Blocks and Sizes
fsck: /dev/mfid0p1: Segmentation fault

pid 482 (fsck_ufs), uid 0: exited on signal 11 (core dumped)

# gdb -c fsck_ufs.482 /sbin/fsck_ufs 
GNU gdb 6.1.1 [FreeBSD]
Copyright 2004 Free Software Foundation, Inc.
GDB is free software, covered by the GNU General Public License, and you are
welcome to change it and/or distribute copies of it under certain conditions.
Type "show copying" to see the conditions.
There is absolutely no warranty for GDB.  Type "show warranty" for details.
This GDB was configured as "amd64-marcel-freebsd"...
Core was generated by `fsck_ufs'.
Program terminated with signal 11, Segmentation fault.
Reading symbols from /lib/libufs.so.6...done.
Loaded symbols for /lib/libufs.so.6
Reading symbols from /lib/libc.so.7...done.
Loaded symbols for /lib/libc.so.7
Reading symbols from /libexec/ld-elf.so.1...done.
Loaded symbols for /libexec/ld-elf.so.1
#0  0x00409a8b in pass1 () at /place/WRK/src/sbin/fsck_ffs/pass1.c:83
83  setbmap(i);
(gdb) bt
#0  0x00409a8b in pass1 () at /place/WRK/src/sbin/fsck_ffs/pass1.c:83
#1  0x00409050 in main (argc=, 
argv=) at /place/WRK/src/sbin/fsck_ffs/main.c:447
Current language:  auto; currently minimal
(gdb) 



___
freebsd-stable@freebsd.org mailing list
https://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "freebsd-stable-unsubscr...@freebsd.org"


Re: Problems with em0 driver after upgrade to r303832

2016-08-10 Thread Pete French
Bit more testing on this, and I think it was a false alarm and my problem
was actually hardware related. Moving to a different PCI slot
stopped the problem happening. The original ethernet card is now
dead however, and the slow has burn marks on it - which makes me
think ,it isnt software really ;) Appologies for the noise, it just
coincided with an upgrade and I didnt expect the slot to be at fault after
trying three cards (though that was the logical next step)

-pete.
___
freebsd-stable@freebsd.org mailing list
https://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "freebsd-stable-unsubscr...@freebsd.org"