On Wed, Mar 03, 2021 at 01:54:52PM +0800, Kevin Lo wrote: > On Tue, Mar 02, 2021 at 08:23:38PM -0800, Nick Owens wrote: > > > > On Tue, Mar 2, 2021 at 7:00 PM Kevin Lo <ke...@kevlo.org> wrote: > > > > > > On Tue, Mar 02, 2021 at 02:20:12PM -0800, Nick Owens wrote: > > > > > > > > >Synopsis: multiple rge(4) nics causes hang > > > > >Category: kernel > > > > >Environment: > > > > System : OpenBSD 6.8 > > > > Details : OpenBSD 6.8 (GENERIC.MP) #5: Mon Feb 22 04:36:10 > > > > MST 2021 > > > > > > > > r...@syspatch-68-amd64.openbsd.org:/usr/src/sys/arch/amd64/compile/GENERIC.MP > > > > > > > > Architecture: OpenBSD.amd64 > > > > Machine : amd64 > > > > >Description: > > > > i have odroid h2+ [1] with the 'net card' [2]. it has two > > > > onboard rge(4) nics and four more on the net card. > > > > > > > > using only rge0/1, the system seems stable. but when i try to > > > > bring up more nics, e.g. ifconfig rge2/3/4/5 up, the system will hang. i > > > > cannot do anything, except pressing ^C on the console prints a ^C back > > > > very slowly but doesn't do anything else. it seems like ifconfig is > > > > stuck in kernel, and the system also stops doing anything else like > > > > routing the other working nics. > > > > > > > > current setup is rge0 for WAN with dhcpcd from ports for IPv6 > > > > access, and bridge(4) + vether(4) + rge1/2/3/4/5 with dhcpd/rad for LAN. > > > > for full config, you can find the ansible scrips at > > > > https://github.com/mischief/ansible/. have never used ansible before so > > > > i am sure it is crappy :) > > > > > > > > questions: > > > > 1. how can i debug what is happening when i up the nic > > > > with ifconfig? > > > > 2. would someone on openbsd team (maybe kevin lo) like a copy of > > > > this hardware donated? > > > > > > Hi, > > > > > > otto@ fixed interrupt storm on rge(4): > > > https://cvsweb.openbsd.org/cgi-bin/cvsweb/src/sys/dev/pci/if_rgereg.h.diff?r1=1.4&r2=1.5 > > > > > > Please apply the patch or try to run the latest OpenBSD snapshot, thanks. > > > > hi! thanks! > > > > i updated to a snapshot and up'ing the interfaces no longer hangs my > > system, and the nics all seem to work. my original bug was fixed > > months ago :-D > > > > one last concern, this seems to happen often: > > > > fugu# dmesg| grep 'watchdog timeout' > > rge4: watchdog timeout > > rge5: watchdog timeout > > rge2: watchdog timeout > > rge2: watchdog timeout > > > > is this anything to worry about? > > Generally watchdog timeout means driver didn't get TX completion interrupt > for a given time which in turn indicates TX MAC is in stuck condition. > RX MAC could be also in the stuck condition at that moment > (i.e. complete MAC lock-up). > > If this happens often, it's definitely a bug.
not happening that much, it maybe only happened during nic up or when i was replugging the physical cables. i will keep an eye on it. thank you for developing this driver so that this hardware works as an awesome router!