Re: [OpenAFS] Server (file) host wedge: WARNING: osi_NetIfPoller: ldi_open_by_name failed: 19

2011-12-24 Thread Jeffrey Altman
I'm fairly sure this is a Solaris bug.  Error 19 is ENODEV, which
indicates that /dev/udp is an unknown device.  OpenAFS used to panic
when this condition was reached.  The versions you are using will
continue to operate and simply fail to update the current interface
list.
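
For anyone wanting to decode the number in the warning, here is a minimal sketch using Python's standard errno table.  It assumes the platform you run it on numbers ENODEV as 19, as Solaris does; the variable names are only illustrative.

```python
import errno
import os

# Value printed after "ldi_open_by_name failed:" in the console warning.
code = 19

# Map the number to its symbolic name; "UNKNOWN" is a fallback for
# values the local errno table does not define.
name = errno.errorcode.get(code, "UNKNOWN")

# os.strerror gives the platform's human-readable message for the code.
print(code, name, os.strerror(code))
```

This only translates the number; it says nothing about why the device lookup failed.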

However, the root cause of the problem is outside of OpenAFS.  You
should contact Oracle for a fix.

Jeffrey Altman


On 12/24/2011 2:15 PM, Jeff Blaine wrote:
> I'm pretty sure this is the 2nd time we've seen this
> now.
> 
> AFS fileserver ur.our.org wedged today.  Our monitoring
> shows CPU usage pegged at 100% right when the problem
> happened (didn't escalate over hours...).
> 
> SunOS ur.our.org 5.10 Generic_144488-13 sun4u sparc SUNW,Sun-Fire-V240
> 
> /:ur # strings /kernel/fs/sparcv9/afs | grep OpenAFS
> @(#) OpenAFS 1.4.14 built  2011-07-07
> /:ur # strings /usr/afs/bin/fileserver | grep OpenAFS
> @(#) OpenAFS 1.4.11 built  2009-07-14
> /:ur #
> 
> It had been up 20 days (almost exactly).
> 
> The console showed repeating:
> 
> WARNING: osi_NetIfPoller: ldi_open_by_name failed: 19
> 
> No console login possible, no SSH possible.  Had to
> force-stop the OS.  Issuing 'sync' at the 'ok' prompt
> to force a crash dump generated tons of SCSI reset
> errors.
> 
> 
> ___
> OpenAFS-info mailing list
> OpenAFS-info@openafs.org
> https://lists.openafs.org/mailman/listinfo/openafs-info





Re: [OpenAFS] Server (file) host wedge: WARNING: osi_NetIfPoller: ldi_open_by_name failed: 19

2011-12-24 Thread Brandon Allbery
On Sat, Dec 24, 2011 at 14:15, Jeff Blaine wrote:

> No console login possible, no SSH possible.  Had to
> force-stop the OS.  Issuing 'sync' at the 'ok' prompt
> to force a crash dump generated tons of SCSI reset
> errors.
>

That tells me your problems are at a lower level than AFS; possibly a
failing disk or controller.

-- 
brandon s allbery  allber...@gmail.com
wandering unix systems administrator (available) (412) 475-9364 vm/sms


[OpenAFS] Server (file) host wedge: WARNING: osi_NetIfPoller: ldi_open_by_name failed: 19

2011-12-24 Thread Jeff Blaine

I'm pretty sure this is the 2nd time we've seen this
now.

AFS fileserver ur.our.org wedged today.  Our monitoring
shows CPU usage pegged at 100% right when the problem
happened (didn't escalate over hours...).

SunOS ur.our.org 5.10 Generic_144488-13 sun4u sparc SUNW,Sun-Fire-V240

/:ur # strings /kernel/fs/sparcv9/afs | grep OpenAFS
@(#) OpenAFS 1.4.14 built  2011-07-07
/:ur # strings /usr/afs/bin/fileserver | grep OpenAFS
@(#) OpenAFS 1.4.11 built  2009-07-14
/:ur #
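
The two build strings above can be compared programmatically.  This is a rough sketch only: the helper name parse_openafs_version is hypothetical, and the regular expression assumes the "@(#) OpenAFS <version> built  <date>" layout shown in the output above.

```python
import re


def parse_openafs_version(line):
    """Return ((major, minor, patch, ...), build_date) or None if no match."""
    m = re.search(
        r"OpenAFS\s+(\d+(?:\.\d+)*)\s+built\s+(\d{4}-\d{2}-\d{2})", line
    )
    if m is None:
        return None
    version = tuple(int(part) for part in m.group(1).split("."))
    return version, m.group(2)


# Lines copied from the strings output above.
kernel = parse_openafs_version("@(#) OpenAFS 1.4.14 built  2011-07-07")
server = parse_openafs_version("@(#) OpenAFS 1.4.11 built  2009-07-14")

# True when the kernel module and the fileserver binary report the
# same version number.
same_version = kernel[0] == server[0]
print(kernel, server, same_version)
```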

It had been up 20 days (almost exactly).

The console showed repeating:

WARNING: osi_NetIfPoller: ldi_open_by_name failed: 19

No console login possible, no SSH possible.  Had to
force-stop the OS.  Issuing 'sync' at the 'ok' prompt
to force a crash dump generated tons of SCSI reset
errors.

