On Tue, Apr 28, 2015 at 1:04 PM, Anuradha Karuppiah
<anurad...@cumulusnetworks.com> wrote:
> On Tue, Apr 28, 2015 at 12:37 PM, Scott Feldman <sfel...@gmail.com> wrote:
>> On Tue, Apr 28, 2015 at 8:39 AM, Anuradha Karuppiah
>> <anurad...@cumulusnetworks.com> wrote:
>>>
>>>
>>> On Mon, Apr 27, 2015 at 10:45 PM, Scott Feldman <sfel...@gmail.com> wrote:
>>>>
>>>> On Mon, Apr 27, 2015 at 10:38 AM,  <anurad...@cumulusnetworks.com> wrote:
>>>> > From: Anuradha Karuppiah <anurad...@cumulusnetworks.com>
>>>> >
>>>> > User space daemons can detect errors in the network that need to be
>>>> > notified to the switch device drivers.
>>>> >
>>>> > Drivers can react to this error state by doing a phy-down on the
>>>> > switch-port which would result in a carrier-off locally and on the
>>>> > directly connected switch. Doing that would prevent loops and
>>>> > black-holes in the network.
>>>>
>>>> (Sorry if this was asked earlier)
>>>>
>>>> Can the application simply send a SETLINK with IFF_UP clear and the
>>>> port driver's ndo_stop would bring the PHY link down?
>>>
>>>
>>> Yes, Clearing IFF_UP on detecting errors (PROTO_DOWN) is possible and we
>>> tried
>>> that implementation as well. Unfortunately it failed because of the
>>> following
>>> reasons -
>>>
>>> 1. There is no way to disambiguate between admin_down (!IFF_UP) and an
>>> APP/driver enforced error_down (IFF_PROTO_DOWN). Administrator or
>>> automation-scripts that monitor the config assumed that switch-port
>>> configuration had somehow fallen out of sync (and attempted to reinstate the
>>> admin_up repeatedly).
>>>
>>> 2. Automatic error recovery was not possible; consider the following
>>> scenario
>>> for e.g.
>>>    a. The MLAG peer-link is down so the MLAG app on the secondary switch has
>>>       proto_down’ed all the MLAG ports (including switch-port swp1) by
>>> clearing
>>>       IFF_UP.
>>>    b. At the same time the administrator is in the process of making some
>>>       changes on the network connected to swp1. To avoid doing it live he
>>> would
>>>       admin_disable swp1 (!IFF_UP) by doing an "ip link set swp1 down" (this
>>>       is a no-op as event #a has already cleared IFF_UP on swp1).
>>>    c. If the MLAG peer-link recovers at this point the MLAG app on the
>>>       secondary switch would try to automatically recover the MLAG ports
>>>       by clearing proto_down (i.e. setting IFF_UP); including on swp1. Doing
>>>       that overrides the administrator’s directive to keep swp1 admin_down.
>>>       Overriding an admin-down in a live network can be very dangerous so it
>>>       is not possible to do auto-error-recovery unless we have a way to
>>>       disambiguate between the admin and error states
>>
>> That makes sense.
>>
>> Dang, this is so close to IFF_DORMANT.  The interface can be IFF_UP
>> and link mode can be DORMANT.  Can the port driver kill PHY link if
>> dev->flags&IFF_DORMANT in ndo_set_rx_mode()?  Would require
>> IFF_DORMANT is included in dev->flags in __dev_change_flags().
>
> Yes, IFF_DORMANT does seem close to what is needed; in the current/standard
> interpretation IFF_DORMANT keeps the switch port phy-up and running (and most
> PDUs are also exchanged in the dormant state). Like you said we could
> re-interpret IFF_DORMANT in this context to phy-down the switch-port;
> unfortunately we are already using IFF_DORMANT as well (in its standard
> interpretation)...

That makes sense; best to not confuse IFF_DORMANT with this new need.

> We are using the dormant mode (for the MLAG app itself) to hold the MLAG port
> in a brief/transition-ary suspended state when the switch-port link/carrier up
> happens. This has been done to co-ordinate states across the MLAG peer 
> switches
> and to ensure that egress port block masks are programmed on the peer switch
> before transitioning the local switch port to an OPER_UP state. If we didn't 
> do
> that the dual-connected server would see duplicate packets every time a
> link-down to link-up happened on a MLAG port.

How can we see this in action?  I didn't find where the kernel egress
blocks the port when dormant.  What are the requirements for a kernel
port driver to support your MLAG app?  Is this MLAG app available
somewhere?
--
To unsubscribe from this list: send the line "unsubscribe netdev" in
the body of a message to majord...@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html

Reply via email to