Hi Pradeep,

On Wed, Jul 29, 2009 at 2:14 PM, Pradeep Satyanarayana <
[email protected]> wrote:

> Hal Rosenstock wrote:
> > Hi,
> >
> > I'm seeing the following messages from IPoIB:
> > ib0: post_send failed
> > ib0: post_send failed
> > ib0: post_send failed
> > ib0: post_send failed
> > ib0: post_send failed
> > ib0: post_send failed
> > NETDEV WATCHDOG: ib0: transmit timed out
> > ib0: transmit timeout: latency 1374 msecs
> > ib0: queue stopped 1, tx_head 140245691, tx_tail 140245565
> >
> > What are the possible (and most likely) causes of post_send failures ? I
> > went through the code for all the errors (some at the driver level) but
> > none popped out at me.
> >
>
> Is it possible that the receiver is overwhelmed and hence the tx_ring is
> full?


It's possible but from the message you can't tell whether the tx_ring is
full.

Does it make sense to increase the transmit ring size via send_queue_size
mod param ?


>
> Is this a UDP application?


There is at least some UDP and there are many concurrent clients.


>
>
> > Once the transmit queue is stopped, does the interface need to be taken
> > down and then back up to restart this ?
>
> One does not need to take down the interface. It should be able to recover
> on it's
> own. There is a timer that kicks in and checks if the tx_ring is still full
> or not-
> the transmits should start again. Thanks!


Thanks for the help!

-- Hal


>
>
> Pradeep
>
>
_______________________________________________
general mailing list
[email protected]
http://lists.openfabrics.org/cgi-bin/mailman/listinfo/general

To unsubscribe, please visit http://openib.org/mailman/listinfo/openib-general

Reply via email to