From adding some more dumping of CQ state, what _may_ be happening is that under rare conditions the HCA's CQ consumer index gets incremented by one too many. Then, when the CQ is completely empty, it looks full to the HW and we get an overrun on the next CQE. (I saw it happen after ~300K increments of the CQ's CI, ~160K of which were increments by more than 1.)
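To make the suspected failure mode concrete, here is a rough sketch of how a consumer index that is one too high can make an empty CQ look full. This is only a toy model under assumptions of mine: the one-slot-free ring convention, the struct cq_model layout, and the helper names (cq_init, cq_hw_full, cq_hw_post, cq_doorbell_inc) are all hypothetical and are not taken from the driver or from the HCA's actual internal logic.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical model, NOT the real HCA logic: a ring of `size` entries
 * (power of two) that keeps one slot free, so "full" means the producer
 * is one step behind the consumer. */
struct cq_model {
    uint32_t size;   /* number of CQE slots, power of two */
    uint32_t pi;     /* HW producer index */
    uint32_t hw_ci;  /* consumer index as the HW believes it */
    uint32_t sw_ci;  /* consumer index as the driver believes it */
};

static void cq_init(struct cq_model *cq, uint32_t size)
{
    assert(size >= 2 && (size & (size - 1)) == 0);
    cq->size = size;
    cq->pi = cq->hw_ci = cq->sw_ci = 0;
}

/* HW-side full check under the assumed one-slot-free convention. */
static int cq_hw_full(const struct cq_model *cq)
{
    return ((cq->pi + 1) & (cq->size - 1)) == cq->hw_ci;
}

/* HW writes one CQE; returns -1 where it would report an overrun. */
static int cq_hw_post(struct cq_model *cq)
{
    if (cq_hw_full(cq))
        return -1;
    cq->pi = (cq->pi + 1) & (cq->size - 1);
    return 0;
}

/* Driver consumes n CQEs and rings the doorbell with an increment of n.
 * `glitch` models the suspected FW bug: the HW adds one too many. */
static void cq_doorbell_inc(struct cq_model *cq, uint32_t n, int glitch)
{
    uint32_t mask = cq->size - 1;

    cq->sw_ci = (cq->sw_ci + n) & mask;
    cq->hw_ci = (cq->hw_ci + n + (glitch ? 1 : 0)) & mask;
}
```

In this model, posting three CQEs and then ringing one doorbell with an increment of 3 and the glitch set leaves sw_ci equal to pi (empty from the driver's point of view) while cq_hw_full() is true, so the next cq_hw_post() fails.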
I don't see how the driver could be doing this, since the HCA ended up with a CI that was one more than the sum of the increments the driver issued. Also, converting all of the increment-CI doorbells to increment by only 1 fixes the problem, which is more evidence of a FW glitch.

Thanks,
  Roland
