So I changed all 45 nodes in the cluster back to the 2.6.32 kernel and
restarted a test job. After 15-20 minutes, some of the nodes had dropped
out - responding still to pings, but impossible to ssh to them.
Output from one node as follows (after connecting to console),
Ben Hutchings wrote:
We'
Ben Hutchings wrote:
What protocol(s) are you using when this occurs?
I was running the Hadoop application (http://hadoop.apache.org/) which
uses TCP as far as I know.
I just tried to reproduce the problem using iperf but have sent GBs
between two machines running 2.6.32 without seeing any
On Tue, 2010-03-02 at 10:33 +, stephen mulcahy wrote:
> Package: linux-2.6
> Version: 2.6.32-5
> Severity: grave
>
> When running linux-image-2.6.32-trunk-amd64, the network stops
> responding if large amounts of traffic are transmitted/received. Running
> ifdown eth0 followed by ifup eth0 r
Package: linux-2.6
Version: 2.6.32-5
Severity: grave
When running linux-image-2.6.32-trunk-amd64, the network stops
responding if large amounts of traffic are transmitted/received. Running
ifdown eth0 followed by ifup eth0 restores operation of the network.
There are no errors relating to this
4 matches
Mail list logo