Re: [CentOS] ECC RAM Error

2007-10-15 Thread Centos
Thanks every one for help and response. I just noticed that these errors might be soft error, because only happens when I overload the storage with copying simultaneously large files on the same port and scsi controller, so I was thinking it should be ECC speed to calculation of the parity or

Re: [CentOS] ECC RAM Error

2007-10-11 Thread John R Pierce
Peter Arremann wrote: On Thursday 11 October 2007, John R Pierce wrote: Peter Arremann wrote: On Thursday 11 October 2007, Centos wrote: The ECC errors only happens when I am transferring data from other storage to this one that we get error. it only happens when it is writing d

Re: [CentOS] ECC RAM Error

2007-10-11 Thread Peter Arremann
On Thursday 11 October 2007, John R Pierce wrote: > Peter Arremann wrote: > > On Thursday 11 October 2007, Centos wrote: > >> The ECC errors only happens when I am transferring data from other > >> storage to this one that we get error. > >> it only happens when it is writing data to it. > > What d

Re: [CentOS] ECC RAM Error

2007-10-11 Thread John R Pierce
Peter Arremann wrote: On Thursday 11 October 2007, Centos wrote: The ECC errors only happens when I am transferring data from other storage to this one that we get error. it only happens when it is writing data to it. What do you mean by "transferring data from other storage to th

Re: [CentOS] ECC RAM Error

2007-10-11 Thread Centos
the interesting thing is I only see these ECC errors when I am writting data to this box, and no error shows up when I am reading data from it, so if it was corrupted Memory or controller those errors should show up even when I am reading them. am I missing some thing here ? Peter Arremann

Re: [CentOS] ECC RAM Error

2007-10-11 Thread Peter Arremann
On Thursday 11 October 2007, Centos wrote: > The ECC errors only happens when I am transferring data from other > storage to this one that we get error. > it only happens when it is writing data to it. ECC errors can happen anywhere. It can be that the data is corrupted while it is transmitted to

Re: [CentOS] ECC RAM Error

2007-10-11 Thread Dan Halbert
do you think replacing ram will solve our problem ? how can I make sure it is the ram ? This is almost certainly a hardware problem. It could be the RAM, a particular motherboard DIMM slot, or maybe the RAM is just not seated quite right in the memory slot. I have seen all three of these probl

Re: [CentOS] ECC RAM Error

2007-10-11 Thread Lance Davis
On Thu, 11 Oct 2007, Centos wrote: Thank you Lance, We will change the memory to see if it is resolving the problem. that storage only has basic linux kernel , which unfortunately does not carry memtest86. memtest86 is usually a package that you boot into ... Regards Lance Lance Davis

Re: [CentOS] ECC RAM Error

2007-10-11 Thread Centos
Thank you Lance, We will change the memory to see if it is resolving the problem. that storage only has basic linux kernel , which unfortunately does not carry memtest86. Lance Davis wrote: On Thu, 11 Oct 2007, Centos wrote: do you think replacing ram will solve our problem ? assuming i

Re: [CentOS] ECC RAM Error

2007-10-11 Thread Lance Davis
On Thu, 11 Oct 2007, Centos wrote: do you think replacing ram will solve our problem ? assuming it is RAM gone faulty and not some other issue then it should. how can I make sure it is the ram ? memtest86 ?? Regards Lance Lance Davis wrote: On Thu, 11 Oct 2007, Centos wrote: > The

Re: [CentOS] ECC RAM Error

2007-10-11 Thread Centos
do you think replacing ram will solve our problem ? how can I make sure it is the ram ? Lance Davis wrote: On Thu, 11 Oct 2007, Centos wrote: The ECC errors only happens when I am transferring data from other storage to this one that we get error. it only happens when it is writing data to i

Re: [CentOS] ECC RAM Error

2007-10-11 Thread Lance Davis
On Thu, 11 Oct 2007, Centos wrote: The ECC errors only happens when I am transferring data from other storage to this one that we get error. it only happens when it is writing data to it. Well that is when it is detected ... As I said ECC RAM errors are concerned with an error in storage - n

Re: [CentOS] ECC RAM Error

2007-10-11 Thread Centos
The ECC errors only happens when I am transferring data from other storage to this one that we get error. it only happens when it is writing data to it. Lance Davis wrote: On Thu, 11 Oct 2007, Centos wrote: was wondering if it is safe to use the device, until we receive RAM. that device is o

Re: [CentOS] ECC RAM Error

2007-10-11 Thread Lance Davis
On Thu, 11 Oct 2007, Centos wrote: was wondering if it is safe to use the device, until we receive RAM. that device is our main storage. does data retransmit when ECC errors happen. I don't want to have data corruption. You are not talking about data transission - but storage If two or more

Re: [CentOS] ECC RAM Error

2007-10-11 Thread Centos
was wondering if it is safe to use the device, until we receive RAM. that device is our main storage. does data retransmit when ECC errors happen. I don't want to have data corruption. Matthew Miller wrote: On Thu, Oct 11, 2007 at 09:57:12AM -0300, Centos wrote: Has any one have any exper

Re: [CentOS] ECC RAM Error

2007-10-11 Thread Matthew Miller
On Thu, Oct 11, 2007 at 09:57:12AM -0300, Centos wrote: > Has any one have any experience in ECC RAM Errors. > we are seeing ECC fault Errors but I am not sure if it can be related to > RAM it self or > it is related to bad connection and noise. > please let me know if you have a good document reg

[CentOS] ECC RAM Error

2007-10-11 Thread Centos
Hello Has any one have any experience in ECC RAM Errors. we are seeing ECC fault Errors but I am not sure if it can be related to RAM it self or it is related to bad connection and noise. please let me know if you have a good document regarding ECC Errors, specially I want to know if data will