Hardware machine logs are even more strange for me.

Kern.log: This is last log before today... There are only those entries in the 
whole log!
Jul  4 00:57:01 azul-manager kernel: [730178.213562] md: data-check of RAID 
array md0
Jul  4 00:57:01 azul-manager kernel: [730178.213571] md: minimum _guaranteed_  
speed: 1000 KB/sec/disk.
Jul  4 00:57:01 azul-manager kernel: [730178.213578] md: using maximum 
available idle IO bandwidth (but not more than 200000 KB/sec) for data-check.
Jul  4 00:57:01 azul-manager kernel: [730178.213588] md: using 128k window, 
over a total of 488386496 blocks.
Jul  4 02:52:34 azul-manager kernel: [737110.476389] md: md0: data-check done.

Next log is:
Jul 28 10:12:08 azul-manager kernel: imklog 4.2.0, log source = /proc/kmsg 
started.
Jul 28 10:12:08 azul-manager kernel: [    0.000000] Initializing cgroup subsys 
cpuset
Jul 28 10:12:08 azul-manager kernel: [    0.000000] Initializing cgroup subsys 
cpu
Jul 28 10:12:08 azul-manager kernel: [    0.000000] Linux version 
2.6.32-22-server (bui...@yellow) (gcc version 4.4.3 (Ubuntu 4.4.3-4ubuntu5) ) 
#36-Ubuntu SMP Thu Jun 3 2

Reboot: Day 28. But it was working meanwhile...

messages.log is also strange:
--------------------------------------

Jul 25 06:40:03 azul-manager rsyslogd: [origin software="rsyslogd" 
swVersion="4.2.0" x-pid="1404" x-info="http://www.rsyslog.com";] rsyslogd was 
HUPed, type 'lightweight'.
Jul 26 06:54:20 azul-manager rsyslogd: [origin software="rsyslogd" 
swVersion="4.2.0" x-pid="1404" x-info="http://www.rsyslog.com";] rsyslogd was 
HUPed, type 'lightweight'.
Jul 27 06:42:59 azul-manager rsyslogd: [origin software="rsyslogd" 
swVersion="4.2.0" x-pid="1404" x-info="http://www.rsyslog.com";] rsyslogd was 
HUPed, type 'lightweight'.
Jul 28 10:12:08 azul-manager kernel: imklog 4.2.0, log source = /proc/kmsg 
started.
Jul 28 10:12:08 azul-manager rsyslogd: [origin software="rsyslogd" 
swVersion="4.2.0" x-pid="1419" x-info="http://www.rsyslog.com";] (re)start
Jul 28 10:12:08 azul-manager rsyslogd: rsyslogd's groupid changed to 103
Jul 28 10:12:08 azul-manager rsyslogd: rsyslogd's userid changed to 101
Jul 28 10:12:08 azul-manager kernel: [    0.000000] Initializing cgroup subsys 
cpuset


Only 3 entries in 3 days? Is that normal?

And here comes the problem, surely: daemon.log

Jul 25 06:59:48 azul-manager ntpd[1382]: kernel time sync status change 2001
Jul 25 07:08:03 azul-manager smartd[1967]: Device: /dev/sde, SMART Usage 
Attribute: 194 Temperature_Celsius changed from 41 to 40
Jul 25 08:08:02 azul-manager smartd[1967]: Device: /dev/sde, SMART Prefailure 
Attribute: 1 Raw_Read_Error_Rate changed from 56 to 55
Jul 25 08:08:02 azul-manager smartd[1967]: Device: /dev/sde, SMART Usage 
Attribute: 195 Hardware_ECC_Recovered changed from 56 to 55
Jul 25 08:38:03 azul-manager smartd[1967]: Device: /dev/sde, SMART Prefailure 
Attribute: 1 Raw_Read_Error_Rate changed from 55 to 57
Jul 25 08:38:03 azul-manager smartd[1967]: Device: /dev/sde, SMART Usage 
Attribute: 195 Hardware_ECC_Recovered changed from 55 to 57
Jul 25 08:59:18 azul-manager ntpd[1382]: kernel time sync status change 6001
Jul 25 09:08:03 azul-manager smartd[1967]: Device: /dev/sde, SMART Prefailure 
Attribute: 1 Raw_Read_Error_Rate changed from 57 to 56
Jul 25 09:08:03 azul-manager smartd[1967]: Device: /dev/sde, SMART Usage 
Attribute: 195 Hardware_ECC_Recovered changed from 57 to 56
Jul 25 09:16:21 azul-manager ntpd[1382]: kernel time sync status change 2001
Jul 25 09:38:02 azul-manager smartd[1967]: Device: /dev/sdd, SMART Usage 
Attribute: 194 Temperature_Celsius changed from 106 to 105
Jul 25 09:38:02 azul-manager smartd[1967]: Device: /dev/sde, SMART Prefailure 
Attribute: 1 Raw_Read_Error_Rate changed from 56 to 57
Jul 25 09:38:02 azul-manager smartd[1967]: Device: /dev/sde, SMART Usage 
Attribute: 194 Temperature_Celsius changed from 40 to 41
Jul 25 09:38:02 azul-manager smartd[1967]: Device: /dev/sde, SMART Usage 
Attribute: 195 Hardware_ECC_Recovered changed from 56 to 57
....
Jul 27 13:08:03 azul-manager smartd[1967]: Device: /dev/sde, SMART Prefailure 
Attribute: 1 Raw_Read_Error_Rate changed from 55 to 56
Jul 27 13:08:03 azul-manager smartd[1967]: Device: /dev/sde, SMART Usage 
Attribute: 194 Temperature_Celsius changed from 42 to 43
Jul 27 13:08:03 azul-manager smartd[1967]: Device: /dev/sde, SMART Usage 
Attribute: 195 Hardware_ECC_Recovered changed from 55 to 56
Jul 27 13:38:02 azul-manager smartd[1967]: Device: /dev/sda, SMART Usage 
Attribute: 190 Airflow_Temperature_Cel changed from 63 to 62
Jul 27 13:38:02 azul-manager smartd[1967]: Device: /dev/sda, SMART Usage 
Attribute: 194 Temperature_Celsius changed from 37 to 38
Jul 27 14:08:02 azul-manager smartd[1967]: Device: /dev/sde, SMART Usage 
Attribute: 194 Temperature_Celsius changed from 43 to 42
Jul 27 14:12:24 azul-manager ntpd[1382]: kernel time sync status change 2001
Jul 27 14:38:02 azul-manager smartd[1967]: Device: /dev/sdd, SMART Usage 
Attribute: 194 Temperature_Celsius changed from 104 to 103
Jul 27 14:38:02 azul-manager smartd[1967]: Device: /dev/sde, SMART Prefailure 
Attribute: 1 Raw_Read_Error_Rate changed from 56 to 55
Jul 27 14:38:02 azul-manager smartd[1967]: Device: /dev/sde, SMART Usage 
Attribute: 194 Temperature_Celsius changed from 42 to 43
Jul 27 14:38:02 azul-manager smartd[1967]: Device: /dev/sde, SMART Usage 
Attribute: 195 Hardware_ECC_Recovered changed from 56 to 55
Jul 27 15:08:03 azul-manager smartd[1967]: Device: /dev/sdb, SMART Usage 
Attribute: 190 Airflow_Temperature_Cel changed from 63 to 62
Jul 27 15:08:03 azul-manager smartd[1967]: Device: /dev/sdb, SMART Usage 
Attribute: 194 Temperature_Celsius changed from 37 to 38
Jul 27 15:08:03 azul-manager smartd[1967]: Device: /dev/sdc, SMART Usage 
Attribute: 194 Temperature_Celsius changed from 104 to 103
Jul 27 15:08:03 azul-manager smartd[1967]: Device: /dev/sde, SMART Prefailure 
Attribute: 1 Raw_Read_Error_Rate changed from 55 to 56
Jul 27 15:08:03 azul-manager smartd[1967]: Device: /dev/sde, SMART Usage 
Attribute: 195 Hardware_ECC_Recovered changed from 55 to 56
Jul 28 10:12:08 azul-manager init: apport pre-start process (1485) terminated 
with status 1
Jul 28 10:12:08 azul-manager init: apport post-stop process (1515) terminated 
with status 1


I know that temperature is far from ideal but...  194 Temperature_Celsius 
changed from 104 to 103... Does this means that the disk was at 104C? That 
can't be right...

I continue investigating as none of the logs says that the disk
failed...

-- 
Server disk was reverted as it were 3 months ago. How?
https://bugs.launchpad.net/bugs/611188
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs

Reply via email to