Re: [PLUG] Recommended Disk Test

2015-01-15 Thread Brian Martin

> sudo smartctl -a /dev/sda

As mentioned by others, your disk is 4.2 years old (Power_On_Hours / 24
/ 365), so it's beyond the planned lifespan of 3  years -- though most
disks exceed that.  You can safely do the following:
 - "sudo smartctl -t offline /dev/sda"
 - wait the named amount of time +50%, because their estimates are
always optimistic
 - "sudo smartctl -l selftest /dev/sda"
You should see an entry under extended tests with a LifeTime hours that
matches the current value of Power_On_Hours.  It'll report whether it
detected errors or not.  This test is non-destructive, can be run
on-line (despite the "offline" name -- probably a historical artifact),
but is only somewhat effective in detecting problems.

I've used lots of other test from the various manufacturers (see my
article at http://www.pdxsys.com/articles/hard-drive-test-tools/ for
info), but these are destructive and the best one was only 75% effective
at detecting a known failed drive.

-B



Brian P. Martin, Chief Consultant
Martin Consulting Services, Inc.
Phone: 503-617-4500
E-mail: br...@martinconsulting.com
Web: www.martinconsulting.com
Blog: www.pdxsys.com



___
PLUG mailing list
PLUG@lists.pdxlinux.org
http://lists.pdxlinux.org/mailman/listinfo/plug


Re: [PLUG] denyhosts not blocking some IPs with failed ssh root logins

2015-01-15 Thread Paul Heinlein

On Wed, 14 Jan 2015, Galen Seitz wrote:


Hi,

Is anyone else seeing problems with denyhosts not blocking some failed
logins?  This popped up in last night's logwatch:


Galen,

I've largely ditched DenyHosts for Fail2ban, but I saw similar things 
a few weeks ago. The problem was that somewhere along the line the 
entries in syslog no longer matched the regex that indicated a failed 
login.


I ended up writing a new set of regexes for Fail2ban. They sometimes 
overlap with the existing ones (which I didn't alter), but I'd rather 
have a bad log entry match twice than not at all.


Bottom line: I'd suggest comparing the log entries that weren't 
matched with the regex code in DenyHosts.


--
Paul Heinlein
heinl...@madboa.com
45°38' N, 122°6' W___
PLUG mailing list
PLUG@lists.pdxlinux.org
http://lists.pdxlinux.org/mailman/listinfo/plug


Re: [PLUG] Recommended Disk Test

2015-01-15 Thread Dick Steffens
On 01/15/2015 12:42 PM, Brian Martin wrote:
>> sudo smartctl -a /dev/sda
> As mentioned by others, your disk is 4.2 years old (Power_On_Hours / 24
> / 365), so it's beyond the planned lifespan of 3  years -- though most
> disks exceed that.  You can safely do the following:
>   - "sudo smartctl -t offline /dev/sda"
>   - wait the named amount of time +50%, because their estimates are
> always optimistic
=== START OF OFFLINE IMMEDIATE AND SELF-TEST SECTION ===
Sending command: "Execute SMART off-line routine immediately in off-line 
mode".
Drive command "Execute SMART off-line routine immediately in off-line 
mode" successful.
Testing has begun.
Please wait 625 seconds for test to complete.
Test will complete after Thu Jan 15 15:07:29 2015

I'll come back to it after 3:30 and run the next part.

>   - "sudo smartctl -l selftest /dev/sda"
> You should see an entry under extended tests with a LifeTime hours that
> matches the current value of Power_On_Hours.  It'll report whether it
> detected errors or not.  This test is non-destructive, can be run
> on-line (despite the "offline" name -- probably a historical artifact),
> but is only somewhat effective in detecting problems.
>
> I've used lots of other test from the various manufacturers (see my
> article at http://www.pdxsys.com/articles/hard-drive-test-tools/ for
> info), but these are destructive and the best one was only 75% effective
> at detecting a known failed drive

Thanks. I've got a new power supply on order. Should arrive Monday.

I'll keep in mind that this  hard drive is old, and should be replaced, 
soon.


-- 
Regards,

Dick Steffens

___
PLUG mailing list
PLUG@lists.pdxlinux.org
http://lists.pdxlinux.org/mailman/listinfo/plug


Re: [PLUG] Recommended Disk Test

2015-01-15 Thread Bill Barry
Maybe the problem is in the software, not the hardware. What happens
if you just run the machine for a few weeks without MythTV running.
Or you could run top or  something occasionally and see if there is a
memory leak or maybe try the windows solution and manually reboot it
every evening and see if the problem goes away. Or perhaps there is a
better way to narrow down the problem.

Bill

On Thu, Jan 15, 2015 at 5:01 PM, Dick Steffens  wrote:
> On 01/15/2015 12:42 PM, Brian Martin wrote:
>>> sudo smartctl -a /dev/sda
>> As mentioned by others, your disk is 4.2 years old (Power_On_Hours / 24
>> / 365), so it's beyond the planned lifespan of 3  years -- though most
>> disks exceed that.  You can safely do the following:
>>   - "sudo smartctl -t offline /dev/sda"
>>   - wait the named amount of time +50%, because their estimates are
>> always optimistic
> === START OF OFFLINE IMMEDIATE AND SELF-TEST SECTION ===
> Sending command: "Execute SMART off-line routine immediately in off-line
> mode".
> Drive command "Execute SMART off-line routine immediately in off-line
> mode" successful.
> Testing has begun.
> Please wait 625 seconds for test to complete.
> Test will complete after Thu Jan 15 15:07:29 2015
>
> I'll come back to it after 3:30 and run the next part.
>
>>   - "sudo smartctl -l selftest /dev/sda"
>> You should see an entry under extended tests with a LifeTime hours that
>> matches the current value of Power_On_Hours.  It'll report whether it
>> detected errors or not.  This test is non-destructive, can be run
>> on-line (despite the "offline" name -- probably a historical artifact),
>> but is only somewhat effective in detecting problems.
>>
>> I've used lots of other test from the various manufacturers (see my
>> article at http://www.pdxsys.com/articles/hard-drive-test-tools/ for
>> info), but these are destructive and the best one was only 75% effective
>> at detecting a known failed drive
>
> Thanks. I've got a new power supply on order. Should arrive Monday.
>
> I'll keep in mind that this  hard drive is old, and should be replaced,
> soon.
>
>
> --
> Regards,
>
> Dick Steffens
>
> ___
> PLUG mailing list
> PLUG@lists.pdxlinux.org
> http://lists.pdxlinux.org/mailman/listinfo/plug
___
PLUG mailing list
PLUG@lists.pdxlinux.org
http://lists.pdxlinux.org/mailman/listinfo/plug


Re: [PLUG] Recommended Disk Test

2015-01-15 Thread Dick Steffens
On 01/15/2015 03:09 PM, Bill Barry wrote:
> Maybe the problem is in the software, not the hardware. What happens
> if you just run the machine for a few weeks without MythTV running.

I'd miss the game. :-)

> Or you could run top or  something occasionally and see if there is a
> memory leak

I used to run top over an ssh connection for days at a time. When the 
machine froze there wasn't anything useful I could see other than that 
top was also frozen.

> or maybe try the windows solution and manually reboot it
> every evening and see if the problem goes away.

That's an interesting thought. Once I've installed the new power supply, 
if the problem doesn't go away, I'll give that a try.

> Or perhaps there is a
> better way to narrow down the problem.

When it first started happening I had left the MythTV front end program 
running. I've since shut that program down after watching the TV. It 
hasn't made any difference I've been able to observe, though.

Thanks for your ideas.

-- 
Regards,

Dick Steffens

___
PLUG mailing list
PLUG@lists.pdxlinux.org
http://lists.pdxlinux.org/mailman/listinfo/plug


Re: [PLUG] Recommended Disk Test

2015-01-15 Thread Dick Steffens
On 01/15/2015 03:01 PM, Dick Steffens wrote:
> On 01/15/2015 12:42 PM, Brian Martin wrote:
>>> sudo smartctl -a /dev/sda
>> As mentioned by others, your disk is 4.2 years old (Power_On_Hours / 24
>> / 365), so it's beyond the planned lifespan of 3  years -- though most
>> disks exceed that.  You can safely do the following:
>>- "sudo smartctl -t offline /dev/sda"
>>- wait the named amount of time +50%, because their estimates are
>> always optimistic
> === START OF OFFLINE IMMEDIATE AND SELF-TEST SECTION ===
> Sending command: "Execute SMART off-line routine immediately in off-line
> mode".
> Drive command "Execute SMART off-line routine immediately in off-line
> mode" successful.
> Testing has begun.
> Please wait 625 seconds for test to complete.
> Test will complete after Thu Jan 15 15:07:29 2015

=== START OF READ SMART DATA SECTION ===
SMART Self-test log structure revision number 1
Num  Test_DescriptionStatus  Remaining 
LifeTime(hours)  LBA_of_first_error
# 1  Vendor (0x50)   Completed without error   00% 0 -




rsteff@ThinkCentre-M58p:~$ sudo smartctl -a /dev/sda
smartctl 5.41 2011-06-09 r3365 [x86_64-linux-3.5.0-18-generic] (local build)
Copyright (C) 2002-11 by Bruce Allen, http://smartmontools.sourceforge.net

=== START OF INFORMATION SECTION ===
Model Family: Seagate Barracuda 7200.12
Device Model: ST3320418AS
Serial Number:6VM97DR5
LU WWN Device Id: 5 000c50 01ed8e39b
Firmware Version: CC66
User Capacity:320,072,933,376 bytes [320 GB]
Sector Size:  512 bytes logical/physical
Device is:In smartctl database [for details use: -P show]
ATA Version is:   8
ATA Standard is:  ATA-8-ACS revision 4
Local Time is:Thu Jan 15 15:40:36 2015 PST
SMART support is: Available - device has SMART capability.
SMART support is: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

General SMART Values:
Offline data collection status:  (0x82)Offline data collection activity
 was completed without error.
 Auto Offline Data Collection: Enabled.
Self-test execution status:  (   0)The previous self-test 
routine completed
 without error or no self-test has ever
 been run.
Total time to complete Offline
data collection: (  625) seconds.
Offline data collection
capabilities:  (0x7b) SMART execute Offline immediate.
 Auto Offline data collection on/off support.
 Suspend Offline collection upon new
 command.
 Offline surface scan supported.
 Self-test supported.
 Conveyance Self-test supported.
 Selective Self-test supported.
SMART capabilities:(0x0003)Saves SMART data before entering
 power-saving mode.
 Supports SMART auto save timer.
Error logging capability:(0x01)Error logging supported.
 General Purpose Logging supported.
Short self-test routine
recommended polling time:  (   1) minutes.
Extended self-test routine
recommended polling time:  (  68) minutes.
Conveyance self-test routine
recommended polling time:  (   2) minutes.
SCT capabilities:(0x103f)SCT Status supported.
 SCT Error Recovery Control supported.
 SCT Feature Control supported.
 SCT Data Table supported.

SMART Attributes Data Structure revision number: 10
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME  FLAG VALUE WORST THRESH TYPE UPDATED  
WHEN_FAILED RAW_VALUE
   1 Raw_Read_Error_Rate 0x000f   119   099   006Pre-fail 
Always   -   206415233
   3 Spin_Up_Time0x0003   097   097   000Pre-fail 
Always   -   0
   4 Start_Stop_Count0x0032   100   100   020Old_age 
Always   -   938
   5 Reallocated_Sector_Ct   0x0033   100   100   036Pre-fail 
Always   -   0
   7 Seek_Error_Rate 0x000f   084   060   030Pre-fail 
Always   -   259747921
   9 Power_On_Hours  0x0032   058   058   000Old_age 
Always   -   37506
  10 Spin_Retry_Count0x0013   100   100   097Pre-fail 
Always   -   0
  12 Power_Cycle_Count   0x0032   100   100   020Old_age 
Always   -   468
183 Runtime_Bad_Block   0x0032   100   100   000Old_age 
Always   -   0
184 End-to-End_Error0x0032   100   100   099Old_age 
Always   -   0
187 Reported_Uncorrect  0x0032   100   100   000Old_age 
Always   -   0
188 Command_Timeout 0x0032   100   098   000Old_age 
Always   -   66256
189 High_Fly_Writes 0x003a   100   100   000Old_age 
Always   -   0
190 Airflow_Temperature_Cel 0x0022   06

Re: [PLUG] Recommended Disk Test

2015-01-15 Thread Brian Martin
On 01/15/2015 03:46 PM, Dick Steffens wrote:
> === START OF READ SMART DATA SECTION ===
> SMART Self-test log structure revision number 1
> Num  Test_DescriptionStatus  Remaining
> LifeTime(hours)  LBA_of_first_error
> # 1  Vendor (0x50)   Completed without error   00% 0 -
>

That test hasn't completed yet.  The one entry in the test report was 
taken when the disk had 0 hours of power-up time (e.g. at the factory).
You should get another one that matches the current power-up time
when it's done.

 -B.




-- 

Brian P. Martin, Chief Consultant
Martin Consulting Services, Inc.
Phone: 503-617-4500
E-mail: br...@martinconsulting.com
___
PLUG mailing list
PLUG@lists.pdxlinux.org
http://lists.pdxlinux.org/mailman/listinfo/plug


Re: [PLUG] Recommended Disk Test

2015-01-15 Thread Dick Steffens
On 01/15/2015 03:50 PM, Brian Martin wrote:
> On 01/15/2015 03:46 PM, Dick Steffens wrote:
>> === START OF READ SMART DATA SECTION ===
>> SMART Self-test log structure revision number 1
>> Num  Test_DescriptionStatus  Remaining
>> LifeTime(hours)  LBA_of_first_error
>> # 1  Vendor (0x50)   Completed without error   00% 0 -
>>
> That test hasn't completed yet.  The one entry in the test report was
> taken when the disk had 0 hours of power-up time (e.g. at the factory).
> You should get another one that matches the current power-up time
> when it's done.

So much longer than wait time plus 50%, then. It said:

Test will complete after Thu Jan 15 15:07:29 2015

So I waited until after 3:30. I'll run it again and look at it later 
this evening.

-- 
Regards,

Dick Steffens

___
PLUG mailing list
PLUG@lists.pdxlinux.org
http://lists.pdxlinux.org/mailman/listinfo/plug


Re: [PLUG] Recommended Disk Test

2015-01-15 Thread Russell Johnson

> On Jan 15, 2015, at 15:23, Dick Steffens  wrote:
> 
> I used to run top over an ssh connection for days at a time. When the 
> machine froze there wasn't anything useful I could see other than that 
> top was also frozen.


In that case, you should be running something remotely that displays the date 
and time. Then you might be able to go back to the logs and find something 
useful. 

Of course, if this is old news… In the words of Roseanne Roseannadana…

NEVERMIND!

—

Russell Johnson
r...@dimstar.net

___
PLUG mailing list
PLUG@lists.pdxlinux.org
http://lists.pdxlinux.org/mailman/listinfo/plug


Re: [PLUG] Recommended Disk Test

2015-01-15 Thread Dick Steffens
On 01/15/2015 03:50 PM, Brian Martin wrote:
> On 01/15/2015 03:46 PM, Dick Steffens wrote:
>> === START OF READ SMART DATA SECTION ===
>> SMART Self-test log structure revision number 1
>> Num  Test_DescriptionStatus  Remaining
>> LifeTime(hours)  LBA_of_first_error
>> # 1  Vendor (0x50)   Completed without error   00% 0 -
>>
> That test hasn't completed yet.  The one entry in the test report was
> taken when the disk had 0 hours of power-up time (e.g. at the factory).
> You should get another one that matches the current power-up time
> when it's done.

The first time I ran the test I observed the instruction "Use smartctl 
-X to abort test." I did that. Maybe I wasn't supposed to. So, this 
time, instead of aborting the test, I ran the command you instructed me 
to use in your first e-mail. I got the same results. I started the test 
shortly after 2:00 PM, and ran the -l selftest at about 9:25 PM.


rsteff@ThinkCentre-M58p:~$ sudo smartctl -t offline /dev/sda
[sudo] password for rsteff:
smartctl 5.41 2011-06-09 r3365 [x86_64-linux-3.5.0-18-generic] (local build)
Copyright (C) 2002-11 by Bruce Allen, http://smartmontools.sourceforge.net

=== START OF OFFLINE IMMEDIATE AND SELF-TEST SECTION ===
Sending command: "Execute SMART off-line routine immediately in off-line 
mode".
Drive command "Execute SMART off-line routine immediately in off-line 
mode" successful.
Testing has begun.
Please wait 625 seconds for test to complete.
Test will complete after Thu Jan 15 16:14:26 2015

Use smartctl -X to abort test.
rsteff@ThinkCentre-M58p:~$ sudo smartctl -l selftest /dev/sda
[sudo] password for rsteff:
smartctl 5.41 2011-06-09 r3365 [x86_64-linux-3.5.0-18-generic] (local build)
Copyright (C) 2002-11 by Bruce Allen, http://smartmontools.sourceforge.net

=== START OF READ SMART DATA SECTION ===
SMART Self-test log structure revision number 1
Num  Test_DescriptionStatus  Remaining 
LifeTime(hours)  LBA_of_first_error
# 1  Vendor (0x50)   Completed without error   00% 0 -



-- 
Regards,

Dick Steffens

___
PLUG mailing list
PLUG@lists.pdxlinux.org
http://lists.pdxlinux.org/mailman/listinfo/plug


Re: [PLUG] Recommended Disk Test

2015-01-15 Thread Brian Martin
> The first time I ran the test I observed the instruction "Use smartctl
> -X to abort test." I did that. Maybe I wasn't supposed to. So, this
> time, instead of aborting the test, I ran the command you instructed me
> to use in your first e-mail. I got the same results. I started the test
> shortly after 2:00 PM, and ran the -l selftest at about 9:25 PM.

My mistake.  From the smartctl man page regarding "-l offline":

"The effects of this test are visible only in that it updates the SMART 
Attribute values, and  if  errors  are found they will appear in the 
SMART error log, visible with the ´-l error´ option."

If you didn't get any errors (and I see from your earlier e-mail that 
you didn't have any before), then the test ran clean.  You can also use 
"smartctl -t long /dev/sda" -- I haven't been able to really determine 
which is the more thorough test.  This one does log to self-test long.

And yes, the "-X" option is really to abort a running test, as in "I 
didn't mean to run that", not for normal termination.

 -B.


-- 

Brian P. Martin, Chief Consultant
Martin Consulting Services, Inc.
Phone: 503-617-4500
E-mail: br...@martinconsulting.com
___
PLUG mailing list
PLUG@lists.pdxlinux.org
http://lists.pdxlinux.org/mailman/listinfo/plug