Bzzzz wrote:
> Bob Proulx wrote:
> > > Erasing error output just doesn't erase the cause,
> > > and the cause might be very dangerous to the system's
> > > health...
> > 
> > Erasing the error output?  Why are you erasing error output?  I
> > never suggested any such thing.
> 
> So you're following attentively the output of the automatic
> fsck, excellent...

Sorry.  I don't follow your line of discussion.

> > > This also means more frequent FS checks ("I'm waiting
> > > for hours fsck to complete" IS NOT a good excuse).
> > 
> > Huh?  What?  Huh?  What are you talking about?  I suggested setting
> > FSCKFIX=yes and that most certainly has nothing to do with long
> > fsck check times nor with more frequent checks.  Why did you
> > suggest that?
> 
> Because some FS errors have an incremental pattern; thus,
> more frequent fsck gives the user a much better chance to
> act before complete brakdown.

I completely agree with this.  It is one reason for ext file systems
to implement an fsck check interval after a certain number of months
or a certain number of mounts.  By default the check interval time is
usually 6 months.  That is only triggered after a subsequent reboot
however.  But security upgrades to the kernel usually come through
more often than this so eventually the system will get an fsck check.

A proactive admin should be aware of these things and schedule
appropriate preventative maintenance.

I have mixed feeling about the mount counts interval however.  The
mount counts are usually randomized somewhat across multiple file
systems so that they don't all execute at once after a series of
reboots that triggers the check but that only seems partially
effective.  It really requires that a system be rebooted regularly and
very often in order for that to be effective.  And I don't think just
because a system has been mounted and unmounted perhaps in a very
short period of time (say 30 times in one day) to be an indication of
likely need.

> > > Now, if you think your way's the best, keep on going to the
> > > bottom of it, and just replace fsck with an empty script that
> > > always returns 1.
> > 
> > That is a classic "straw man" fallacy.
> > 
> >   https://en.wikipedia.org/wiki/Straw_man
> 
> In the facts, your way is almost the same as avoiding completely
> a fsck (the first reason I gave).

Uhm...  No.  It isn't.  "My way" is setting FSCKFIX=yes.  Please read
the documentation for it.

  $ man rcS
  ...
  FSCKFIX
    When  the  root  and all other file systems are checked, fsck is
    invoked with the -a option which means "autorepair".   If  there
    are  major  inconsistencies then the fsck process will bail out.
    The system will print a  message  asking  the  administrator  to
    repair  the  file  system manually and will present a root shell
    prompt (actually a sulogin prompt) on the console.  Setting this
    option  to  yes  causes  the fsck commands to be run with the -y
    option instead of the -a option.  This will tell fsck always  to
    repair the file systems without asking for permission.

  $ man fsck.ext4
  ...
  -a     This  option  does  the same thing as the -p option.  It is pro-
         vided for backwards compatibility only;  it  is  suggested  that
         people use -p option whenever possible.
  ...
  -p     Automatically  repair  ("preen")  the  file system.  This option
         will cause e2fsck to automatically fix any  filesystem  problems
         that  can be safely fixed without human intervention.  If e2fsck
         discovers a problem which may require the  system  administrator
         to  take  additional  corrective  action,  e2fsck  will  print a
         description of the problem and then exit with the value 4  logi-
         cally  or'ed  into  the exit code.  (See the EXIT CODE section.)
         This option is normally used by the system's boot  scripts.   It
         may not be specified at the same time as the -n or -y options.
  ...
  -y     Assume an answer of `yes' to all questions; allows e2fsck to  be
         used non-interactively.  This option may not be specified at the
         same time as the -n or -p options.

The default is 'fsck -p' which will bail out and allow the
administrator to manually repair the filesystem.

My previous point was this.  Is there anyone reading this mailing list
that would be expert enough in the file system in order to manually
repair it with a file system debugger (it has been ages since I looked
at fsdb for UFS) and to do other than answer yes to any of the
interactive fsck questions?

If the answer is yes then that is awesome!  I would love it if they
would write a tutorial for others such as myself to be able to learn
this knowledge and capability.  But if everyone is simply going to
answer yes to each interactive fsck questions then they might as well
supply -y on the fsck command line.  The result is exactly the same.

> > Somehow you have mutated my suggestion of fixing the problem with
> > ignoring the problem.  Ignoring is very, very bad.  Why would you
> > even suggest ignoring the problem?  I know I didn't suggest
> > ignoring the problem.
> 
> But this is what you (1/2) make!
> And don't tell me you're scrutinizing very carefully something
> (the fsck output) you're very used to see, nobody does (nor can).
> 
> This is the bottom of the problem: the same as when you find
> the dirty linen in your way in the stairs, you're careful once,
> then you integrate it... until you're wife bawls you out for
> doing so. Except that in this scenario, the HD's full failure
> have 90% chances to be the result.
>  
> Hence my proposition to get rid of fsck: you'll save time, never
> your HDz.

I read through this several times.  It rather leaves me speechless for
a reply to it.  So I will simply note that I read it and move on.

Bob

Attachment: signature.asc
Description: Digital signature

Reply via email to