Arno Lehmann <[EMAIL PROTECTED]> writes:
> 13.11.2008 12:05, Ronald Buder wrote:
>> Due to a server failure the nfs shares are not available anymore. I 
>> would like to see some sort of a timeout at least if that is at all 
>> possible.
>
> That's not possible inside Bacula - the FD simply can't terminate
> file system accesses that are stalled due to NFS problems.

if the filesystem is mounted with "intr" (and you should always mount
NFS with "intr"), the FD *can* set an alarm on itself and recover.
not sure what the recovery behaviour should be -- simply aborting the
whole job would be in improvement (the timeout value should be user
configurable of course), but it could also stop reading more files in
the current filesystem and go on with the job.  Bacula knows when it
steps into a new filesystem, so it can be taught how to jump out of
that recursion.

slightly related, Bacula could try to send an NFS ping to the NFS
server before recursing into the filesystem.  this means Bacula will
not waste time if the server was down already, but I'm not sure it's
worthwhile to complicate the code with NFS specific code.  if you're
interested, you can look at the code in a utility on my web page:

  http://heim.ifi.uio.no/kjetilho/hacks/#cknfs

> The best thing to do is often a restart of the NFS server.

yes, life is better when the NFS server is up :-)

-- 
regards,          | Redpill  _
Kjetil T. Homme   | Linpro  (_)


-------------------------------------------------------------------------
This SF.Net email is sponsored by the Moblin Your Move Developer's challenge
Build the coolest Linux based applications with Moblin SDK & win great prizes
Grand prize is a trip for two to an Open Source event anywhere in the world
http://moblin-contest.org/redirect.php?banner_id=100&url=/
_______________________________________________
Bacula-users mailing list
Bacula-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bacula-users

Reply via email to