Hi again,

On 20/08/2005 5:34 a.m., Andrew Morton wrote:
Reuben Farrelly <[EMAIL PROTECTED]> wrote:

A few new problems cropped up with this kernel..

1. NFS seems to be unstable, oopsing when shutting down:

--- devel/fs/nfsd/nfssvc.c~ingo-nfs-stuff-fix   2005-08-19 10:29:15.000000000 
-0700
+++ devel-akpm/fs/nfsd/nfssvc.c 2005-08-19 10:30:03.000000000 -0700
@@ -286,7 +286,6 @@ out:
        /* Release the thread */
        svc_exit_thread(rqstp);
- unlock_kernel();
        /* Release module */
        unlock_kernel();
        module_put_and_exit(0);
_

That fixed it, thanks.


Aug 20 12:26:10 tornado kernel: Device  not ready.

2. That message on the third line of the trace above: "kernel: Device not ready." is being logged every few mins or so, I believe it is my SCSI CDROM that is causing it. It also logs something similar after the SCSI driver has probed the device on boot:

Aug 20 12:24:36 tornado kernel: scsi0 : Adaptec AIC7XXX EISA/VLB/PCI SCSI HBA DRIVER, Rev 7.0
Aug 20 12:24:36 tornado kernel:         <Adaptec 2940 Ultra SCSI adapter>
Aug 20 12:24:36 tornado kernel: aic7880: Ultra Wide Channel A, SCSI Id=7, 16/253 SCBs
Aug 20 12:24:36 tornado kernel:
Aug 20 12:24:36 tornado kernel: Vendor: SONY Model: CD-RW CRX145S Rev: 1.0b Aug 20 12:24:36 tornado kernel: Type: CD-ROM ANSI SCSI revision: 04
Aug 20 12:24:36 tornado kernel:  target0:0:6: Beginning Domain Validation
Aug 20 12:24:36 tornado kernel: target0:0:6: Domain Validation skipping write tests Aug 20 12:24:36 tornado kernel: target0:0:6: FAST-10 SCSI 10.0 MB/s ST (100 ns, offset 15)
Aug 20 12:24:36 tornado kernel:  target0:0:6: Ending Domain Validation
Aug 20 12:24:36 tornado kernel: Device  not ready.

This has been a problem for quite a few weeks now, albeit I believe, only a cosmetic one.

Is some application trying to poll the device?

I wonder if hald knows something about this and is polling.. however that message above about "Device not ready" occurs when the kernel is booting, before any userspace stuff has started up. Maybe hald is just being a bit aggressive in re-probing the drive after userspace launches. B all accounts after a week of uptime the drive certainly ought to be ready, it seems to work ok ;-)

Note the extra space after 'Device' and 'not' which implies possibly some text is missing (which would have made it more clear which device is not exactly ready). The case sensitive strings "Device" and "not ready" appears together in scsi_lib.c and very few other places.

Is the device actually "not ready", or is it in reality ready and working? ie: what happens if you stick a CD in it?

The CD can be read, and the error messages go away. They stay away even after the CD has been ejected.

4. PAM is complaining about "PAM audit_open() failed: Protocol not suppor
ted" and I can't log in as any user including root. I would have picked this was a userspace problem, but it doesn't break with -rc5-mm1, yet reproduceably breaks with -rc6-mm1. Weird.

hm.  How come you're able to use the machine then?

Machine was booting up ok, and things were being written to syslog. Rebooted into -rc5-mm1 to investigate, and of course could boot into rc6-mm1 in single user mode, test and bring services up one by one from there. Having two boxes helped too.

Is it possible to get an strace of this failure somehow?

Not sure if this is needed anymore, as I found that the problem goes away when I compile in kernel auditing. This not required for -rc5-mm1. Is that change intended?

reuben
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/

Reply via email to