from:"M.H.VanLeeuwen"

2.6.13 kernel OOPS

2005-08-29 Thread M.H.VanLeeuwen

Hi,

Is this a known problem?

Thanks,
Martin


cat /proc/sys/net/ipv4/conf/lo/rp_filter

 <1>Unable to handle kernel paging request at virtual address 419a91d8
 printing eip:
c0116644
*pde = 
Oops:  [#6]
Modules linked in:
CPU:0
EIP:0060:[]Not tainted VLI
EFLAGS: 00010246   (2.6.13)
EIP is at do_proc_dointvec_conv+0x14/0x40
eax: c1b40f28   ebx:    ecx: 419a91d8   edx: c1b40f24
esi: 1000   edi: 0001   ebp: 0804d008   esp: c1b40eec
ds: 007b   es: 007b   ss: 0068
Process cat (pid: 503, threadinfo=c1b4 task=c22245d0)
Stack: c0116731    419a91d8 0001  c1b40fbc
   c22245d0 c1b656a4   00030002 c1b40f0b c2242b84 c1b656a4
   c1e12420 0804d008  c12910e0 c01169a5 0804d008 c1b40f64 c1b40fa4
Call Trace:
 [] do_proc_dointvec+0xc1/0x320
 [] proc_dointvec+0x15/0x20
 [] do_proc_dointvec_conv+0x0/0x40
 [] do_rw_proc+0x6e/0x80
 [] proc_readsys+0x0/0x20
 [] proc_readsys+0x10/0x20
 [] vfs_read+0x7e/0x140
 [] sys_read+0x3c/0x70
 [] syscall_call+0x7/0xb
Code: 00 00 83 c4 0c 89 c8 5b 5e 5f 5d c3 8d 74 26 00 8d bc 27 00 00 00 00 83 
7c 24 04 00 74 0d 8b 00 85 c0 75 18 8b 02 89 01 31 c0 c3 <8b> 09 85 c9 78 16 c7 
00 00 00 00 00 31 c0 89 0a c3 8b 02 f7 d8


bash-2.05$ /bld/linux-2.6.13/scripts/ver_linux
If some fields are empty or look unusual you may have an old version.
Compare to the current minimal requirements in Documentation/Changes.

Linux shadow 2.6.12 #1 SMP Sat Jun 18 09:36:33 CDT 2005 i686 unknown unknown 
GNU/Linux

Gnu C  4.0.0
Gnu make   3.80
binutils   2.15
util-linux 2.12q
mount  2.12q
module-init-tools  3.0-pre2
e2fsprogs  1.35
reiserfsprogs  reiserfsck:
reiser4progs   fsck.reiser4:
pcmcia-cs  3.2.8
nfs-utils  0.1.5
Linux C Library2.3.4
Dynamic linker (ldd)   2.3.4
Linux C++ Library  6.0.4
Procps 3.2.5
Net-tools  1.60
Kbd1.12
Sh-utils   5.2.1
udev   048
Modules Loaded
bash-2.05$
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/

Re: Does kernel require IDE enabled in BIOS to access HD, FS errors?

2001-07-07 Thread M.H.VanLeeuwen

Jim,

Thanks for the info, comments interleaved below

Thanks
Martin

Jim Roland wrote:
> 
> Activating an IDE drive in an older BIOS (newer ones have a SCSI option in
> the "A/C/CDROM" options) will always force an IDE drive boot with older
> BIOSes.  Older BIOSes are written to stop looking for a boot device once it
> has found one, and it's own IDE is where it says "Ok, I have boot
> capability", otherwise no IDE drive, means it passes boot control to other
> system BIOSes (like your SCSI or NIC cards).  This is by default with older
> systems.

This behavior is acceptable, as I can as you suggest move the kernel to the
IDE and tell it the root device is /dev/md0.  Actually that is what I did
to a floppy to get access to the system while the BIOS was set to 
detect the IDE drives.   Since I only boot whenever Linus puts out a new
kernel that isn't so much of a concern.

> 
> I expect someone will rebut my comments about the kernel (which is fine, I'm
> not a Kernel hacker--PROPERLY USED TERM HERE (not the media's term) ),
> but it is my understanding that the kernel uses your system BIOS for actual
> reads/writes at the hardware level, this way it does not have to account for
> EVERY possible BIOS out there.  (Other OSes use BIOS system calls for this
> purpose as well)  When you turn BIOS to  the OS does what it can, but
> the BIOS in your system *SHOULD* refuse to process the call, instead it's
> doing the read/writes, but not the same way as if IDE was turned on.

But, that's kind of the point I'm driving at if the OS can't correctly access the
IDE interface it shouldn't do it at all.

My guess is that the CMD64X driver is broken since the system can write data
to the HD but with random corruption. Ditto for reading data.  Likewise,
why would I see an increase in write performance when "dd if=/dev/zero of=/dev/null"
is running simultaneously on another console.

> Are you getting IDE corruption with the BIOS set to  for your IDE
> drive?

None whatsoever.
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/

Re: Does kernel require IDE enabled in BIOS to access HD, FS errors?

2001-07-07 Thread M.H.VanLeeuwen


Jim,

Thanks for the info, comments interleaved below

Thanks
Martin

Jim Roland wrote:
 
 Activating an IDE drive in an older BIOS (newer ones have a SCSI option in
 the A/C/CDROM options) will always force an IDE drive boot with older
 BIOSes.  Older BIOSes are written to stop looking for a boot device once it
 has found one, and it's own IDE is where it says Ok, I have boot
 capability, otherwise no IDE drive, means it passes boot control to other
 system BIOSes (like your SCSI or NIC cards).  This is by default with older
 systems.

This behavior is acceptable, as I can as you suggest move the kernel to the
IDE and tell it the root device is /dev/md0.  Actually that is what I did
to a floppy to get access to the system while the BIOS was set to auto
detect the IDE drives.   Since I only boot whenever Linus puts out a new
kernel that isn't so much of a concern.

 
 I expect someone will rebut my comments about the kernel (which is fine, I'm
 not a Kernel hacker--PROPERLY USED TERM HERE (not the media's term) grin),
 but it is my understanding that the kernel uses your system BIOS for actual
 reads/writes at the hardware level, this way it does not have to account for
 EVERY possible BIOS out there.  (Other OSes use BIOS system calls for this
 purpose as well)  When you turn BIOS to NONE the OS does what it can, but
 the BIOS in your system *SHOULD* refuse to process the call, instead it's
 doing the read/writes, but not the same way as if IDE was turned on.

But, that's kind of the point I'm driving at if the OS can't correctly access the
IDE interface it shouldn't do it at all.

My guess is that the CMD64X driver is broken since the system can write data
to the HD but with random corruption. Ditto for reading data.  Likewise,
why would I see an increase in write performance when dd if=/dev/zero of=/dev/null
is running simultaneously on another console.

 Are you getting IDE corruption with the BIOS set to AUTO for your IDE
 drive?

None whatsoever.
-
To unsubscribe from this list: send the line unsubscribe linux-kernel in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/

Does kernel require IDE enabled in BIOS to access HD, FS errors?

2001-07-06 Thread M.H.VanLeeuwen


Hi,

I have a SMP P166 system that has been running for years with an AIC7xxx SCSI card as
opposed to the native IDE interface.  The BIOS has the IDE 0,1,2,3 set to .
Running out of disk space I installed one of the original IDE drives. The kernel
booted and ID'd the drive correctly.  Kernel version 2.4.5/6 behave the same.

Uniform Multi-Platform E-IDE driver Revision: 6.31
ide: Assuming 33MHz system bus speed for PIO modes; override with idebus=xx
CMD646: IDE controller on PCI bus 00 dev 10
CMD646: chipset revision 1
CMD646: not 100% native mode: will probe irqs later
CMD646: chipset revision 0x01, MultiWord DMA Limited, IRQ workaround enabled
CMD646: simplex device:  DMA disabled
ide0: CMD646 Bus-Master DMA disabled (BIOS)
CMD646: simplex device:  DMA disabled
ide1: CMD646 Bus-Master DMA disabled (BIOS)
hdb: CD-ROM CDU76E, ATAPI CD/DVD-ROM drive
ide: Assuming 33MHz system bus speed for PIO modes; override with idebus=xx
hdc: WDC AC2850F, ATA DISK drive
ide0 at 0x1f0-0x1f7,0x3f6 on irq 14
ide1 at 0x170-0x177,0x376 on irq 15
hdc: 1667232 sectors (854 MB) w/64KiB Cache, CHS=1654/16/63
hdb: packet command error: status=0x51 { DriveReady SeekComplete Error }
hdb: packet command error: error=0x44
hdb: ATAPI 4X CD-ROM drive, 256kB Cache
Uniform CD-ROM driver Revision: 3.12
Partition check:
 hdc: [PTBL] [827/32/63] hdc1

However I can't boot from the SCSI drives if the IDE HD is enabled due to deficiencies
in the BIOS... boot "A: then C:" or "C: then A:" are the only choices, if neither are
present the system boots from the SCSI card, otherwise it fails to boot.

PROBLEM: cannot reliably R/W to the HD unless the BIOS is set to  recognize.
I consistently see MD5SUM errors and FS corruption and other strange FS symptoms 
when the BIOS is set to  for the drive and _never_ see any errors with the
setting set to .  There are no messages emitted by the kernel that there
were any system errors encountered leading one to believe that all is well, when
it isn't.

What is interesting, is that the I/O writes increase from once every 14 seconds to
once every 2-3 seconds and the FS corruption diminishes but don't disappear
if a background "dd if=/dev/zero of=/dev/null" is running.

Is this expected kernel behavior?

VMSTAT follow... when copying files from SCSI drives to IDE drive.

More info available if needed...

Thanks,
Martin

The waiting processes are kupdated and bdflush. (I have Alt-SysRq- trace of them)

VMSTAT 1 for the case w/ BIOS set to  looks like (w/o dd running):

   procs  memoryswap  io system cpu
 r  b  w   swpd   free   buff  cache  si  sobibo   incs  us  sy  id
 0  0  0  0  83180   1056  39800   0   0   261 3   8343   5   8  87
 0  0  0  0  83176   1056  39800   0   0 0 0  11920   3   2  95
 0  0  0  0  83176   1056  39800   0   0 0 0  11620   2   2  96
 0  1  0  0  83012   1096  39812   0   0   329 0  196   183   3   8  89
 0  1  0  0  81268   1128  41444   0   0  1021 0  309   275   3  17  80
 0  1  0  0  74464   1200  47716   0   0  313127  292   264   6  25  69
 2  0  0  0  67772   1276  53632   0   0  2962 0  397   245  12  25  63
 2  0  0  0  64016   1324  56900   0   0  1602 0  414   155  48  27  26
 1  1  0  0  53924   1372  66608   0   0  4960 0  200   165  14  40  46
 1  0  0  0  42260   1448  77556   0   0  5493 0  232   210   3  35  62
 0  1  0  0  30276   1480  88748   0   0  5616 0  201   124   4  34  62
 2  0  0  0  22580   1496  96044   0   0  3671  2868  30796   2  33  65
 0  1  0  0  12392   1528 105492   0   0  4771  4852  276   164   5  37  58
 1  0  0  0   3056   1560 114232   0   0  4641  4861  328   200   2  43  54
 1  1  0  0   3056   1588 114192   0   0  5011  4744  281   139   5  39  57
 1  0  1  0   3056   1612 114168   0   0  5269  1728  256   115   4  35  60
 0  1  1  0   3056   1680 114084   0   0  4827 0  271   193   2  33  64
 1  0  1  0   3056   1708 114056   0   0  5268 0  236   106   3  38  59
 2  0  1  0   3056   1748 113864   0   0  3817  3968  315   132   7  44  49
 2  0  1  0   3056   1760 113604   0   0  2955 0  34863  41  50   9
 1  0  1  0   3056   1788 113940   0   0  4258 0  24797  41  46  13
 1  0  1  0   3056   1844 113880   0   0  4246 0  281   168   4  36  60
 0  1  1  0   3064   1856 113868   0   0  2955 0  20969   3  19  78
 0  1  2  0   3064   1856 113868   0   0 0 0  14927   2   3  95
 0  1  2  0   3064   1856 113868   0   0 0 0  14719   2   3  95
 0  1  2  0   3064   1856 113868   0   0 0 0  15218   2   3  95
 0  1  2  0   3064   1856 113868   0   0 0 0  14716   2   4  94
 0  1  2  0   3064   1856 113868   0   0 0 0  14716   1   4  94
 0  1  1  0   3060   1856 113868   0   0 0  3613  15628   1   5  94
 0  1  1

Does kernel require IDE enabled in BIOS to access HD, FS errors?

2001-07-06 Thread M.H.VanLeeuwen


Hi,

I have a SMP P166 system that has been running for years with an AIC7xxx SCSI card as
opposed to the native IDE interface.  The BIOS has the IDE 0,1,2,3 set to NONE.
Running out of disk space I installed one of the original IDE drives. The kernel
booted and ID'd the drive correctly.  Kernel version 2.4.5/6 behave the same.

Uniform Multi-Platform E-IDE driver Revision: 6.31
ide: Assuming 33MHz system bus speed for PIO modes; override with idebus=xx
CMD646: IDE controller on PCI bus 00 dev 10
CMD646: chipset revision 1
CMD646: not 100% native mode: will probe irqs later
CMD646: chipset revision 0x01, MultiWord DMA Limited, IRQ workaround enabled
CMD646: simplex device:  DMA disabled
ide0: CMD646 Bus-Master DMA disabled (BIOS)
CMD646: simplex device:  DMA disabled
ide1: CMD646 Bus-Master DMA disabled (BIOS)
hdb: CD-ROM CDU76E, ATAPI CD/DVD-ROM drive
ide: Assuming 33MHz system bus speed for PIO modes; override with idebus=xx
hdc: WDC AC2850F, ATA DISK drive
ide0 at 0x1f0-0x1f7,0x3f6 on irq 14
ide1 at 0x170-0x177,0x376 on irq 15
hdc: 1667232 sectors (854 MB) w/64KiB Cache, CHS=1654/16/63
hdb: packet command error: status=0x51 { DriveReady SeekComplete Error }
hdb: packet command error: error=0x44
hdb: ATAPI 4X CD-ROM drive, 256kB Cache
Uniform CD-ROM driver Revision: 3.12
Partition check:
 hdc: [PTBL] [827/32/63] hdc1

However I can't boot from the SCSI drives if the IDE HD is enabled due to deficiencies
in the BIOS... boot A: then C: or C: then A: are the only choices, if neither are
present the system boots from the SCSI card, otherwise it fails to boot.

PROBLEM: cannot reliably R/W to the HD unless the BIOS is set to auto recognize.
I consistently see MD5SUM errors and FS corruption and other strange FS symptoms 
when the BIOS is set to NONE for the drive and _never_ see any errors with the
setting set to AUTO.  There are no messages emitted by the kernel that there
were any system errors encountered leading one to believe that all is well, when
it isn't.

What is interesting, is that the I/O writes increase from once every 14 seconds to
once every 2-3 seconds and the FS corruption diminishes but don't disappear
if a background dd if=/dev/zero of=/dev/null is running.

Is this expected kernel behavior?

VMSTAT follow... when copying files from SCSI drives to IDE drive.

More info available if needed...

Thanks,
Martin

The waiting processes are kupdated and bdflush. (I have Alt-SysRq- trace of them)

VMSTAT 1 for the case w/ BIOS set to NONE looks like (w/o dd running):

   procs  memoryswap  io system cpu
 r  b  w   swpd   free   buff  cache  si  sobibo   incs  us  sy  id
 0  0  0  0  83180   1056  39800   0   0   261 3   8343   5   8  87
 0  0  0  0  83176   1056  39800   0   0 0 0  11920   3   2  95
 0  0  0  0  83176   1056  39800   0   0 0 0  11620   2   2  96
 0  1  0  0  83012   1096  39812   0   0   329 0  196   183   3   8  89
 0  1  0  0  81268   1128  41444   0   0  1021 0  309   275   3  17  80
 0  1  0  0  74464   1200  47716   0   0  313127  292   264   6  25  69
 2  0  0  0  67772   1276  53632   0   0  2962 0  397   245  12  25  63
 2  0  0  0  64016   1324  56900   0   0  1602 0  414   155  48  27  26
 1  1  0  0  53924   1372  66608   0   0  4960 0  200   165  14  40  46
 1  0  0  0  42260   1448  77556   0   0  5493 0  232   210   3  35  62
 0  1  0  0  30276   1480  88748   0   0  5616 0  201   124   4  34  62
 2  0  0  0  22580   1496  96044   0   0  3671  2868  30796   2  33  65
 0  1  0  0  12392   1528 105492   0   0  4771  4852  276   164   5  37  58
 1  0  0  0   3056   1560 114232   0   0  4641  4861  328   200   2  43  54
 1  1  0  0   3056   1588 114192   0   0  5011  4744  281   139   5  39  57
 1  0  1  0   3056   1612 114168   0   0  5269  1728  256   115   4  35  60
 0  1  1  0   3056   1680 114084   0   0  4827 0  271   193   2  33  64
 1  0  1  0   3056   1708 114056   0   0  5268 0  236   106   3  38  59
 2  0  1  0   3056   1748 113864   0   0  3817  3968  315   132   7  44  49
 2  0  1  0   3056   1760 113604   0   0  2955 0  34863  41  50   9
 1  0  1  0   3056   1788 113940   0   0  4258 0  24797  41  46  13
 1  0  1  0   3056   1844 113880   0   0  4246 0  281   168   4  36  60
 0  1  1  0   3064   1856 113868   0   0  2955 0  20969   3  19  78
 0  1  2  0   3064   1856 113868   0   0 0 0  14927   2   3  95
 0  1  2  0   3064   1856 113868   0   0 0 0  14719   2   3  95
 0  1  2  0   3064   1856 113868   0   0 0 0  15218   2   3  95
 0  1  2  0   3064   1856 113868   0   0 0 0  14716   2   4  94
 0  1  2  0   3064   1856 113868   0   0 0 0  14716   1   4  94
 0  1  1  0   3060   1856 113868   0   0 0  3613  15628   1   5

Re: Delay in authentication.

2001-01-08 Thread M.H.VanLeeuwen


Chris,

I reported the same thing on 11/19/00, whether this is a feature or bug for
2.4.X was not determined.  Was this behavior intentionally changed and why?

Looks like 2.2.X gives ECONNREFUSED, but 2.4.X doesn't and times out.

http://www.mail-archive.com/linux-kernel@vger.kernel.org/msg13983.html text below.

Martin




   I had occasion to "telinit 1" today and found that it took a long time
   to login after root passwd was entered.  this doesn't happen with 2.2.X
   kernels.

   Is this to be expected with the 2.4 series kernels? or a bug?

   Martin

   strace for 2.4.0-test11-pre7

   ---snip---
   gettimeofday({974665658, 952483}, NULL) = 0
   socket(PF_INET, SOCK_DGRAM, IPPROTO_UDP) = 3
   getpid()= 305
   bind(3, {sin_family=AF_INET, sin_port=htons(905), 
sin_addr=inet_addr("0.0.0.0")}}, 16) 
   = 0
   ioctl(3, FIONBIO, [1])  = 0
   sendto(3, "\31\23\233@\0\0\0\0\0\0\0\2\0\1\206\240\0\0\0\2\0\0\0\3"..., 56, 0, 
   {sin_family=AF_INET, sin_port=htons(111),
   sin_addr=inet_addr("127.0.0.1")}}, 16) = 56
   poll([{fd=3, events=POLLIN}], 1, 5000)  = 0
   ioctl(3, SIOCGIFCONF, 0xbfffb33c)   = 0
   ioctl(3, SIOCGIFFLAGS, 0xbfffb344)  = 0
   sendto(3, "\31\23\233@\0\0\0\0\0\0\0\2\0\1\206\240\0\0\0\2\0\0\0\3"..., 56, 0, 
   {sin_family=AF_INET, sin_port=htons(111),
   sin_addr=inet_addr("127.0.0.1")}}, 16) = 56 
   ---snip---

   strace for 2.2.17

   ---snip---
   gettimeofday({974664928, 735539}, NULL) = 0
   socket(PF_INET, SOCK_DGRAM, IPPROTO_UDP) = 3
   getpid()= 368
   bind(3, {sin_family=AF_INET, sin_port=htons(968), 
sin_addr=inet_addr("0.0.0.0")}}, 16) 
   = 0
   ioctl(3, FIONBIO, [1])  = 0
   sendto(3, "_c\353\331\0\0\0\0\0\0\0\2\0\1\206\240\0\0\0\2\0\0\0\3"..., 56, 0, 
   {sin_family=AF_INET, sin_port=htons(111),
   sin_addr=inet_addr("127.0.0.1")}}, 16) = 56
   poll([{fd=3, events=POLLIN, revents=POLLERR}], 1, 5000) = 1
   recvfrom(3, 0x8056380, 400, 0, 0xbfffd66c, 0xbfffd618) = -1 ECONNREFUSED 
(Connection 
   refused)
   close(3)= 0  
   ---snip---

-

Chris Meadors wrote:
> 
> On Mon, 8 Jan 2001, David S. Miller wrote:
> 
> > This definitely seems like the classic "/etc/nsswitch.conf is told to
> > look for YP servers and you are not using YP", so have a look and fix
> > nsswitch.conf if this is in fact the problem.
> 
> What I have never gotten, is why on my machines (no specific distro, just
> everything built from source and installed by me) login takes a long time,
> unless I have portmap running.
> 
> My /etc/nsswitch.conf would seem to be right:
> 
> passwd: files
> group:  files
> shadow: files
> 
> hosts:  files dns
> networks:   files dns
> 
> protocols:  files
> services:   files
> ethers: files
> rpc:files
> 
> netgroup:   files
> 
> What else could effect that?
> 
> -Chris
> --
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
Please read the FAQ at http://www.tux.org/lkml/

Re: Delay in authentication.

2001-01-08 Thread M.H.VanLeeuwen


Chris,

I reported the same thing on 11/19/00, whether this is a feature or bug for
2.4.X was not determined.  Was this behavior intentionally changed and why?

Looks like 2.2.X gives ECONNREFUSED, but 2.4.X doesn't and times out.

http://www.mail-archive.com/linux-kernel@vger.kernel.org/msg13983.html text below.

Martin




   I had occasion to "telinit 1" today and found that it took a long time
   to login after root passwd was entered.  this doesn't happen with 2.2.X
   kernels.

   Is this to be expected with the 2.4 series kernels? or a bug?

   Martin

   strace for 2.4.0-test11-pre7

   ---snip---
   gettimeofday({974665658, 952483}, NULL) = 0
   socket(PF_INET, SOCK_DGRAM, IPPROTO_UDP) = 3
   getpid()= 305
   bind(3, {sin_family=AF_INET, sin_port=htons(905), 
sin_addr=inet_addr("0.0.0.0")}}, 16) 
   = 0
   ioctl(3, FIONBIO, [1])  = 0
   sendto(3, "\31\23\233@\0\0\0\0\0\0\0\2\0\1\206\240\0\0\0\2\0\0\0\3"..., 56, 0, 
   {sin_family=AF_INET, sin_port=htons(111),
   sin_addr=inet_addr("127.0.0.1")}}, 16) = 56
   poll([{fd=3, events=POLLIN}], 1, 5000)  = 0
   ioctl(3, SIOCGIFCONF, 0xbfffb33c)   = 0
   ioctl(3, SIOCGIFFLAGS, 0xbfffb344)  = 0
   sendto(3, "\31\23\233@\0\0\0\0\0\0\0\2\0\1\206\240\0\0\0\2\0\0\0\3"..., 56, 0, 
   {sin_family=AF_INET, sin_port=htons(111),
   sin_addr=inet_addr("127.0.0.1")}}, 16) = 56 
   ---snip---

   strace for 2.2.17

   ---snip---
   gettimeofday({974664928, 735539}, NULL) = 0
   socket(PF_INET, SOCK_DGRAM, IPPROTO_UDP) = 3
   getpid()= 368
   bind(3, {sin_family=AF_INET, sin_port=htons(968), 
sin_addr=inet_addr("0.0.0.0")}}, 16) 
   = 0
   ioctl(3, FIONBIO, [1])  = 0
   sendto(3, "_c\353\331\0\0\0\0\0\0\0\2\0\1\206\240\0\0\0\2\0\0\0\3"..., 56, 0, 
   {sin_family=AF_INET, sin_port=htons(111),
   sin_addr=inet_addr("127.0.0.1")}}, 16) = 56
   poll([{fd=3, events=POLLIN, revents=POLLERR}], 1, 5000) = 1
   recvfrom(3, 0x8056380, 400, 0, 0xbfffd66c, 0xbfffd618) = -1 ECONNREFUSED 
(Connection 
   refused)
   close(3)= 0  
   ---snip---

-

Chris Meadors wrote:
 
 On Mon, 8 Jan 2001, David S. Miller wrote:
 
  This definitely seems like the classic "/etc/nsswitch.conf is told to
  look for YP servers and you are not using YP", so have a look and fix
  nsswitch.conf if this is in fact the problem.
 
 What I have never gotten, is why on my machines (no specific distro, just
 everything built from source and installed by me) login takes a long time,
 unless I have portmap running.
 
 My /etc/nsswitch.conf would seem to be right:
 
 passwd: files
 group:  files
 shadow: files
 
 hosts:  files dns
 networks:   files dns
 
 protocols:  files
 services:   files
 ethers: files
 rpc:files
 
 netgroup:   files
 
 What else could effect that?
 
 -Chris
 --
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
Please read the FAQ at http://www.tux.org/lkml/

Again, kernel BUG at /usr/src/linux/include/linux/nfs_fs.h:167! 2.4.0-prerelease

2001-01-02 Thread M.H.VanLeeuwen


Hi Neil,

I thought I'd seen this same bug on 2.4.0-test12 after I applied your patch
but didn't follow up until now.  sorry.

Anyway, with 2.4.0-prerelease and an updated loop.c.patch, below the ksymoops
output (your patch updated to 2.4.0-prerelease), I got the following BUG reports.

So it appears as if the "obviously correct patch ;-)" doesn't fix this particular
problem, but doesn't seem to break anything obvious.

Later,
Martin

ksymoops 2.3.5 on i686 2.4.0-prerelease.  Options used
 -V (default)
 -k /proc/ksyms (default)
 -l /proc/modules (default)
 -o /lib/modules/2.4.0-prerelease/ (default)
 -m /boot/System.map-2.4.0-prerelease (specified)

activating NMI Watchdog ... done.
cpu: 0, clocks: 680100, slice: 226700
cpu: 1, clocks: 680100, slice: 226700
kernel BUG at /usr/src/linux/include/linux/nfs_fs.h:167!
invalid operand: 
CPU:0
EIP:0010:[]
Using defaults from ksymoops -t elf32-i386 -a i386
EFLAGS: 00010286
eax: 0039   ebx: c9183e80   ecx:    edx: 0002
esi: cb17dcc0   edi: cb1d3d40   ebp: c962e000   esp: c962fd04
ds: 0018   es: 0018   ss: 0018
Process find (pid: 29692, stackpage=c962f000)
Stack: c022fea5 c0230040 00a7 cabe30c0 c89c9860  c12c8294 c015bf1d
   cabe30c0 c89c9860 c12c8294  1000 c89c9860  c12c8294
   cabe30c0 fff4 c015c721 cabe30c0 c89c9860 c12c8294 cabe30c0 c12c8294
Call Trace: [] [] [] [] [] 
[] []
   [] [] [] [] [] [] 
[] []
   [] [] [] [] [] [] 
[]
Code: 0f 0b 83 c4 0c 89 7e 1c eb 26 90 8d 74 26 00 6a 00 8b 54 24

>>EIP; c015cf61<=
Trace; c022fea5 
Trace; c0230040 
Trace; c015bf1d 
Trace; c015c721 
Trace; c0125bf7 
Trace; c018b15c 
Trace; c018b064 
Trace; c018b48b 
Trace; c0181b6f <__make_request+5bb/638>
Trace; c0181d03 
Trace; c0181d6b 
Trace; c0181ef7 
Trace; c0152bea 
Trace; c0151295 
Trace; c0112e28 
Trace; c0123ff4 
Trace; c014133c 
Trace; c014170c 
Trace; c0141867 
Trace; c014170c 
Trace; c0123548 
Trace; c0108fd3 
Code;  c015cf61 
 <_EIP>:
Code;  c015cf61<=
   0:   0f 0b ud2a  <=
Code;  c015cf63 
   2:   83 c4 0c  add$0xc,%esp
Code;  c015cf66 
   5:   89 7e 1c  mov%edi,0x1c(%esi)
Code;  c015cf69 
   8:   eb 26 jmp30 <_EIP+0x30> c015cf91 

Code;  c015cf6b 
   a:   90nop
Code;  c015cf6c 
   b:   8d 74 26 00   lea0x0(%esi,1),%esi
Code;  c015cf70 
   f:   6a 00 push   $0x0
Code;  c015cf72 
  11:   8b 54 24 00   mov0x0(%esp,1),%edx

kernel BUG at /usr/src/linux/include/linux/nfs_fs.h:167!
invalid operand: 
CPU:1
EIP:0010:[]
EFLAGS: 00010286
eax: 0039   ebx: c9183e80   ecx:    edx: 0100
esi: c87c3ce0   edi: cb1d3d40   ebp: c5f38000   esp: c5f39d2c
ds: 0018   es: 0018   ss: 0018
Process find (pid: 29930, stackpage=c5f39000)
Stack: c022fea5 c0230040 00a7 cabe3360 c8ad1040  c115d0d4 c015bf1d
   cabe3360 c8ad1040 c115d0d4  1000 c8ad1040  c115d0d4
   cabe3360 fff4 c015c721 cabe3360 c8ad1040 c115d0d4 cabe3360 c115d0d4
Call Trace: [] [] [] [] [] 
[] []
   [] [] [] [] [] [] 
[] []
   [] [] [] [] [] []
Code: 0f 0b 83 c4 0c 89 7e 1c eb 26 90 8d 74 26 00 6a 00 8b 54 24

>>EIP; c015cf61<=
Trace; c022fea5 
Trace; c0230040 
Trace; c015bf1d 
Trace; c015c721 
Trace; c0125bf7 
Trace; c018b15c 
Trace; c018b064 
Trace; c018b48b 
Trace; c0181b6f <__make_request+5bb/638>
Trace; c0181d03 
Trace; c0181d6b 
Trace; c0181ef7 
Trace; c0151347 
Trace; c0112e28 
Trace; c0123ff4 
Trace; c014133c 
Trace; c014170c 
Trace; c0141867 
Trace; c014170c 
Trace; c0123548 
Trace; c0108fd3 
Code;  c015cf61 
 <_EIP>:
Code;  c015cf61<=
   0:   0f 0b ud2a  <=
Code;  c015cf63 
   2:   83 c4 0c  add$0xc,%esp
Code;  c015cf66 
   5:   89 7e 1c  mov%edi,0x1c(%esi)
Code;  c015cf69 
   8:   eb 26 jmp30 <_EIP+0x30> c015cf91 

Code;  c015cf6b 
   a:   90nop
Code;  c015cf6c 
   b:   8d 74 26 00   lea0x0(%esi,1),%esi
Code;  c015cf70 
   f:   6a 00 push   $0x0
Code;  c015cf72 
  11:   8b 54 24 00   mov0x0(%esp,1),%edx


updated patch for 2.4.0-prerelease
__

--- loop.c.orig Fri Dec 29 16:07:21 2000
+++ loop.c  Tue Jan  2 17:11:38 2001
@@ -441,37 +441,18 @@
if (!aops->prepare_write || !aops->commit_write)
lo->lo_flags |= LO_FLAGS_READ_ONLY;

-   error = get_write_access(inode);
-   if (error)
-   goto out_putf;
 
/* Backed by a regular file - we need to hold onto a file
-  structure for this file.  Friggin' NFS can't live without
-  it on write and for reading we use do_generic_file_read(),
-  so...

Again, kernel BUG at /usr/src/linux/include/linux/nfs_fs.h:167! 2.4.0-prerelease

2001-01-02 Thread M.H.VanLeeuwen


Hi Neil,

I thought I'd seen this same bug on 2.4.0-test12 after I applied your patch
but didn't follow up until now.  sorry.

Anyway, with 2.4.0-prerelease and an updated loop.c.patch, below the ksymoops
output (your patch updated to 2.4.0-prerelease), I got the following BUG reports.

So it appears as if the "obviously correct patch ;-)" doesn't fix this particular
problem, but doesn't seem to break anything obvious.

Later,
Martin

ksymoops 2.3.5 on i686 2.4.0-prerelease.  Options used
 -V (default)
 -k /proc/ksyms (default)
 -l /proc/modules (default)
 -o /lib/modules/2.4.0-prerelease/ (default)
 -m /boot/System.map-2.4.0-prerelease (specified)

activating NMI Watchdog ... done.
cpu: 0, clocks: 680100, slice: 226700
cpu: 1, clocks: 680100, slice: 226700
kernel BUG at /usr/src/linux/include/linux/nfs_fs.h:167!
invalid operand: 
CPU:0
EIP:0010:[c015cf61]
Using defaults from ksymoops -t elf32-i386 -a i386
EFLAGS: 00010286
eax: 0039   ebx: c9183e80   ecx:    edx: 0002
esi: cb17dcc0   edi: cb1d3d40   ebp: c962e000   esp: c962fd04
ds: 0018   es: 0018   ss: 0018
Process find (pid: 29692, stackpage=c962f000)
Stack: c022fea5 c0230040 00a7 cabe30c0 c89c9860  c12c8294 c015bf1d
   cabe30c0 c89c9860 c12c8294  1000 c89c9860  c12c8294
   cabe30c0 fff4 c015c721 cabe30c0 c89c9860 c12c8294 cabe30c0 c12c8294
Call Trace: [c022fea5] [c0230040] [c015bf1d] [c015c721] [c0125bf7] 
[c018b15c] [c018b064]
   [c018b48b] [c0181b6f] [c0181d03] [c0181d6b] [c0181ef7] [c0152bea] 
[c0151295] [c0112e28]
   [c0123ff4] [c014133c] [c014170c] [c0141867] [c014170c] [c0123548] 
[c0108fd3]
Code: 0f 0b 83 c4 0c 89 7e 1c eb 26 90 8d 74 26 00 6a 00 8b 54 24

EIP; c015cf61 nfs_create_request+195/1e4   =
Trace; c022fea5 devfsd_buf_size+1b85/10a8c
Trace; c0230040 devfsd_buf_size+1d20/10a8c
Trace; c015bf1d nfs_readpage_async+13d/1c0
Trace; c015c721 nfs_readpage+c5/12c
Trace; c0125bf7 do_generic_file_read+347/53c
Trace; c018b15c lo_receive+58/68
Trace; c018b064 lo_read_actor+0/a0
Trace; c018b48b do_lo_request+31f/414
Trace; c0181b6f __make_request+5bb/638
Trace; c0181d03 generic_make_request+117/124
Trace; c0181d6b submit_bh+5b/7c
Trace; c0181ef7 ll_rw_block+127/188
Trace; c0152bea ext2_bread+ce/110
Trace; c0151295 ext2_readdir+91/37c
Trace; c0112e28 do_page_fault+0/3e8
Trace; c0123ff4 do_munmap+58/250
Trace; c014133c vfs_readdir+8c/e8
Trace; c014170c filldir64+0/10c
Trace; c0141867 sys_getdents64+4f/b8
Trace; c014170c filldir64+0/10c
Trace; c0123548 sys_brk+b4/d8
Trace; c0108fd3 system_call+33/38
Code;  c015cf61 nfs_create_request+195/1e4
 _EIP:
Code;  c015cf61 nfs_create_request+195/1e4   =
   0:   0f 0b ud2a  =
Code;  c015cf63 nfs_create_request+197/1e4
   2:   83 c4 0c  add$0xc,%esp
Code;  c015cf66 nfs_create_request+19a/1e4
   5:   89 7e 1c  mov%edi,0x1c(%esi)
Code;  c015cf69 nfs_create_request+19d/1e4
   8:   eb 26 jmp30 _EIP+0x30 c015cf91 
nfs_create_request+1c5/1e4
Code;  c015cf6b nfs_create_request+19f/1e4
   a:   90nop
Code;  c015cf6c nfs_create_request+1a0/1e4
   b:   8d 74 26 00   lea0x0(%esi,1),%esi
Code;  c015cf70 nfs_create_request+1a4/1e4
   f:   6a 00 push   $0x0
Code;  c015cf72 nfs_create_request+1a6/1e4
  11:   8b 54 24 00   mov0x0(%esp,1),%edx

kernel BUG at /usr/src/linux/include/linux/nfs_fs.h:167!
invalid operand: 
CPU:1
EIP:0010:[c015cf61]
EFLAGS: 00010286
eax: 0039   ebx: c9183e80   ecx:    edx: 0100
esi: c87c3ce0   edi: cb1d3d40   ebp: c5f38000   esp: c5f39d2c
ds: 0018   es: 0018   ss: 0018
Process find (pid: 29930, stackpage=c5f39000)
Stack: c022fea5 c0230040 00a7 cabe3360 c8ad1040  c115d0d4 c015bf1d
   cabe3360 c8ad1040 c115d0d4  1000 c8ad1040  c115d0d4
   cabe3360 fff4 c015c721 cabe3360 c8ad1040 c115d0d4 cabe3360 c115d0d4
Call Trace: [c022fea5] [c0230040] [c015bf1d] [c015c721] [c0125bf7] 
[c018b15c] [c018b064]
   [c018b48b] [c0181b6f] [c0181d03] [c0181d6b] [c0181ef7] [c0151347] 
[c0112e28] [c0123ff4]
   [c014133c] [c014170c] [c0141867] [c014170c] [c0123548] [c0108fd3]
Code: 0f 0b 83 c4 0c 89 7e 1c eb 26 90 8d 74 26 00 6a 00 8b 54 24

EIP; c015cf61 nfs_create_request+195/1e4   =
Trace; c022fea5 devfsd_buf_size+1b85/10a8c
Trace; c0230040 devfsd_buf_size+1d20/10a8c
Trace; c015bf1d nfs_readpage_async+13d/1c0
Trace; c015c721 nfs_readpage+c5/12c
Trace; c0125bf7 do_generic_file_read+347/53c
Trace; c018b15c lo_receive+58/68
Trace; c018b064 lo_read_actor+0/a0
Trace; c018b48b do_lo_request+31f/414
Trace; c0181b6f __make_request+5bb/638
Trace; c0181d03 generic_make_request+117/124
Trace; c0181d6b submit_bh+5b/7c
Trace; c0181ef7 ll_rw_block+127/188
Trace; c0151347 ext2_readdir+143/37c
Trace; c0112e28 do_page_fault+0/3e8
Trace; c0123ff4 do_munmap+58/250
Trace; c014133c

Re: kernel BUG at /usr/src/linux/include/linux/nfs_fs.h:167! -reproducible

2000-12-19 Thread M.H.VanLeeuwen


Alexander Viro wrote:
> 
> On Mon, 18 Dec 2000, Neil Brown wrote:
> 
> > On Monday December 18, [EMAIL PROTECTED] wrote:
> > > > " " == M H VanLeeuwen <[EMAIL PROTECTED]> writes:
> > >
> > >  > Trond, Neil I don't know if this is a loopback bug or an NFS
> > >  > bug but since nfs_fs.h was implicated so I thought one of you
> > >  > may be interested.
> > >
> > >  > Could you let me know if you know this problem has already been
> > >  > fixed or if you need more info.
> > >
> > > Hi,
> > >
> > > As far as I'm concerned, it's a loopback bug.
> >
> > I read it the same way.
> > Actually, I cannot see the point of copying the "struct file"!  Why
> > not just take a reference to it?  The comment tries to justify it, but
> > I don't buy it.
> 
> Wish I remembered who had complained when I proposed to kill that copying...
> It was introduced back in 2.1.110 and back then comment looked so:
> 
> +   /* Backed by a regular file - we need to hold onto
> +  a file structure for this file.  We'll use it to
> +  write to blocks that are not already present in
> +  a sparse file.  We create a new file structure
> +  based on the one passed to us via 'arg'.  This is
> +  to avoid changing the file structure that the
> +  caller is using */
> +
> 
> I would be happy to get rid of that crap - it was the only reason why I
> had to add the sodding file_moveto() and world would be better without it.
> If we can kill it off - let's do it and let's take fs/file_table:file_moveto()
> along.
> 
> IOW, I also think that copying the struct file is wrong. IIRC, complaints were
> bogus - losetup requires enough priviliges to make worrying about security
> implications somewhat pointless.

Ok, I've was able to cause 2 BUG's yesterday in a couple of hours, ( it was easier
to lock the machine solid than to get the BUG;-)), since loading test12 w/patch I
haven't seen a single BUG in the last 24 hours.  Any chance we can get this change
to Linus for test13?

Any specific tests you want me to run?

Martin
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
Please read the FAQ at http://www.tux.org/lkml/

Re: kernel BUG at /usr/src/linux/include/linux/nfs_fs.h:167! -reproducible

2000-12-19 Thread M.H.VanLeeuwen


Alexander Viro wrote:
 
 On Mon, 18 Dec 2000, Neil Brown wrote:
 
  On Monday December 18, [EMAIL PROTECTED] wrote:
" " == M H VanLeeuwen [EMAIL PROTECTED] writes:
  
 Trond, Neil I don't know if this is a loopback bug or an NFS
 bug but since nfs_fs.h was implicated so I thought one of you
 may be interested.
  
 Could you let me know if you know this problem has already been
 fixed or if you need more info.
  
   Hi,
  
   As far as I'm concerned, it's a loopback bug.
 
  I read it the same way.
  Actually, I cannot see the point of copying the "struct file"!  Why
  not just take a reference to it?  The comment tries to justify it, but
  I don't buy it.
 
 Wish I remembered who had complained when I proposed to kill that copying...
 It was introduced back in 2.1.110 and back then comment looked so:
 
 +   /* Backed by a regular file - we need to hold onto
 +  a file structure for this file.  We'll use it to
 +  write to blocks that are not already present in
 +  a sparse file.  We create a new file structure
 +  based on the one passed to us via 'arg'.  This is
 +  to avoid changing the file structure that the
 +  caller is using */
 +
 
 I would be happy to get rid of that crap - it was the only reason why I
 had to add the sodding file_moveto() and world would be better without it.
 If we can kill it off - let's do it and let's take fs/file_table:file_moveto()
 along.
 
 IOW, I also think that copying the struct file is wrong. IIRC, complaints were
 bogus - losetup requires enough priviliges to make worrying about security
 implications somewhat pointless.

Ok, I've was able to cause 2 BUG's yesterday in a couple of hours, ( it was easier
to lock the machine solid than to get the BUG;-)), since loading test12 w/patch I
haven't seen a single BUG in the last 24 hours.  Any chance we can get this change
to Linus for test13?

Any specific tests you want me to run?

Martin
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
Please read the FAQ at http://www.tux.org/lkml/

kernel BUG at /usr/src/linux/include/linux/nfs_fs.h:167! - reproducible

2000-12-16 Thread M.H.VanLeeuwen


Trond, Neil
 
I don't know if this is a loopback bug or an NFS bug but since
nfs_fs.h was implicated so I thought one of you may be interested.
 
Could you let me know if you know this problem has already been fixed
or if you need more info.
 
Martin
 
[1.] One line summary of the problem:
kernel BUG at /usr/src/linux/include/linux/nfs_fs.h:167!
 
[2.] Full description of the problem/report:
 
BUG while trying to read from NFS file mounted locally via loopback.
 
NFS client(heli, UP) & server(shadow, SMP) are 2.4.0-test12 NFSv3.
 
shadow mounted a file system image via loopback to a local mount point.
 
shadow:/mnt#mount
heli:/tmp on /mnt type nfs (rw,addr=192.168.0.2)
 
shadow:/mnt/uml# file root_fs_slackware_7.1_small
root_fs_slackware_7.1_small: x86 boot sector, extended partition table
 
shadow:/mnt/uml# mount -o loop root_fs_slackware_7.1_small /mnt1
 
shadow:/mnt/uml# mount
heli:/tmp on /mnt type nfs (rw,addr=192.168.0.2)
/mnt/uml/root_fs_slackware_7.1_small on /mnt1 type ext2 (rw,loop=/dev/loop/0)
 
shadow:/mnt/uml# cd /mnt1
shadow:/mnt1# lf
/usr/local/bin/lf: line 1:   585 Segmentation fault  ls $LS_OPTIONS $1
 
[3.] Keywords (i.e., modules, networking, kernel):
nfs, loopback
[4.] Kernel version (from /proc/version):
Linux version 2.4.0-test12 (mhvl@shadow) (gcc version 2.95.2 19991024 (release)) #1 
SMP Tue Dec 12 20:46:11 CST 2000  
[5.] Output of Oops.. message (if applicable) with symbolic information
 resolved (see Documentation/oops-tracing.txt)
ksymoops 2.3.5 on i686 2.4.0-test12.  Options used
 -V (default)
 -k /proc/ksyms (default)
 -l /proc/modules (default)
 -o /lib/modules/2.4.0-test12/ (default)
 -m /boot/System.map-2.4.0-test12 (specified)
 
No modules in ksyms, skipping objects
Warning (read_lsmod): no symbols in lsmod, is /proc/modules a valid lsmod file?
activating NMI Watchdog ... done.
cpu: 0, clocks: 680220, slice: 226740
cpu: 1, clocks: 680220, slice: 226740
kernel BUG at /usr/src/linux/include/linux/nfs_fs.h:167!
invalid operand: 
CPU:0
EIP:0010:[]
Using defaults from ksymoops -t elf32-i386 -a i386
EFLAGS: 00010282
eax: 0039   ebx: c8639ae0   ecx:    edx: 0002
esi: cba46840   edi: c78592c0   ebp: c05f6000   esp: c05f7d08
ds: 0018   es: 0018   ss: 0018
Process ls (pid: 585, stackpage=c05f7000)
Stack: c0231b65 c0231d40 00a7 c081fda0 c0a473c0  c1018fd4 c015b30d
   c081fda0 c0a473c0 c1018fd4  1000 c0a473c0  c1018fd4
   c081fda0 fff4 c015bb11 c081fda0 c0a473c0 c1018fd4 c13a9b7c c1018fd4
Call Trace: [] [] [] [] [] 
[] []
   [] [] [] [] [] [] 
[] []
   [] [] [] [] [] [] 
[]
Code: 0f 0b 83 c4 0c 89 7e 1c eb 26 90 8d 74 26 00 6a 00 8b 54 24
 
>>EIP; c015c351<=
Trace; c0231b65 
Trace; c0231d40 
Trace; c015b30d 
Trace; c015bb11 
Trace; c0125d33 
Trace; c01839bc 
Trace; c01838c4 
Trace; c0183ceb 
Trace; c01829cf <__make_request+5bb/638>
Trace; c0182b4d 
Trace; c0182bb0 
Trace; c0182d37 
Trace; c015207d 
Trace; c0150725 
Trace; c0112e58 
Trace; c0124891 
Trace; c014086c 
Trace; c0140c3c 
Trace; c0140d97 
Trace; c0140c3c 
Trace; c0123578 
Trace; c010b073 
Code;  c015c351 
 <_EIP>:
Code;  c015c351<=
   0:   0f 0b ud2a  <=
Code;  c015c353 
   2:   83 c4 0c  add$0xc,%esp
Code;  c015c356 
   5:   89 7e 1c  mov%edi,0x1c(%esi)
Code;  c015c359 
   8:   eb 26 jmp30 <_EIP+0x30> c015c381 

Code;  c015c35b 
   a:   90nop
Code;  c015c35c 
   b:   8d 74 26 00   lea0x0(%esi,1),%esi
Code;  c015c360 
   f:   6a 00 push   $0x0
Code;  c015c362 
  11:   8b 54 24 00   mov0x0(%esp,1),%edx
 
 
1 warning issued.  Results may not be reliable. 
[6.] A small shell script or example program which triggers the
 problem (if possible)
see description of problem
[7.] Environment
[7.1.] Software (add the output of the ver_linux script here)
-- Versions installed: (if some fields are empty or look
-- unusual then possibly you have very old versions)
Linux shadow 2.4.0-test12 #1 SMP Tue Dec 12 20:46:11 CST 2000 i686 unknown
Kernel modules 2.3.21
Gnu C  2.95.2
Gnu Make   3.79.1
Binutils   2.10.1
Linux C Library> libc.2.2
Dynamic linker ldd (GNU libc) 2.2
Procps 2.0.2
Mount  2.10q
Net-tools  1.57
Kbd1.00
Sh-utils   2.0
Modules Loaded
 
[7.2.] Processor information (from /proc/cpuinfo):
processor   : 0
vendor_id   : GenuineIntel
cpu family  : 6
model   : 6
model name  : Celeron (Mendocino)
stepping: 5
cpu MHz : 476.085
cache size  : 128 KB
fdiv_bug: no
hlt_bug : no
f00f_bug: no
coma_bug: no
fpu : yes
fpu_exception   : yes
cpuid level : 2
wp  : yes
flags   : fpu vme de pse tsc msr

kernel BUG at /usr/src/linux/include/linux/nfs_fs.h:167! - reproducible

2000-12-16 Thread M.H.VanLeeuwen


Trond, Neil
 
I don't know if this is a loopback bug or an NFS bug but since
nfs_fs.h was implicated so I thought one of you may be interested.
 
Could you let me know if you know this problem has already been fixed
or if you need more info.
 
Martin
 
[1.] One line summary of the problem:
kernel BUG at /usr/src/linux/include/linux/nfs_fs.h:167!
 
[2.] Full description of the problem/report:
 
BUG while trying to read from NFS file mounted locally via loopback.
 
NFS client(heli, UP)  server(shadow, SMP) are 2.4.0-test12 NFSv3.
 
shadow mounted a file system image via loopback to a local mount point.
 
shadow:/mnt#mount
heli:/tmp on /mnt type nfs (rw,addr=192.168.0.2)
 
shadow:/mnt/uml# file root_fs_slackware_7.1_small
root_fs_slackware_7.1_small: x86 boot sector, extended partition table
 
shadow:/mnt/uml# mount -o loop root_fs_slackware_7.1_small /mnt1
 
shadow:/mnt/uml# mount
heli:/tmp on /mnt type nfs (rw,addr=192.168.0.2)
/mnt/uml/root_fs_slackware_7.1_small on /mnt1 type ext2 (rw,loop=/dev/loop/0)
 
shadow:/mnt/uml# cd /mnt1
shadow:/mnt1# lf
/usr/local/bin/lf: line 1:   585 Segmentation fault  ls $LS_OPTIONS $1
 
[3.] Keywords (i.e., modules, networking, kernel):
nfs, loopback
[4.] Kernel version (from /proc/version):
Linux version 2.4.0-test12 (mhvl@shadow) (gcc version 2.95.2 19991024 (release)) #1 
SMP Tue Dec 12 20:46:11 CST 2000  
[5.] Output of Oops.. message (if applicable) with symbolic information
 resolved (see Documentation/oops-tracing.txt)
ksymoops 2.3.5 on i686 2.4.0-test12.  Options used
 -V (default)
 -k /proc/ksyms (default)
 -l /proc/modules (default)
 -o /lib/modules/2.4.0-test12/ (default)
 -m /boot/System.map-2.4.0-test12 (specified)
 
No modules in ksyms, skipping objects
Warning (read_lsmod): no symbols in lsmod, is /proc/modules a valid lsmod file?
activating NMI Watchdog ... done.
cpu: 0, clocks: 680220, slice: 226740
cpu: 1, clocks: 680220, slice: 226740
kernel BUG at /usr/src/linux/include/linux/nfs_fs.h:167!
invalid operand: 
CPU:0
EIP:0010:[c015c351]
Using defaults from ksymoops -t elf32-i386 -a i386
EFLAGS: 00010282
eax: 0039   ebx: c8639ae0   ecx:    edx: 0002
esi: cba46840   edi: c78592c0   ebp: c05f6000   esp: c05f7d08
ds: 0018   es: 0018   ss: 0018
Process ls (pid: 585, stackpage=c05f7000)
Stack: c0231b65 c0231d40 00a7 c081fda0 c0a473c0  c1018fd4 c015b30d
   c081fda0 c0a473c0 c1018fd4  1000 c0a473c0  c1018fd4
   c081fda0 fff4 c015bb11 c081fda0 c0a473c0 c1018fd4 c13a9b7c c1018fd4
Call Trace: [c0231b65] [c0231d40] [c015b30d] [c015bb11] [c0125d33] 
[c01839bc] [c01838c4]
   [c0183ceb] [c01829cf] [c0182b4d] [c0182bb0] [c0182d37] [c015207d] 
[c0150725] [c0112e58]
   [c0124891] [c014086c] [c0140c3c] [c0140d97] [c0140c3c] [c0123578] 
[c010b073]
Code: 0f 0b 83 c4 0c 89 7e 1c eb 26 90 8d 74 26 00 6a 00 8b 54 24
 
EIP; c015c351 nfs_create_request+195/1e4   =
Trace; c0231b65 devfsd_buf_size+1c85/1196c
Trace; c0231d40 devfsd_buf_size+1e60/1196c
Trace; c015b30d nfs_readpage_async+13d/1c0
Trace; c015bb11 nfs_readpage+c5/12c
Trace; c0125d33 do_generic_file_read+32f/57c
Trace; c01839bc lo_receive+58/68
Trace; c01838c4 lo_read_actor+0/a0
Trace; c0183ceb do_lo_request+31f/414
Trace; c01829cf __make_request+5bb/638
Trace; c0182b4d generic_make_request+101/110
Trace; c0182bb0 submit_bh+54/5c
Trace; c0182d37 ll_rw_block+137/1a4
Trace; c015207d ext2_bread+d1/114
Trace; c0150725 ext2_readdir+91/37c
Trace; c0112e58 do_page_fault+0/3e8
Trace; c0124891 merge_segments+15d/18c
Trace; c014086c vfs_readdir+8c/e8
Trace; c0140c3c filldir64+0/10c
Trace; c0140d97 sys_getdents64+4f/b8
Trace; c0140c3c filldir64+0/10c
Trace; c0123578 sys_brk+b4/d8
Trace; c010b073 system_call+33/38
Code;  c015c351 nfs_create_request+195/1e4
 _EIP:
Code;  c015c351 nfs_create_request+195/1e4   =
   0:   0f 0b ud2a  =
Code;  c015c353 nfs_create_request+197/1e4
   2:   83 c4 0c  add$0xc,%esp
Code;  c015c356 nfs_create_request+19a/1e4
   5:   89 7e 1c  mov%edi,0x1c(%esi)
Code;  c015c359 nfs_create_request+19d/1e4
   8:   eb 26 jmp30 _EIP+0x30 c015c381 
nfs_create_request+1c5/1e4
Code;  c015c35b nfs_create_request+19f/1e4
   a:   90nop
Code;  c015c35c nfs_create_request+1a0/1e4
   b:   8d 74 26 00   lea0x0(%esi,1),%esi
Code;  c015c360 nfs_create_request+1a4/1e4
   f:   6a 00 push   $0x0
Code;  c015c362 nfs_create_request+1a6/1e4
  11:   8b 54 24 00   mov0x0(%esp,1),%edx
 
 
1 warning issued.  Results may not be reliable. 
[6.] A small shell script or example program which triggers the
 problem (if possible)
see description of problem
[7.] Environment
[7.1.] Software (add the output of the ver_linux script here)
-- Versions installed: (if some fields are empty or look
-- unusual then possibly you have very old versions)
Linux shadow

Cache problems on test12-pre?

2000-12-08 Thread M.H.VanLeeuwen


Hi,

I've notices weird compile time failures etc on test12-pre7, especially
running more than 2 simultaneous processes...

but most noticeable is the time it takes to run ldconfig, after the
first time test11 takes less than 1 second, test12-pre7 takes ~40
seconds.

both were run immediately after reboot on a completely idle system
this system is almost exclusively NFS mounted file systems.

Is this a known problem?

Martin

here is test11  vs   test12-pre7

shadow:~# time ldconfig  shadow:~# time ldconfig
 
real0m35.881sreal0m43.979s
user0m0.120s user0m0.090s
sys 0m0.890s sys 0m0.980s
shadow:~# time ldconfig  shadow:~# time ldconfig
 
real0m0.870s     real0m38.702s
user0m0.040s user0m0.120s
sys 0m0.230s sys 0m0.980s
shadow:~# time ldconfig  shadow:~# time ldconfig
 
real0m0.345s     real0m40.181s
user0m0.040s user0m0.130s
sys 0m0.070s sys 0m0.900s
shadow:~# time ldconfig  shadow:~# time ldconfig
 
real0m0.098s  -  real0m39.108s
user0m0.050s user0m0.110s
sys 0m0.050s sys 0m1.180s
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
Please read the FAQ at http://www.tux.org/lkml/

another buffer.c:827 BUG, RAID1 reconstruction.

2000-12-08 Thread M.H.VanLeeuwen


Hi,

I got this BUG report after test12-pre7 soft locked on my NFS server,
all nfsd's in D state and I had to reboot and system was rebuilding the
ide RAID1 arrays.

NFS client test12-pre7 was rebooted as well, root logged in, and ran ldconfig

NFS server BUG'd out

Hand copied OOPS hope too much isn't wrong.

Martin

heli:~$ ksymoops -K -L -O -m /boot/System.map-2.4.0-test12 < buffer.bug
ksymoops 2.3.5 on i586 2.2.17-RAID.  Options used
 -V (default)
 -K (specified)
 -L (specified)
 -O (specified)
 -m /boot/System.map-2.4.0-test12 (specified)

Kernel BUG at buffer.c 827
invalid operand: 
CPU: 0
EIP: 0010:[]
Using defaults from ksymoops -t elf32-i386 -a i386
EFLAGS: 00010082
eax: 001c ebx: c10837d0 ecx:  edx: 
esi: c1be2e40 edi: 0002 epb: c1be2e88 esp: c0275ea8
ds:  0018 es: 0018 ss: 0018
Stack: c02151a5 c021545a 033b 0001e2ba 0046 c13c7be0 cabe2e40 c011c226c
   c1b32e40 0001 c13c7be0 c13c7bf8 0001 0001 c01c2392 c13c7be0
   0001 c13c7b88 c11d3780 0002 c0174c39 c13c7bf8 0001 c11d3780
Call Trace: [] [] [] [] [] 
[] []
[] [] [] [] [] 
[] [] []
[] [] [] [] []
code: 0f 06 83 c4 0c 8d 73 28 8d 43 2c 39 43 2c 74 15 b9 01 00 00

>>EIP; c012ca53<=
Trace; c0215195 
Trace; c021545a 
Trace; c01c226e 
Trace; c01c2399 
Trace; c0174c39 
Trace; c019db3b 
Trace; c01a60b4 
Trace; c019f367 
Trace; c01a6050 
Trace; c010a04f 
Trace; c010a1ae 
Trace; c01071f0 
Trace; e000 
Trace; c0108f20 
Trace; c0107180 
Trace; e000 
Trace; c0107213 
Trace; c0107277 
Trace; c0105000 
Trace; c0100191 
Code;  c012ca53 
 <_EIP>:
Code;  c012ca53<=
   0:   0f 06 clts  <=
Code;  c012ca55 
   2:   83 c4 0c  add$0xc,%esp
Code;  c012ca58 
   5:   8d 73 28  lea0x28(%ebx),%esi
Code;  c012ca5b 
   8:   8d 43 2c  lea0x2c(%ebx),%eax
Code;  c012ca5e 
   b:   39 43 2c  cmp%eax,0x2c(%ebx)
Code;  c012ca61 
   e:   74 15 je 25 <_EIP+0x25> c012ca78 

Code;  c012ca63 
  10:   b9 01 00 00 00mov$0x1,%ecx
 
Aiee, killing interrupt handler
heli:~$
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
Please read the FAQ at http://www.tux.org/lkml/

another buffer.c:827 BUG, RAID1 reconstruction.

2000-12-08 Thread M.H.VanLeeuwen


Hi,

I got this BUG report after test12-pre7 soft locked on my NFS server,
all nfsd's in D state and I had to reboot and system was rebuilding the
ide RAID1 arrays.

NFS client test12-pre7 was rebooted as well, root logged in, and ran ldconfig

NFS server BUG'd out

Hand copied OOPS hope too much isn't wrong.

Martin

heli:~$ ksymoops -K -L -O -m /boot/System.map-2.4.0-test12  buffer.bug
ksymoops 2.3.5 on i586 2.2.17-RAID.  Options used
 -V (default)
 -K (specified)
 -L (specified)
 -O (specified)
 -m /boot/System.map-2.4.0-test12 (specified)

Kernel BUG at buffer.c 827
invalid operand: 
CPU: 0
EIP: 0010:[c012ca53]
Using defaults from ksymoops -t elf32-i386 -a i386
EFLAGS: 00010082
eax: 001c ebx: c10837d0 ecx:  edx: 
esi: c1be2e40 edi: 0002 epb: c1be2e88 esp: c0275ea8
ds:  0018 es: 0018 ss: 0018
Stack: c02151a5 c021545a 033b 0001e2ba 0046 c13c7be0 cabe2e40 c011c226c
   c1b32e40 0001 c13c7be0 c13c7bf8 0001 0001 c01c2392 c13c7be0
   0001 c13c7b88 c11d3780 0002 c0174c39 c13c7bf8 0001 c11d3780
Call Trace: [c0215195] [c021545a] [c01c226e] [c01c2399] [c0174c39] 
[c019db3b] [c01a60b4]
[c019f367] [c01a6050] [c010a04f] [c010a1ae] [c01071f0] 
[e000] [c0108f20] [c0107180]
[e000] [c0107213] [c0107277] [c0105000] [c0100191]
code: 0f 06 83 c4 0c 8d 73 28 8d 43 2c 39 43 2c 74 15 b9 01 00 00

EIP; c012ca53 end_buffer_io_async+c7/f4   =
Trace; c0215195 tvecs+318d/19d24
Trace; c021545a tvecs+3452/19d24
Trace; c01c226e raid1_end_bh_io+7e/110
Trace; c01c2399 raid1_end_request+99/a0
Trace; c0174c39 end_that_request_first+65/c4
Trace; c019db3b ide_end_request+27/74
Trace; c01a60b4 ide_dma_intr+64/9c
Trace; c019f367 ide_intr+fb/150
Trace; c01a6050 ide_dma_intr+0/9c
Trace; c010a04f handle_IRQ_event+2f/58
Trace; c010a1ae do_IRQ+6e/b0
Trace; c01071f0 default_idle+0/28
Trace; e000 END_OF_CODE+3fd2af28/
Trace; c0108f20 ret_from_intr+0/20
Trace; c0107180 init+a4/104
Trace; e000 END_OF_CODE+3fd2af28/
Trace; c0107213 default_idle+23/28
Trace; c0107277 cpu_idle+3f/54
Trace; c0105000 empty_bad_page+0/1000
Trace; c0100191 L6+0/2
Code;  c012ca53 end_buffer_io_async+c7/f4
 _EIP:
Code;  c012ca53 end_buffer_io_async+c7/f4   =
   0:   0f 06 clts  =
Code;  c012ca55 end_buffer_io_async+c9/f4
   2:   83 c4 0c  add$0xc,%esp
Code;  c012ca58 end_buffer_io_async+cc/f4
   5:   8d 73 28  lea0x28(%ebx),%esi
Code;  c012ca5b end_buffer_io_async+cf/f4
   8:   8d 43 2c  lea0x2c(%ebx),%eax
Code;  c012ca5e end_buffer_io_async+d2/f4
   b:   39 43 2c  cmp%eax,0x2c(%ebx)
Code;  c012ca61 end_buffer_io_async+d5/f4
   e:   74 15 je 25 _EIP+0x25 c012ca78 
end_buffer_io_async+ec/f4
Code;  c012ca63 end_buffer_io_async+d7/f4
  10:   b9 01 00 00 00mov$0x1,%ecx
 
Aiee, killing interrupt handler
heli:~$
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
Please read the FAQ at http://www.tux.org/lkml/

Cache problems on test12-pre?

2000-12-08 Thread M.H.VanLeeuwen


Hi,

I've notices weird compile time failures etc on test12-pre7, especially
running more than 2 simultaneous processes...

but most noticeable is the time it takes to run ldconfig, after the
first time test11 takes less than 1 second, test12-pre7 takes ~40
seconds.

both were run immediately after reboot on a completely idle system
this system is almost exclusively NFS mounted file systems.

Is this a known problem?

Martin

here is test11  vs   test12-pre7

shadow:~# time ldconfig  shadow:~# time ldconfig
 
real0m35.881sreal0m43.979s
user0m0.120s user0m0.090s
sys 0m0.890s sys 0m0.980s
shadow:~# time ldconfig  shadow:~# time ldconfig
 
real0m0.870s     real0m38.702s
user0m0.040s user0m0.120s
sys 0m0.230s sys 0m0.980s
shadow:~# time ldconfig  shadow:~# time ldconfig
 
real0m0.345s     real0m40.181s
user0m0.040s user0m0.130s
sys 0m0.070s sys 0m0.900s
shadow:~# time ldconfig  shadow:~# time ldconfig
 
real0m0.098s  -  real0m39.108s
user0m0.050s user0m0.110s
sys 0m0.050s sys 0m1.180s
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
Please read the FAQ at http://www.tux.org/lkml/

Re: run level 1, login takes too long, 2.4.X vs. 2.2.X

2000-11-19 Thread M.H.VanLeeuwen

Hi David,

Yup, I know rpc.portmap isn't running, the point is that it wasn't running on either
2.2.17 or 2.4.X.  Isn't run level 1 supposed to only be the bare minimum of running
processes, a few kernel processes, init and getty.  No network services...

What's changed in the kernel to elicit this behavior?

Is there a better "faster" way to get root access at run level 1 w/o login & passwd
on 2.4.X?

No it's not an everyday occurance, but I was impatiently thinking the sytem had
locked up and rebooted a couple of times, so it got me wondering why 2.2.X and
2.4.X differ in this basic behavior. 

Martin

David Ford wrote:
> 
> rpc.portmap isn't running, your login configuration/nss requires yp or something 
>provided ans an RPC.
> 
> -d
> 
> "M.H.VanLeeuwen" wrote:
> 
> > I had occasion to "telinit 1" today and found that it took a long time
> > to login after root passwd was entered.  this doesn't happen with 2.2.X
> > kernels.
> >
> > Is this to be expected with the 2.4 series kernels? or a bug?
> >
> > Martin
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
Please read the FAQ at http://www.tux.org/lkml/

run level 1, login takes too long, 2.4.X vs. 2.2.X

2000-11-19 Thread M.H.VanLeeuwen


I had occasion to "telinit 1" today and found that it took a long time
to login after root passwd was entered.  this doesn't happen with 2.2.X
kernels.

Is this to be expected with the 2.4 series kernels? or a bug?

Martin

strace for 2.4.0-test11-pre7

---snip---
gettimeofday({974665658, 952483}, NULL) = 0
socket(PF_INET, SOCK_DGRAM, IPPROTO_UDP) = 3
getpid()= 305
bind(3, {sin_family=AF_INET, sin_port=htons(905), sin_addr=inet_addr("0.0.0.0")}}, 16) 
= 0
ioctl(3, FIONBIO, [1])  = 0
sendto(3, "\31\23\233@\0\0\0\0\0\0\0\2\0\1\206\240\0\0\0\2\0\0\0\3"..., 56, 0, 
{sin_family=AF_INET, sin_port=htons(111),
sin_addr=inet_addr("127.0.0.1")}}, 16) = 56
poll([{fd=3, events=POLLIN}], 1, 5000)  = 0
ioctl(3, SIOCGIFCONF, 0xbfffb33c)   = 0
ioctl(3, SIOCGIFFLAGS, 0xbfffb344)  = 0
sendto(3, "\31\23\233@\0\0\0\0\0\0\0\2\0\1\206\240\0\0\0\2\0\0\0\3"..., 56, 0, 
{sin_family=AF_INET, sin_port=htons(111),
sin_addr=inet_addr("127.0.0.1")}}, 16) = 56 
---snip---

strace for 2.2.17

---snip---
gettimeofday({974664928, 735539}, NULL) = 0
socket(PF_INET, SOCK_DGRAM, IPPROTO_UDP) = 3
getpid()= 368
bind(3, {sin_family=AF_INET, sin_port=htons(968), sin_addr=inet_addr("0.0.0.0")}}, 16) 
= 0
ioctl(3, FIONBIO, [1])  = 0
sendto(3, "_c\353\331\0\0\0\0\0\0\0\2\0\1\206\240\0\0\0\2\0\0\0\3"..., 56, 0, 
{sin_family=AF_INET, sin_port=htons(111),
sin_addr=inet_addr("127.0.0.1")}}, 16) = 56
poll([{fd=3, events=POLLIN, revents=POLLERR}], 1, 5000) = 1
recvfrom(3, 0x8056380, 400, 0, 0xbfffd66c, 0xbfffd618) = -1 ECONNREFUSED (Connection 
refused)
close(3)= 0  
---snip---
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
Please read the FAQ at http://www.tux.org/lkml/

run level 1, login takes too long, 2.4.X vs. 2.2.X

2000-11-19 Thread M.H.VanLeeuwen


I had occasion to "telinit 1" today and found that it took a long time
to login after root passwd was entered.  this doesn't happen with 2.2.X
kernels.

Is this to be expected with the 2.4 series kernels? or a bug?

Martin

strace for 2.4.0-test11-pre7

---snip---
gettimeofday({974665658, 952483}, NULL) = 0
socket(PF_INET, SOCK_DGRAM, IPPROTO_UDP) = 3
getpid()= 305
bind(3, {sin_family=AF_INET, sin_port=htons(905), sin_addr=inet_addr("0.0.0.0")}}, 16) 
= 0
ioctl(3, FIONBIO, [1])  = 0
sendto(3, "\31\23\233@\0\0\0\0\0\0\0\2\0\1\206\240\0\0\0\2\0\0\0\3"..., 56, 0, 
{sin_family=AF_INET, sin_port=htons(111),
sin_addr=inet_addr("127.0.0.1")}}, 16) = 56
poll([{fd=3, events=POLLIN}], 1, 5000)  = 0
ioctl(3, SIOCGIFCONF, 0xbfffb33c)   = 0
ioctl(3, SIOCGIFFLAGS, 0xbfffb344)  = 0
sendto(3, "\31\23\233@\0\0\0\0\0\0\0\2\0\1\206\240\0\0\0\2\0\0\0\3"..., 56, 0, 
{sin_family=AF_INET, sin_port=htons(111),
sin_addr=inet_addr("127.0.0.1")}}, 16) = 56 
---snip---

strace for 2.2.17

---snip---
gettimeofday({974664928, 735539}, NULL) = 0
socket(PF_INET, SOCK_DGRAM, IPPROTO_UDP) = 3
getpid()= 368
bind(3, {sin_family=AF_INET, sin_port=htons(968), sin_addr=inet_addr("0.0.0.0")}}, 16) 
= 0
ioctl(3, FIONBIO, [1])  = 0
sendto(3, "_c\353\331\0\0\0\0\0\0\0\2\0\1\206\240\0\0\0\2\0\0\0\3"..., 56, 0, 
{sin_family=AF_INET, sin_port=htons(111),
sin_addr=inet_addr("127.0.0.1")}}, 16) = 56
poll([{fd=3, events=POLLIN, revents=POLLERR}], 1, 5000) = 1
recvfrom(3, 0x8056380, 400, 0, 0xbfffd66c, 0xbfffd618) = -1 ECONNREFUSED (Connection 
refused)
close(3)= 0  
---snip---
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
Please read the FAQ at http://www.tux.org/lkml/

Re: run level 1, login takes too long, 2.4.X vs. 2.2.X

2000-11-19 Thread M.H.VanLeeuwen


Hi David,

Yup, I know rpc.portmap isn't running, the point is that it wasn't running on either
2.2.17 or 2.4.X.  Isn't run level 1 supposed to only be the bare minimum of running
processes, a few kernel processes, init and getty.  No network services...

What's changed in the kernel to elicit this behavior?

Is there a better "faster" way to get root access at run level 1 w/o login  passwd
on 2.4.X?

No it's not an everyday occurance, but I was impatiently thinking the sytem had
locked up and rebooted a couple of times, so it got me wondering why 2.2.X and
2.4.X differ in this basic behavior. 

Martin

David Ford wrote:
 
 rpc.portmap isn't running, your login configuration/nss requires yp or something 
provided ans an RPC.
 
 -d
 
 "M.H.VanLeeuwen" wrote:
 
  I had occasion to "telinit 1" today and found that it took a long time
  to login after root passwd was entered.  this doesn't happen with 2.2.X
  kernels.
 
  Is this to be expected with the 2.4 series kernels? or a bug?
 
  Martin
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
Please read the FAQ at http://www.tux.org/lkml/

Re: Linux-2.4.0-test10

2000-11-01 Thread M.H.VanLeeuwen

"CRADOCK, Christopher" wrote:
> 
> I have a similar hardware list and I don't observe any of these problems on
> 2.4.0-test10x. Is it possibly a hardware conflict somewhere?
> 
> What I do see occasionally is if X was ever heavy on the memory usage (say
> I've run GIMP for a couple of hours) then the text console's font set gets
> trashed until the next reboot. Console driver failing to reset something?
> 
> Chris Cradock
> 

Hi Chris

Never had the trashed fonts before.

How about a better comparison of systems?
All I mentioned were r128, ne2k, PIIX4 and SMP,  barely enough to claim similar
hardware thus these aren't real problems cause you don't see them.
I can send you gory details if your interested.

My reason for claiming these are problems, maybe not show stoppers, are:

This system is rock solid on 2.2.X.

problem 1, shouldn't fail on 2.4 if it works just fine on 2.2.  Probably a locking
issue but I'm not sure.  Any idea where to put some BKL's to see if the problem
will go away?

problem 2, happens randomly, so is it a hardware problem or a software issue?  being
that the system works fine SMP and UP then my guess is a software interaction when
UP-APIC is enabled, a race condition??

problem 3, new feature in 2.4, one would expect, hey, I've got this hdwr in my system,
let me enable this option...  wait a minute the system doesn't boot...

problem 4, ISAPNP in the kernel is new for 2.4, i was pointing out that it can be
improved to make it better able to select IRQ's that work so that the user can just
upgrade to 2.4 without having to tweak the BIOS and/or the code.  I sent a patch to
Linus but he rejected it, yes I realize it was a weak attempt but it fixed my ISAPNP
problems, and no one has proposed a better solution.  Shouldn't the
first release of 2.4.0 show that it's new capabilities are ready for prime time?

Thanks,
Martin

> 1.  kernel compiled w/o FB support.  When attempting to switch
> back to X from VC1-6 system locks hard for SMP.  Nada thing
> fixes this except hard reset... no Alt-SysRq-B, nothing
> DRI not enabled.  Video card has r128 chipset.
>  
> 2.  System is a NFS root machine, after a period of heavy ntwk
> activity, eg. "make clean" in /usr/src/linux ETH0 no longer
> works or sometimes just ntwk activity during system boot is
> enough to cause the ETH activity to cease.
> The only recourse is to Alt-SysRq-B the system.
> NIC = NE2K ISA
>  
> 3. Enabling PIIX4, kernel locks hard when printing the partition
>tables for hdc.   hdc has no partitions.
>I think this problem is on Ted's problem list???
>  
> 4. ISAPNP assigns an invalid/unusable IRQ to NE2K NIC card.
>Previously reported to Linus & Ingo, they asked for an MPTABLE
>dump, haven't heard back since providing said data.
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
Please read the FAQ at http://www.tux.org/lkml/

Re: Linux-2.4.0-test10

2000-11-01 Thread M.H.VanLeeuwen

David Ford wrote:
> 
> "M.H.VanLeeuwen" wrote:
> 
> > 3. Enabling PIIX4, kernel locks hard when printing the partition
> >tables for hdc.   hdc has no partitions.
> >I think this problem is on Ted's problem list???
> 
> Disable PIIXn tuning and recompile your kernel.  How does it fare now?
> 

Yep, disabling (opposite of "enabling") does allow the kernel to boot just fine.
PIIXn tuning must be tickling something on the system so that the first time we
read from the disk, partition check block 0, the system freezes hard.

I do know that w/o PIIXn tuning the result of the first block read is all zero's
hence the "/dev/ide/host0/bus0/target1/lun0: unknown partition table" message.

Any idea how to go about debugging this kind of lockup?
Guess i'll scrounge up a couple of disks and see if it's controller or disk related.

Weren't you also experiencing this type of problem on a laptop?  

Martin

lspci -v
00:07.0 ISA bridge: Intel Corporation 82371AB PIIX4 ISA (rev 02)
Flags: bus master, medium devsel, latency 0

00:07.1 IDE interface: Intel Corporation 82371AB PIIX4 IDE (rev 01) (prog-if 80 
[Master])
Flags: bus master, medium devsel, latency 32
I/O ports at f000 [size=16]

00:07.2 USB Controller: Intel Corporation 82371AB PIIX4 USB (rev 01) (prog-if 00 
[UHCI])
Flags: bus master, medium devsel, latency 32, IRQ 19
I/O ports at d000 [size=32]

00:07.3 Bridge: Intel Corporation 82371AB PIIX4 ACPI (rev 02)
Flags: medium devsel
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
Please read the FAQ at http://www.tux.org/lkml/

Re: Linux-2.4.0-test10

2000-11-01 Thread M.H.VanLeeuwen


David Ford wrote:
 
 "M.H.VanLeeuwen" wrote:
 
  3. Enabling PIIX4, kernel locks hard when printing the partition
 tables for hdc.   hdc has no partitions.
 I think this problem is on Ted's problem list???
 
 Disable PIIXn tuning and recompile your kernel.  How does it fare now?
 

Yep, disabling (opposite of "enabling") does allow the kernel to boot just fine.
PIIXn tuning must be tickling something on the system so that the first time we
read from the disk, partition check block 0, the system freezes hard.

I do know that w/o PIIXn tuning the result of the first block read is all zero's
hence the "/dev/ide/host0/bus0/target1/lun0: unknown partition table" message.

Any idea how to go about debugging this kind of lockup?
Guess i'll scrounge up a couple of disks and see if it's controller or disk related.

Weren't you also experiencing this type of problem on a laptop?  


Martin


lspci -v
00:07.0 ISA bridge: Intel Corporation 82371AB PIIX4 ISA (rev 02)
Flags: bus master, medium devsel, latency 0

00:07.1 IDE interface: Intel Corporation 82371AB PIIX4 IDE (rev 01) (prog-if 80 
[Master])
Flags: bus master, medium devsel, latency 32
I/O ports at f000 [size=16]

00:07.2 USB Controller: Intel Corporation 82371AB PIIX4 USB (rev 01) (prog-if 00 
[UHCI])
Flags: bus master, medium devsel, latency 32, IRQ 19
I/O ports at d000 [size=32]

00:07.3 Bridge: Intel Corporation 82371AB PIIX4 ACPI (rev 02)
Flags: medium devsel
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
Please read the FAQ at http://www.tux.org/lkml/

Re: Linux-2.4.0-test10

2000-11-01 Thread M.H.VanLeeuwen


"CRADOCK, Christopher" wrote:
 
 I have a similar hardware list and I don't observe any of these problems on
 2.4.0-test10x. Is it possibly a hardware conflict somewhere?
 
 What I do see occasionally is if X was ever heavy on the memory usage (say
 I've run GIMP for a couple of hours) then the text console's font set gets
 trashed until the next reboot. Console driver failing to reset something?
 
 Chris Cradock
 

Hi Chris

Never had the trashed fonts before.

How about a better comparison of systems?
All I mentioned were r128, ne2k, PIIX4 and SMP,  barely enough to claim similar
hardware thus these aren't real problems cause you don't see them.
I can send you gory details if your interested.

My reason for claiming these are problems, maybe not show stoppers, are:

This system is rock solid on 2.2.X.

problem 1, shouldn't fail on 2.4 if it works just fine on 2.2.  Probably a locking
issue but I'm not sure.  Any idea where to put some BKL's to see if the problem
will go away?

problem 2, happens randomly, so is it a hardware problem or a software issue?  being
that the system works fine SMP and UP then my guess is a software interaction when
UP-APIC is enabled, a race condition??

problem 3, new feature in 2.4, one would expect, hey, I've got this hdwr in my system,
let me enable this option...  wait a minute the system doesn't boot...

problem 4, ISAPNP in the kernel is new for 2.4, i was pointing out that it can be
improved to make it better able to select IRQ's that work so that the user can just
upgrade to 2.4 without having to tweak the BIOS and/or the code.  I sent a patch to
Linus but he rejected it, yes I realize it was a weak attempt but it fixed my ISAPNP
problems, and no one has proposed a better solution.  Shouldn't the
first release of 2.4.0 show that it's new capabilities are ready for prime time?


Thanks,
Martin


 1.  kernel compiled w/o FB support.  When attempting to switch
 back to X from VC1-6 system locks hard for SMP.  Nada thing
 fixes this except hard reset... no Alt-SysRq-B, nothing
 DRI not enabled.  Video card has r128 chipset.
  
 2.  System is a NFS root machine, after a period of heavy ntwk
 activity, eg. "make clean" in /usr/src/linux ETH0 no longer
 works or sometimes just ntwk activity during system boot is
 enough to cause the ETH activity to cease.
 The only recourse is to Alt-SysRq-B the system.
 NIC = NE2K ISA
  
 3. Enabling PIIX4, kernel locks hard when printing the partition
tables for hdc.   hdc has no partitions.
I think this problem is on Ted's problem list???
  
 4. ISAPNP assigns an invalid/unusable IRQ to NE2K NIC card.
Previously reported to Linus  Ingo, they asked for an MPTABLE
dump, haven't heard back since providing said data.
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
Please read the FAQ at http://www.tux.org/lkml/

Re: Linux-2.4.0-test10

2000-10-31 Thread M.H.VanLeeuwen


FYI,

My list of 2.4.0-testX problems

Further details, .config, etc...available if needed

Martin

2.4.0-test10 and earlier problem list:
 
Problem |   UP  UP-APIC SMP
|
1   |   OK  OK  HARDLOCK
2   |   OK  FAILS   OK
3   |   HARDLOCKHARDLOCKHARDLOCK
4   |   BROKEN  BROKEN  BROKEN

Problem description:
 
1.  kernel compiled w/o FB support.  When attempting to switch
back to X from VC1-6 system locks hard for SMP.  Nada thing
fixes this except hard reset... no Alt-SysRq-B, nothing
DRI not enabled.  Video card has r128 chipset.
 
2.  System is a NFS root machine, after a period of heavy ntwk
activity, eg. "make clean" in /usr/src/linux ETH0 no longer
works or sometimes just ntwk activity during system boot is
enough to cause the ETH activity to cease.
The only recourse is to Alt-SysRq-B the system.
NIC = NE2K ISA
 
3. Enabling PIIX4, kernel locks hard when printing the partition
   tables for hdc.   hdc has no partitions.
   I think this problem is on Ted's problem list???
 
4. ISAPNP assigns an invalid/unusable IRQ to NE2K NIC card.
   Previously reported to Linux & Ingo, they asked for an MPTABLE
   dump, haven't heard back since providing said data.
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
Please read the FAQ at http://www.tux.org/lkml/

Re: Linux-2.4.0-test10

2000-10-31 Thread M.H.VanLeeuwen


FYI,

My list of 2.4.0-testX problems

Further details, .config, etc...available if needed

Martin

2.4.0-test10 and earlier problem list:
 
Problem |   UP  UP-APIC SMP
|
1   |   OK  OK  HARDLOCK
2   |   OK  FAILS   OK
3   |   HARDLOCKHARDLOCKHARDLOCK
4   |   BROKEN  BROKEN  BROKEN

Problem description:
 
1.  kernel compiled w/o FB support.  When attempting to switch
back to X from VC1-6 system locks hard for SMP.  Nada thing
fixes this except hard reset... no Alt-SysRq-B, nothing
DRI not enabled.  Video card has r128 chipset.
 
2.  System is a NFS root machine, after a period of heavy ntwk
activity, eg. "make clean" in /usr/src/linux ETH0 no longer
works or sometimes just ntwk activity during system boot is
enough to cause the ETH activity to cease.
The only recourse is to Alt-SysRq-B the system.
NIC = NE2K ISA
 
3. Enabling PIIX4, kernel locks hard when printing the partition
   tables for hdc.   hdc has no partitions.
   I think this problem is on Ted's problem list???
 
4. ISAPNP assigns an invalid/unusable IRQ to NE2K NIC card.
   Previously reported to Linux  Ingo, they asked for an MPTABLE
   dump, haven't heard back since providing said data.
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
Please read the FAQ at http://www.tux.org/lkml/

tun/tap devices on 2.4.0-testX

2000-10-05 Thread M.H.VanLeeuwen


need help getting tapX devices operational on 2.4

for 2.4 using the new tun.o module I cannot create a tap0
interface.  on 2.2 it is the ethertap.o module.
changing modules.conf allows the module to load but the
interface does not come up:

ifconfig tap0 192.168.0.10 up
SIOCSIFADDR: No such device
tap0: unknown interface: No such device
tap0: unknown interface: No such device


has anyone successfully gotten this to work, if so how?

using devfs, which has created /dev/netlink/tapX devices...
symlinking /dev/netlink/tap0 /dev/tap0 doesn't help.

#define TUN_DEBUG 1, in tun.c doesn't help, fails to compile.

TIA
Martin
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
Please read the FAQ at http://www.tux.org/lkml/

Re: [PATCH] ISAPNP using an invalid IO_APIC IRQ

2000-09-23 Thread M.H.VanLeeuwen


Alan Cox wrote:
> 
> IRQ 5 may well have gone to an onboard device.
> 
> There are two things to note here:
> 
> 1.  By default if you boot with non PnP OS the BIOS will assign IRQ's
> to PnP devices and we would be best to try and keep the existing value when
> possible (so the PCI/ISA routing is correct)

Yes, when I tell the BIOS that the OS is not PnP aware it assigns IRQ 9 to NIC,
yet Linux tries to assign it to IRQ 5.


Martin
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
Please read the FAQ at http://www.tux.org/lkml/

Re: [PATCH] ISAPNP using an invalid IO_APIC IRQ

2000-09-23 Thread M.H.VanLeeuwen


Alan Cox wrote:
 
 IRQ 5 may well have gone to an onboard device.
 
 There are two things to note here:
 
 1.  By default if you boot with non PnP OS the BIOS will assign IRQ's
 to PnP devices and we would be best to try and keep the existing value when
 possible (so the PCI/ISA routing is correct)

Yes, when I tell the BIOS that the OS is not PnP aware it assigns IRQ 9 to NIC,
yet Linux tries to assign it to IRQ 5.


Martin
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
Please read the FAQ at http://www.tux.org/lkml/

Re: [PATCH] ISAPNP using an invalid IO_APIC IRQ

2000-09-21 Thread M.H.VanLeeuwen

Linus Torvalds wrote:
> 
> On Thu, 21 Sep 2000, M.H.VanLeeuwen wrote:
> >
> > Is this patch acceptable?
> 
> Please explain.
> 
> The test seems to be that "if there are IO_APICs, a PnP irq _has_ to be an
> IO_APIC irq".
> 
> > +   if (!IO_APIC_IRQ(irq) && io_apic_irqs)
> > +   return 1;
> 
> Which makes no sense to me. Why would a legacy irq not be acceptable?

firstly, I have no understanding of IO_APIC vs. legacy IRQ handling in i386
hardware so what I say may be completely wrong.  I'll explain my circumstance
and we'll go from there.

Legacy would be just fine, but ISAPNP is attempting to assign an IRQ that is
unavailable on my system.  In otherwords, the BIOS has assigned it to some
other function and thus the IRQ doesn't appear to Linux as even existing.
In my case IRQ5 was assigned to "DISPLAY CONTROLLER" and IRQ5 isn't available
to the system, excerpts from dmesg:
...
Bus #2 is ISA
I/O APIC #2 Version 17 at 0xFEC0.
Int: type 0, pol 0, trig 0, bus 2, IRQ 04, APIC ID 2, APIC INT 04  *** NO ISA IRQ 5 ***
Int: type 0, pol 0, trig 0, bus 2, IRQ 06, APIC ID 2, APIC INT 06
...
ENABLING IO-APIC IRQs
...changing IO-APIC physical APIC ID to 2 ... ok.
Synchronizing Arb IDs.
init IO_APIC IRQs
 IO-APIC (apicid-pin) 2-0, 2-5, 2-10, 2-11, 2-17, 2-20, 2-21, 2-22, 2-23 not connected.
 ^ no IRQ 5 ?
...
 IRQ redirection table:
 NR Log Phy Mask Trig IRR Pol Stat Dest Deli Vect:
 04 003 03  000   0   01149
 05 000 00  100   0   00000   *** IRQ 5 ??? ***
 06 003 03  000   0   01151

Yet ISAPNP decided to assign that IRQ5 to my NIC.  ne.c fails to operate
when there are plenty of other IRQs still available.

The difference is in this logic between _no_ IO-APIC and _yes_ IO_APIC
in isapnp.c

isapnp_for_each_dev(dev) {
if (dev->active) {
if (dev->irq_resource[0].start == irq ||
dev->irq_resource[1].start == irq)
return 1;

}
}  

since w/o IO_APIC no PIC->APIC IRQ transform is done and this returns 1
and IRQ 5 is not used.  With IO_APIC no match occurs and the system attempt
to use IRQ 5, thus a hung system.

Hence, my attempt to rule out APIC IRQs that were unavailable to the system,
also assuming the legacy IRQs would also not be available.  This may not be
globally true, but seems to be on my system.  Abit BP6, nu Bios.

This entire problem would not have bothered me too badly, had the system
come up and run, but w/o explicitly telling the BIOS to reserve an ISA
IRQ ISAPNP eventually decides to try IRQ 12.  IRQ sharing with PS/2 mouse
and ne.c fail miserably.  On 2.2.X I had been using IRQ9 for ne.c card.
With 2.4 IRQ 9 is listed after IRQ 12 in xtab[16] (isapnp.c) so the system
fails to boot.

> Especially as they seem to be acceptable if there are _no_ IO-APIC irq's.

Only if they work. ;)

More info on request or tell me to go away, _grin_ again.

Martin
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
Please read the FAQ at http://www.tux.org/lkml/

[PATCH] ISAPNP using an invalid IO_APIC IRQ

2000-09-21 Thread M.H.VanLeeuwen


Linus,

Is this patch acceptable?

info & patch below.

Martin

>>Jaroslav
>>
>>Basically, this patch discards any unusable IO_APIC IRQs from the list of IRQs
>>that ISA PNP is trying to allocate from - but only if IO_APIC IRQs are available.
>>
>>Comments?
>>Martin

Jaroslav Kysela wrote:
>
>It works only for i386.
>
>I suggest to add:
>
>#ifndef IO_APIC_IRQ
I used CONFIG_X86_IO_APIC instead
>#define IO_APIC_IRQ(x)
>#define io_apic_irqs 0
>#endif
>
>Also, isapnp_proc.c shouldn't be changed. I prefer to see all possible
>IRQs, although they can't be used. Feel free to post this patch to Linus
>and lkml.

I left in the patch for isapnp_proc.c just in case...

>
>Jaroslav
updated patch

--- /home/mhvl/linux.orig/drivers/pnp/isapnp.c  Tue Jul 25 20:25:14 2000
+++ /home/mhvl/linux/drivers/pnp/isapnp.c   Wed Sep 20 20:59:56 2000
@@ -33,10 +33,10 @@
 #include 
 #include 
 #include 
-#include 
 #include 
 #include 
 #include 
+#include 

 LIST_HEAD(isapnp_cards);
 LIST_HEAD(isapnp_devices);
@@ -48,6 +48,11 @@
 #define ISAPNP_DEBUG
 #endif

+#ifndef CONFIG_X86_IO_APIC
+#define IO_APIC_IRQ(x) 0
+#define io_apic_irqs 0
+#endif
+
 struct resource *pidxr_res = NULL;
 struct resource *pnpwrp_res = NULL;
 struct resource *isapnp_rdp_res = NULL;
@@ -1598,6 +1603,8 @@

if (irq < 0 || irq > 15)
return 1;
+   if (!IO_APIC_IRQ(irq) && io_apic_irqs)
+   return 1;
for (i = 0; i < 16; i++) {
if (isapnp_reserve_irq[i] == irq)
return 1;
--- /home/mhvl/linux.orig/drivers/pnp/isapnp_proc.c Wed Jul 12 23:58:43 2000
+++ /home/mhvl/linux/drivers/pnp/isapnp_proc.c  Tue Sep 19 17:52:25 2000
@@ -30,6 +30,12 @@
 #include 
 #include 
 #include 
+#include 
+
+#ifndef CONFIG_X86_IO_APIC
+#define IO_APIC_IRQ(x) 0
+#define io_apic_irqs 0
+#endif

 struct isapnp_info_buffer {
char *buffer;   /* pointer to begin of buffer */
@@ -448,7 +454,7 @@

isapnp_printf(buffer, "%sIRQ ", space);
for (i = 0; i < 16; i++)
-   if (irq->map & (1<map & (1<From previous e-mails

"M.H.VanLeeuwen" wrote:
> 
> Jaroslav Kysela wrote:
> >
> > On Tue, 29 Aug 2000, M.H.VanLeeuwen wrote:
> >
> > > With default BIOS settings, IRQ 5 is unavailable for ISA yet
> > > it is being assigned by the ne.c driver and NFS root system
> > > doesn't finish booting.
> > >
> > > Is this a driver problem or a ISAPNP problem?
> >
> > It is general IRQ manager problem. ISA PnP code cannot determine which
> > interrupts cannot be used. I think that we need some next layer between
> > IO APIC control code and Plug & Play routines to exchange this information
> > to prevent allocation of a free but unuseable IRQ for ISA PnP devices.
> >
> > Jaroslav
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
Please read the FAQ at http://www.tux.org/lkml/

[PATCH] ISAPNP using an invalid IO_APIC IRQ

2000-09-21 Thread M.H.VanLeeuwen


Linus,

Is this patch acceptable?

info  patch below.

Martin

Jaroslav

Basically, this patch discards any unusable IO_APIC IRQs from the list of IRQs
that ISA PNP is trying to allocate from - but only if IO_APIC IRQs are available.

Comments?
Martin

Jaroslav Kysela wrote:

It works only for i386.

I suggest to add:

#ifndef IO_APIC_IRQ
I used CONFIG_X86_IO_APIC instead
#define IO_APIC_IRQ(x)
#define io_apic_irqs 0
#endif

Also, isapnp_proc.c shouldn't be changed. I prefer to see all possible
IRQs, although they can't be used. Feel free to post this patch to Linus
and lkml.

I left in the patch for isapnp_proc.c just in case...


Jaroslav
updated patch

--- /home/mhvl/linux.orig/drivers/pnp/isapnp.c  Tue Jul 25 20:25:14 2000
+++ /home/mhvl/linux/drivers/pnp/isapnp.c   Wed Sep 20 20:59:56 2000
@@ -33,10 +33,10 @@
 #include linux/delay.h
 #include asm/io.h
 #include asm/dma.h
-#include asm/irq.h
 #include linux/pci.h
 #include linux/init.h
 #include linux/isapnp.h
+#include linux/irq.h

 LIST_HEAD(isapnp_cards);
 LIST_HEAD(isapnp_devices);
@@ -48,6 +48,11 @@
 #define ISAPNP_DEBUG
 #endif

+#ifndef CONFIG_X86_IO_APIC
+#define IO_APIC_IRQ(x) 0
+#define io_apic_irqs 0
+#endif
+
 struct resource *pidxr_res = NULL;
 struct resource *pnpwrp_res = NULL;
 struct resource *isapnp_rdp_res = NULL;
@@ -1598,6 +1603,8 @@

if (irq  0 || irq  15)
return 1;
+   if (!IO_APIC_IRQ(irq)  io_apic_irqs)
+   return 1;
for (i = 0; i  16; i++) {
if (isapnp_reserve_irq[i] == irq)
return 1;
--- /home/mhvl/linux.orig/drivers/pnp/isapnp_proc.c Wed Jul 12 23:58:43 2000
+++ /home/mhvl/linux/drivers/pnp/isapnp_proc.c  Tue Sep 19 17:52:25 2000
@@ -30,6 +30,12 @@
 #include asm/uaccess.h
 #include linux/smp_lock.h
 #include linux/isapnp.h
+#include linux/irq.h
+
+#ifndef CONFIG_X86_IO_APIC
+#define IO_APIC_IRQ(x) 0
+#define io_apic_irqs 0
+#endif

 struct isapnp_info_buffer {
char *buffer;   /* pointer to begin of buffer */
@@ -448,7 +454,7 @@

isapnp_printf(buffer, "%sIRQ ", space);
for (i = 0; i  16; i++)
-   if (irq-map  (1i)) {
+   if (irq-map  (1i)  (IO_APIC_IRQ(i) || !io_apic_irqs)) {
if (!first) {
isapnp_printf(buffer, ",");
} else {


--
From previous e-mails

"M.H.VanLeeuwen" wrote:
 
 Jaroslav Kysela wrote:
 
  On Tue, 29 Aug 2000, M.H.VanLeeuwen wrote:
 
   With default BIOS settings, IRQ 5 is unavailable for ISA yet
   it is being assigned by the ne.c driver and NFS root system
   doesn't finish booting.
  
   Is this a driver problem or a ISAPNP problem?
 
  It is general IRQ manager problem. ISA PnP code cannot determine which
  interrupts cannot be used. I think that we need some next layer between
  IO APIC control code and Plug  Play routines to exchange this information
  to prevent allocation of a free but unuseable IRQ for ISA PnP devices.
 
  Jaroslav
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
Please read the FAQ at http://www.tux.org/lkml/

Re: [PATCH] ISAPNP using an invalid IO_APIC IRQ

2000-09-21 Thread M.H.VanLeeuwen


Linus Torvalds wrote:
 
 On Thu, 21 Sep 2000, M.H.VanLeeuwen wrote:
 
  Is this patch acceptable?
 
 Please explain.
 
 The test seems to be that "if there are IO_APICs, a PnP irq _has_ to be an
 IO_APIC irq".
 
  +   if (!IO_APIC_IRQ(irq)  io_apic_irqs)
  +   return 1;
 
 Which makes no sense to me. Why would a legacy irq not be acceptable?

firstly, I have no understanding of IO_APIC vs. legacy IRQ handling in i386
hardware so what I say may be completely wrong.  I'll explain my circumstance
and we'll go from there.

Legacy would be just fine, but ISAPNP is attempting to assign an IRQ that is
unavailable on my system.  In otherwords, the BIOS has assigned it to some
other function and thus the IRQ doesn't appear to Linux as even existing.
In my case IRQ5 was assigned to "DISPLAY CONTROLLER" and IRQ5 isn't available
to the system, excerpts from dmesg:
...
Bus #2 is ISA
I/O APIC #2 Version 17 at 0xFEC0.
Int: type 0, pol 0, trig 0, bus 2, IRQ 04, APIC ID 2, APIC INT 04  *** NO ISA IRQ 5 ***
Int: type 0, pol 0, trig 0, bus 2, IRQ 06, APIC ID 2, APIC INT 06
...
ENABLING IO-APIC IRQs
...changing IO-APIC physical APIC ID to 2 ... ok.
Synchronizing Arb IDs.
init IO_APIC IRQs
 IO-APIC (apicid-pin) 2-0, 2-5, 2-10, 2-11, 2-17, 2-20, 2-21, 2-22, 2-23 not connected.
 ^ no IRQ 5 ?
...
 IRQ redirection table:
 NR Log Phy Mask Trig IRR Pol Stat Dest Deli Vect:
 04 003 03  000   0   01149
 05 000 00  100   0   00000   *** IRQ 5 ??? ***
 06 003 03  000   0   01151

Yet ISAPNP decided to assign that IRQ5 to my NIC.  ne.c fails to operate
when there are plenty of other IRQs still available.

The difference is in this logic between _no_ IO-APIC and _yes_ IO_APIC
in isapnp.c

isapnp_for_each_dev(dev) {
if (dev-active) {
if (dev-irq_resource[0].start == irq ||
dev-irq_resource[1].start == irq)
return 1;

}
}  

since w/o IO_APIC no PIC-APIC IRQ transform is done and this returns 1
and IRQ 5 is not used.  With IO_APIC no match occurs and the system attempt
to use IRQ 5, thus a hung system.

Hence, my attempt to rule out APIC IRQs that were unavailable to the system,
also assuming the legacy IRQs would also not be available.  This may not be
globally true, but seems to be on my system.  Abit BP6, nu Bios.

This entire problem would not have bothered me too badly, had the system
come up and run, but w/o explicitly telling the BIOS to reserve an ISA
IRQ ISAPNP eventually decides to try IRQ 12.  IRQ sharing with PS/2 mouse
and ne.c fail miserably.  On 2.2.X I had been using IRQ9 for ne.c card.
With 2.4 IRQ 9 is listed after IRQ 12 in xtab[16] (isapnp.c) so the system
fails to boot.

 Especially as they seem to be acceptable if there are _no_ IO-APIC irq's.

Only if they work. ;)

More info on request or tell me to go away, _grin_ again.

Martin
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
Please read the FAQ at http://www.tux.org/lkml/

Re: ISAPNP using an invalid IRQ, 2.4.0-test7

2000-09-18 Thread M.H.VanLeeuwen


Jaroslav Kysela wrote:
> 
> On Tue, 29 Aug 2000, M.H.VanLeeuwen wrote:
> 
> > With default BIOS settings, IRQ 5 is unavailable for ISA yet
> > it is being assigned by the ne.c driver and NFS root system
> > doesn't finish booting.
> >
> > Is this a driver problem or a ISAPNP problem?
> 
> It is general IRQ manager problem. ISA PnP code cannot determine which
> interrupts cannot be used. I think that we need some next layer between
> IO APIC control code and Plug & Play routines to exchange this information
> to prevent allocation of a free but unuseable IRQ for ISA PnP devices.
> 
> Jaroslav
> 

Jaroslav,

This patch is a first approximation to the above problem, can U look it over?

Basically, it discards any unusable IO_APIC IRQs from the list of IRQs that
ISA PNP is trying to allocate from - but only if IO_APIC IRQs are available.
This works for none APIC systems as well since io_apic_irqs s/b zero if
APIC's don't exist.

Comments?
Martin

--- /home/mhvl/linux.orig/drivers/pnp/isapnp.c  Tue Jul 25 20:25:14 2000
+++ isapnp.cMon Sep 18 21:28:39 2000
@@ -33,10 +33,10 @@
 #include 
 #include 
 #include 
-#include 
 #include 
 #include 
 #include 
+#include 

 LIST_HEAD(isapnp_cards);
 LIST_HEAD(isapnp_devices);
@@ -1598,6 +1598,8 @@

if (irq < 0 || irq > 15)
return 1;
+   if (!IO_APIC_IRQ(irq) && io_apic_irqs)
+   return 1;
for (i = 0; i < 16; i++) {
if (isapnp_reserve_irq[i] == irq)
return 1;
--- /home/mhvl/linux.orig/drivers/pnp/isapnp_proc.c Wed Jul 12 23:58:43 2000
+++ isapnp_proc.c   Wed Sep 13 22:39:47 2000
@@ -30,6 +30,7 @@
 #include 
 #include 
 #include 
+#include 

 struct isapnp_info_buffer {
char *buffer;   /* pointer to begin of buffer */
@@ -448,7 +449,7 @@

isapnp_printf(buffer, "%sIRQ ", space);
for (i = 0; i < 16; i++)
-   if (irq->map & (1<map & (1<http://www.tux.org/lkml/

Re: Update Linux 2.4 Status/TODO list

2000-09-13 Thread M.H.VanLeeuwen


David Ford wrote:
> 
> Andre Hedrick wrote:
> 
> > > 4. Boot Time Failures
> > >
> > >  * Use PCI DMA 'lost interrupt' problem with some hw [which ?] (NEC
> > >Versa LX with PIIX tuning)
> >
> > If this is a rare version of the BX/LX that has a no fix errata, then it
> > will be messy to issue resets to get out of the loop.
> >
> > >  * PIIXn tuning can hang laptop (2.4.0-test8-pre6, David Ford)
> >
> > Need more details of how APM/ACPI is dorking with DMA settins by the OEM.
> 
> These two are both reported by me, are the same issue.  The exact same kernel,
> one with PIIXn tuning enabled, will hang the hardware on boot requiring a
> physical power loss to restart.
> 
> No tuning options applied, only the capability enabled in the kernel, makes it
> crash.
> 
> If necessary I can again provide the boot log and a verbose lspci and whatever
> else is desired.  I'll even let you log in and look at it.
> 

I have the very same problem with my system w/ PIIXn enabled on 2.4.0-test8, BP6 MB.
If you need another system/person/datapoint to help narrow down the variables I can
supply whatever info you may need or try various patches.

Martin
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
Please read the FAQ at http://www.tux.org/lkml/

Re: ISAPNP using an invalid IRQ, 2.4.0-test7

2000-08-30 Thread M.H.VanLeeuwen

Jaroslav Kysela wrote:
> 
> On Tue, 29 Aug 2000, M.H.VanLeeuwen wrote:
> 
> > With default BIOS settings, IRQ 5 is unavailable for ISA yet
> > it is being assigned by the ne.c driver and NFS root system
> > doesn't finish booting.
> >
> > Is this a driver problem or a ISAPNP problem?
> 
> It is general IRQ manager problem. ISA PnP code cannot determine which
> interrupts cannot be used. I think that we need some next layer between
> IO APIC control code and Plug & Play routines to exchange this information
> to prevent allocation of a free but unuseable IRQ for ISA PnP devices.
> 
> Jaroslav

Kind of what I figured, too bad Linux 2.4 PNP is akin to Plug'N Pray (trademark MS ;)).
It seems as though 2.4 is taking a step backwards since the same system/bios setup
worked just fine on 2.2 with userland PNP utilities.  Making PNP work for more
cases would sure be nice.

Do you know of anyone actively working to make the above kernel adjustments?
Can it be done relatively easily?

Martin
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
Please read the FAQ at http://www.tux.org/lkml/

Re: ISAPNP using an invalid IRQ, 2.4.0-test7

2000-08-30 Thread M.H.VanLeeuwen


Jaroslav Kysela wrote:
 
 On Tue, 29 Aug 2000, M.H.VanLeeuwen wrote:
 
  With default BIOS settings, IRQ 5 is unavailable for ISA yet
  it is being assigned by the ne.c driver and NFS root system
  doesn't finish booting.
 
  Is this a driver problem or a ISAPNP problem?
 
 It is general IRQ manager problem. ISA PnP code cannot determine which
 interrupts cannot be used. I think that we need some next layer between
 IO APIC control code and Plug  Play routines to exchange this information
 to prevent allocation of a free but unuseable IRQ for ISA PnP devices.
 
 Jaroslav

Kind of what I figured, too bad Linux 2.4 PNP is akin to Plug'N Pray (trademark MS ;)).
It seems as though 2.4 is taking a step backwards since the same system/bios setup
worked just fine on 2.2 with userland PNP utilities.  Making PNP work for more
cases would sure be nice.

Do you know of anyone actively working to make the above kernel adjustments?
Can it be done relatively easily?

Martin
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
Please read the FAQ at http://www.tux.org/lkml/

38 matches

Mail list logo