Re: Problem with ata layer in 2.6.24

2008-01-28 Thread Michal Jaegermann
On Mon, Jan 28, 2008 at 08:31:57PM -0500, Gene Heskett wrote:
> 
> In my script, its one line:
> mkinitrd -f initrd-$VER.img $VER && \
> 
> where $VER is the shell variable I edit to = the version number, located at 
> the top of the script.
> 
> Unforch, its failing:
> No module pata_amd found for kernel 2.6.24, aborting.

mkinitrd is just a shell script.  Even if its options, and there is
a quite a number of these, do not allow to influence a choice of
modules in a desired manner, it is pretty trivial to make yourself a
custom version of it and just hardwire there a fixed list of modules
to use instead of relying on general mechanisms which are trying
hard to guess what you may need.

That way your regular 'mkinitrd' will build something to boot with
libata and 'mkinird.ide' will use IDE modules for that purpose using
the same "core" kernel.

If you are using distribution kernels, as opposed to your own
configuration, it is quite likely that you will need to install
'kernel-devel' package and recompile and add required IDE modules
yourself as those may be not provided.  This is done the same way
like for any other "external" module.

   Michal
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


Re: Problem with ata layer in 2.6.24

2008-01-28 Thread Michal Jaegermann
On Mon, Jan 28, 2008 at 08:31:57PM -0500, Gene Heskett wrote:
 
 In my script, its one line:
 mkinitrd -f initrd-$VER.img $VER  \
 
 where $VER is the shell variable I edit to = the version number, located at 
 the top of the script.
 
 Unforch, its failing:
 No module pata_amd found for kernel 2.6.24, aborting.

mkinitrd is just a shell script.  Even if its options, and there is
a quite a number of these, do not allow to influence a choice of
modules in a desired manner, it is pretty trivial to make yourself a
custom version of it and just hardwire there a fixed list of modules
to use instead of relying on general mechanisms which are trying
hard to guess what you may need.

That way your regular 'mkinitrd' will build something to boot with
libata and 'mkinird.ide' will use IDE modules for that purpose using
the same core kernel.

If you are using distribution kernels, as opposed to your own
configuration, it is quite likely that you will need to install
'kernel-devel' package and recompile and add required IDE modules
yourself as those may be not provided.  This is done the same way
like for any other external module.

   Michal
--
To unsubscribe from this list: send the line unsubscribe linux-kernel in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


Re: 2.6.20.6 vanilla does't boot

2007-04-18 Thread Michal Jaegermann
On Wed, Apr 18, 2007 at 03:39:25PM -0400, Len Brown wrote:
> On Sunday 15 April 2007 11:50, Michal Jaegermann wrote:
> > 
> > A kernel derived from 2.6.21-rc6-git1 (2.6.20-1.3053.fc7.x86_64 from
> > Fedora "rawhide" to be more precise) did boot on the hardware in
> > question, though; but only when I gave it 'acpi=off'.  Without that
> > parameter it was getting stuck apparently when starting hotplug.
> > In that kernel case disks were accessed using pata_atiixp driver.
> 
> If "acpi=off" is necessary to boot the latest kernel, please
> report an ACPI bug:
> http://bugzilla.kernel.org/enter_bug.cgi?product=ACPI

I now travel and what I can do at this moment is somewhat limited.
In particular I cannot gain an access to the hardware in question.
But please see
https://bugzilla.redhat.com/bugzilla/show_bug.cgi?id=232490
and the most recent comments there in particular.

> Please mention in the bug report what the latest working kernel was.

This is mentioned in the referenced report as well.

Michal
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


Re: 2.6.20.6 vanilla does't boot

2007-04-18 Thread Michal Jaegermann
On Wed, Apr 18, 2007 at 03:39:25PM -0400, Len Brown wrote:
 On Sunday 15 April 2007 11:50, Michal Jaegermann wrote:
  
  A kernel derived from 2.6.21-rc6-git1 (2.6.20-1.3053.fc7.x86_64 from
  Fedora rawhide to be more precise) did boot on the hardware in
  question, though; but only when I gave it 'acpi=off'.  Without that
  parameter it was getting stuck apparently when starting hotplug.
  In that kernel case disks were accessed using pata_atiixp driver.
 
 If acpi=off is necessary to boot the latest kernel, please
 report an ACPI bug:
 http://bugzilla.kernel.org/enter_bug.cgi?product=ACPI

I now travel and what I can do at this moment is somewhat limited.
In particular I cannot gain an access to the hardware in question.
But please see
https://bugzilla.redhat.com/bugzilla/show_bug.cgi?id=232490
and the most recent comments there in particular.

 Please mention in the bug report what the latest working kernel was.

This is mentioned in the referenced report as well.

Michal
-
To unsubscribe from this list: send the line unsubscribe linux-kernel in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


Re: 2.6.20.6 vanilla does't boot

2007-04-15 Thread Michal Jaegermann
On Fri, Apr 13, 2007 at 03:38:18PM +0400, Denis Kirjanov wrote:
> I updated the BIOS to the latest version, but the problem persists.
> Boots option pci = noacpi not solved the problem. Reporting bios bug
> disappears when setting pci = nommconf, But the kernel is still not
> loaded (

On x86_64 hardware using ata_piix I was unable to boot kernels
based on the current 2.6.20.x either.  Regardless of extra kernel
parameters used I was getting consistently 'hdc: lost interrupt',
and/or similar and after something like that the whole machine
was dead.  The only differences were if I could still reboot
from a keyboard or if I really have to pull a plug.

A kernel derived from 2.6.21-rc6-git1 (2.6.20-1.3053.fc7.x86_64 from
Fedora "rawhide" to be more precise) did boot on the hardware in
question, though; but only when I gave it 'acpi=off'.  Without that
parameter it was getting stuck apparently when starting hotplug.
In that kernel case disks were accessed using pata_atiixp driver.

   Michal
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


Re: 2.6.20.6 vanilla does't boot

2007-04-15 Thread Michal Jaegermann
On Fri, Apr 13, 2007 at 03:38:18PM +0400, Denis Kirjanov wrote:
 I updated the BIOS to the latest version, but the problem persists.
 Boots option pci = noacpi not solved the problem. Reporting bios bug
 disappears when setting pci = nommconf, But the kernel is still not
 loaded (

On x86_64 hardware using ata_piix I was unable to boot kernels
based on the current 2.6.20.x either.  Regardless of extra kernel
parameters used I was getting consistently 'hdc: lost interrupt',
and/or similar and after something like that the whole machine
was dead.  The only differences were if I could still reboot
from a keyboard or if I really have to pull a plug.

A kernel derived from 2.6.21-rc6-git1 (2.6.20-1.3053.fc7.x86_64 from
Fedora rawhide to be more precise) did boot on the hardware in
question, though; but only when I gave it 'acpi=off'.  Without that
parameter it was getting stuck apparently when starting hotplug.
In that kernel case disks were accessed using pata_atiixp driver.

   Michal
-
To unsubscribe from this list: send the line unsubscribe linux-kernel in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


Re: [1/4] 2.6.21-rc5: known regressions (v2)

2007-03-31 Thread Michal Jaegermann
On Sat, Mar 31, 2007 at 05:01:23PM +0200, Adrian Bunk wrote:
> On Fri, Mar 30, 2007 at 06:23:10PM -0600, Michal Jaegermann wrote:
> > On Fri, Mar 30, 2007 at 11:32:09PM +0200, Adrian Bunk wrote:
> > > 
> > > Subject: kernels fail to boot with drives on ATIIXP controller
> > >  (ACPI/IRQ related)
> > > References : https://bugzilla.redhat.com/bugzilla/show_bug.cgi?id=229621
> > >  http://lkml.org/lkml/2007/3/4/257
> > > Submitter  : Michal Jaegermann <[EMAIL PROTECTED]>
> > > Status : unknown
> > 
> > I have now even better one with pata_via.  A kernel, which for
> > all practical purposes is 2.6.21-rc5, not only refuses to boot
> > (and I cannot find some option combination which would allow me to
> > do so anyway) but simply refuses to read _any_ data from a media.
> > This included a partitioning information.
> > 
> > Earlier kernel on the same hardware boots without raising any fuss.
> > 
> > Details are collected as
> > https://bugzilla.redhat.com/bugzilla/show_bug.cgi?id=234650
> 
> If I understand this correctly, a plain 2.6.20 kernel is already broken?

You mean that a quoted report talks about 2.6.20-1.3025.fc7 kernel?
These are vagaries of kernel version numbering in Fedora.
Changelogs are not that clear but it appears that
2.6.19-1.2911.6.4.fc6 will be actually closer to 2.6.20.
That kernel from a bug report is really, for all intents and purposes,
2.6.21-rc5 (if I am not misreading something).

I am afraid that I do not have at this moment an easy to way to check
"plain" 2.6.20 on the hardware in question.  It appears that the
essential difference is that a working kernel is using and old IDE
driver, and sees the drive - in this case - as /dev/hdc, while the
current one tries to go through libata and chockes uncontrollably.

   Michal
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


Re: [1/4] 2.6.21-rc5: known regressions (v2)

2007-03-31 Thread Michal Jaegermann
On Sat, Mar 31, 2007 at 05:01:23PM +0200, Adrian Bunk wrote:
 On Fri, Mar 30, 2007 at 06:23:10PM -0600, Michal Jaegermann wrote:
  On Fri, Mar 30, 2007 at 11:32:09PM +0200, Adrian Bunk wrote:
   
   Subject: kernels fail to boot with drives on ATIIXP controller
(ACPI/IRQ related)
   References : https://bugzilla.redhat.com/bugzilla/show_bug.cgi?id=229621
http://lkml.org/lkml/2007/3/4/257
   Submitter  : Michal Jaegermann [EMAIL PROTECTED]
   Status : unknown
  
  I have now even better one with pata_via.  A kernel, which for
  all practical purposes is 2.6.21-rc5, not only refuses to boot
  (and I cannot find some option combination which would allow me to
  do so anyway) but simply refuses to read _any_ data from a media.
  This included a partitioning information.
  
  Earlier kernel on the same hardware boots without raising any fuss.
  
  Details are collected as
  https://bugzilla.redhat.com/bugzilla/show_bug.cgi?id=234650
 
 If I understand this correctly, a plain 2.6.20 kernel is already broken?

You mean that a quoted report talks about 2.6.20-1.3025.fc7 kernel?
These are vagaries of kernel version numbering in Fedora.
Changelogs are not that clear but it appears that
2.6.19-1.2911.6.4.fc6 will be actually closer to 2.6.20.
That kernel from a bug report is really, for all intents and purposes,
2.6.21-rc5 (if I am not misreading something).

I am afraid that I do not have at this moment an easy to way to check
plain 2.6.20 on the hardware in question.  It appears that the
essential difference is that a working kernel is using and old IDE
driver, and sees the drive - in this case - as /dev/hdc, while the
current one tries to go through libata and chockes uncontrollably.

   Michal
-
To unsubscribe from this list: send the line unsubscribe linux-kernel in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


Re: [1/4] 2.6.21-rc5: known regressions (v2)

2007-03-30 Thread Michal Jaegermann
On Fri, Mar 30, 2007 at 11:32:09PM +0200, Adrian Bunk wrote:
> 
> Subject: kernels fail to boot with drives on ATIIXP controller
>  (ACPI/IRQ related)
> References : https://bugzilla.redhat.com/bugzilla/show_bug.cgi?id=229621
>  http://lkml.org/lkml/2007/3/4/257
> Submitter  : Michal Jaegermann <[EMAIL PROTECTED]>
> Status : unknown

I have now even better one with pata_via.  A kernel, which for
all practical purposes is 2.6.21-rc5, not only refuses to boot
(and I cannot find some option combination which would allow me to
do so anyway) but simply refuses to read _any_ data from a media.
This included a partitioning information.

Earlier kernel on the same hardware boots without raising any fuss.

Details are collected as
https://bugzilla.redhat.com/bugzilla/show_bug.cgi?id=234650

   Michal
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


Re: [1/4] 2.6.21-rc5: known regressions (v2)

2007-03-30 Thread Michal Jaegermann
On Fri, Mar 30, 2007 at 11:32:09PM +0200, Adrian Bunk wrote:
 
 Subject: kernels fail to boot with drives on ATIIXP controller
  (ACPI/IRQ related)
 References : https://bugzilla.redhat.com/bugzilla/show_bug.cgi?id=229621
  http://lkml.org/lkml/2007/3/4/257
 Submitter  : Michal Jaegermann [EMAIL PROTECTED]
 Status : unknown

I have now even better one with pata_via.  A kernel, which for
all practical purposes is 2.6.21-rc5, not only refuses to boot
(and I cannot find some option combination which would allow me to
do so anyway) but simply refuses to read _any_ data from a media.
This included a partitioning information.

Earlier kernel on the same hardware boots without raising any fuss.

Details are collected as
https://bugzilla.redhat.com/bugzilla/show_bug.cgi?id=234650

   Michal
-
To unsubscribe from this list: send the line unsubscribe linux-kernel in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


Re: [3/6] 2.6.21-rc2: known regressions

2007-03-04 Thread Michal Jaegermann
On Mon, Mar 05, 2007 at 02:50:36AM +0100, Adrian Bunk wrote:
> This email lists some known regressions in 2.6.21-rc2 compared to 2.6.20
> that are not yet fixed in Linus' tree.

> Subject: kernels fail to boot with drives on ATIIXP controller
> References : https://bugzilla.redhat.com/bugzilla/show_bug.cgi?id=229621
> Submitter  : Michal Jaegermann <[EMAIL PROTECTED]>
> Status : unknown

Alan added comment to my posting that my problems are caused by
messed up IRQ routing on that box I tried.  Indeed, I can boot
kernel 2.6.20-1.2962.fc7, which really is 2.6.21-rc2, provided
I will use 'acpi=off irqpoll'.  Anything else and a boot silently
dies if 'acpi=off' is skipped or is not finding disk if 'irqpoll'
is missing.

Somehow 2.6.19 is booting on the same hardware without "valliant
efforts"; OTOH 'ahci' driver was not used there.

Michal
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


Re: [3/6] 2.6.21-rc2: known regressions

2007-03-04 Thread Michal Jaegermann
On Mon, Mar 05, 2007 at 02:50:36AM +0100, Adrian Bunk wrote:
 This email lists some known regressions in 2.6.21-rc2 compared to 2.6.20
 that are not yet fixed in Linus' tree.

 Subject: kernels fail to boot with drives on ATIIXP controller
 References : https://bugzilla.redhat.com/bugzilla/show_bug.cgi?id=229621
 Submitter  : Michal Jaegermann [EMAIL PROTECTED]
 Status : unknown

Alan added comment to my posting that my problems are caused by
messed up IRQ routing on that box I tried.  Indeed, I can boot
kernel 2.6.20-1.2962.fc7, which really is 2.6.21-rc2, provided
I will use 'acpi=off irqpoll'.  Anything else and a boot silently
dies if 'acpi=off' is skipped or is not finding disk if 'irqpoll'
is missing.

Somehow 2.6.19 is booting on the same hardware without valliant
efforts; OTOH 'ahci' driver was not used there.

Michal
-
To unsubscribe from this list: send the line unsubscribe linux-kernel in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


Re: [PATCH] libata: Cable detection fixes

2007-03-01 Thread Michal Jaegermann
On Thu, Mar 01, 2007 at 08:33:17PM -0500, Jeff Garzik wrote:
> 
> That little change, buried in the middle of Alan's patch, changes the 
> probing order for a /lot/ of devices, possibly millions, when you 
> consider that it changes behavior of ata_piix (Intel SATA) as well as 
> all the not-yet-default PATA controllers.

Hm, I got recently hands on a hardware where 2.6.21-rc1 based
kernels from Fedora rawhide simply do not boot as there is no
way to get to disks.  I would not mind some change in behavior
although so far I can boot at least some earlier kernels.

This looks like ATIIXP issue and details are here:
https://bugzilla.redhat.com/bugzilla/show_bug.cgi?id=229621
Changelogs for kernels in question have this:

* Wed Feb 21 2007 Dave Jones <[EMAIL PROTECTED]>
- 2.6.21-rc1

   Michal
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


Re: [PATCH] libata: Cable detection fixes

2007-03-01 Thread Michal Jaegermann
On Thu, Mar 01, 2007 at 08:33:17PM -0500, Jeff Garzik wrote:
 
 That little change, buried in the middle of Alan's patch, changes the 
 probing order for a /lot/ of devices, possibly millions, when you 
 consider that it changes behavior of ata_piix (Intel SATA) as well as 
 all the not-yet-default PATA controllers.

Hm, I got recently hands on a hardware where 2.6.21-rc1 based
kernels from Fedora rawhide simply do not boot as there is no
way to get to disks.  I would not mind some change in behavior
although so far I can boot at least some earlier kernels.

This looks like ATIIXP issue and details are here:
https://bugzilla.redhat.com/bugzilla/show_bug.cgi?id=229621
Changelogs for kernels in question have this:

* Wed Feb 21 2007 Dave Jones [EMAIL PROTECTED]
- 2.6.21-rc1

   Michal
-
To unsubscribe from this list: send the line unsubscribe linux-kernel in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


Re: why can't I remove a kernel module (or: what uses a given module)?

2006-12-04 Thread Michal Jaegermann
On Mon, Dec 04, 2006 at 11:10:01AM +0100, Tobias Oed wrote:
> Tomasz Chmielewski wrote:
> 
> > Yes, something's using that drive, be it a program, a module (unlikely),
> > or something that is compiled directly in the kernel (for example,
> > md/raid1).
> 
> Since you mention md, dm comes to mind. I have seen a couple of drives that
> were previously attached to fake raid controllers becoming unavailable when
> moved to a normal controller. I haven't found the one size workaround for
> the problem yet.

 dmraid -r -E 

Yes, I was hit by this in a different context.  A command is
described on 'man dmraid' but figuring out what was the issue
took me a long while.

   Michal
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


Re: why can't I remove a kernel module (or: what uses a given module)?

2006-12-04 Thread Michal Jaegermann
On Mon, Dec 04, 2006 at 11:10:01AM +0100, Tobias Oed wrote:
 Tomasz Chmielewski wrote:
 
  Yes, something's using that drive, be it a program, a module (unlikely),
  or something that is compiled directly in the kernel (for example,
  md/raid1).
 
 Since you mention md, dm comes to mind. I have seen a couple of drives that
 were previously attached to fake raid controllers becoming unavailable when
 moved to a normal controller. I haven't found the one size workaround for
 the problem yet.

 dmraid -r -E device_in_question

Yes, I was hit by this in a different context.  A command is
described on 'man dmraid' but figuring out what was the issue
took me a long while.

   Michal
-
To unsubscribe from this list: send the line unsubscribe linux-kernel in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


Re: A "new driver model" and EXPORT_SYMBOL_GPL question

2005-07-06 Thread Michal Jaegermann
On Tue, Jul 05, 2005 at 02:57:40PM -0700, Greg KH wrote:
> On Tue, Jul 05, 2005 at 03:50:43PM -0600, Zan Lynx wrote:
> > Sourced from here:
> > http://hulllug.principalhosting.net/archive/index.php/t-52440.html
> 
> No, that is not the same topic or thread.

Formally you are correct but from my POV this sounds casuistic and
fit for a patent lawyer.  You were "recently advised not to change
these symbols" and you stated that you will not. So instead you did
an end run and you removed an old interface and introduced a
replacement; but this time with EXPORT_SYMBOL_GPL - which has the
same effect as what you told you will not do.

> If you know of any closed source code, using those functions, please put
> them in contact with me.

Well, I gave an example in my original question.  Yes, I asked them
to contact you.  If they will do that I have no idea.

   Michal
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


Re: A new driver model and EXPORT_SYMBOL_GPL question

2005-07-06 Thread Michal Jaegermann
On Tue, Jul 05, 2005 at 02:57:40PM -0700, Greg KH wrote:
 On Tue, Jul 05, 2005 at 03:50:43PM -0600, Zan Lynx wrote:
  Sourced from here:
  http://hulllug.principalhosting.net/archive/index.php/t-52440.html
 
 No, that is not the same topic or thread.

Formally you are correct but from my POV this sounds casuistic and
fit for a patent lawyer.  You were recently advised not to change
these symbols and you stated that you will not. So instead you did
an end run and you removed an old interface and introduced a
replacement; but this time with EXPORT_SYMBOL_GPL - which has the
same effect as what you told you will not do.

 If you know of any closed source code, using those functions, please put
 them in contact with me.

Well, I gave an example in my original question.  Yes, I asked them
to contact you.  If they will do that I have no idea.

   Michal
-
To unsubscribe from this list: send the line unsubscribe linux-kernel in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


Re: A "new driver model" and EXPORT_SYMBOL_GPL question

2005-07-04 Thread Michal Jaegermann
On Sun, Jul 03, 2005 at 10:44:41PM -0700, Greg KH wrote:
> On Sun, Jul 03, 2005 at 05:12:02PM -0600, Michal Jaegermann wrote:
> 
> > Was a decision to use EXPORT_SYMBOL_GPL deliberate and if yes then
> > what considerations dictated it, other then the patch author wrote
> > it that way, and what drivers in question are supposed to use when
> > this change will show up in the mainline?  It looks that 2.6.13
> > will do this.
> 
> Please see the archives for the answers to these questions.

I actually tried that before posting.  Maybe I attempted to look for
wrong things but, beyond conversion examples, I found some postings
with a general theme "there is no point to make life easy for
binary-only modules" and not much else.  I am afraid that this
leaves me not much wiser.

Also, at least when dealing with 2.6.13-r1, neither
Documentation/feature-removal-schedule.txt, nor files in
Documentation/driver-model/ directory, mention anything about a
switch to EXPORT_SYMBOL_GPL for relevant symbols.  The only thing I
can find about the later in "feature-removal..." is a note about RCU
API.

   Michal
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


Re: A new driver model and EXPORT_SYMBOL_GPL question

2005-07-04 Thread Michal Jaegermann
On Sun, Jul 03, 2005 at 10:44:41PM -0700, Greg KH wrote:
 On Sun, Jul 03, 2005 at 05:12:02PM -0600, Michal Jaegermann wrote:
 
  Was a decision to use EXPORT_SYMBOL_GPL deliberate and if yes then
  what considerations dictated it, other then the patch author wrote
  it that way, and what drivers in question are supposed to use when
  this change will show up in the mainline?  It looks that 2.6.13
  will do this.
 
 Please see the archives for the answers to these questions.

I actually tried that before posting.  Maybe I attempted to look for
wrong things but, beyond conversion examples, I found some postings
with a general theme there is no point to make life easy for
binary-only modules and not much else.  I am afraid that this
leaves me not much wiser.

Also, at least when dealing with 2.6.13-r1, neither
Documentation/feature-removal-schedule.txt, nor files in
Documentation/driver-model/ directory, mention anything about a
switch to EXPORT_SYMBOL_GPL for relevant symbols.  The only thing I
can find about the later in feature-removal... is a note about RCU
API.

   Michal
-
To unsubscribe from this list: send the line unsubscribe linux-kernel in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


[OT] Re: Microsoft and Xenix.

2001-06-24 Thread Michal Jaegermann

On Mon, Jun 25, 2001 at 12:20:40AM +0200, Daniel Phillips wrote:
> On Sunday 24 June 2001 12:36, Rob Landley wrote:
> > On Saturday 23 June 2001 22:47, Eric W. Biederman wrote:
> > > GEM was a gui from Digital Research I believe.
> > > Geoworks/Geos was a seperate entity.
> >
> > Ah, the DR-DOS answer to dosshell/windows.  Cool.  (I used Dr. Dos byt
> > never tried its gui.)
> 
> GEM had its moment of glory when Xerox used it for the gui of Ventura 
> Publisher.

GEM (a slight variation) was also providing GUI on Atari ST.  At that
time it was heavily beating pants off from anything M$ was able to
cobble together on nominally much faster machines.

   Michal
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/



[OT] Re: Microsoft and Xenix.

2001-06-24 Thread Michal Jaegermann

On Mon, Jun 25, 2001 at 12:20:40AM +0200, Daniel Phillips wrote:
 On Sunday 24 June 2001 12:36, Rob Landley wrote:
  On Saturday 23 June 2001 22:47, Eric W. Biederman wrote:
   GEM was a gui from Digital Research I believe.
   Geoworks/Geos was a seperate entity.
 
  Ah, the DR-DOS answer to dosshell/windows.  Cool.  (I used Dr. Dos byt
  never tried its gui.)
 
 GEM had its moment of glory when Xerox used it for the gui of Ventura 
 Publisher.

GEM (a slight variation) was also providing GUI on Atari ST.  At that
time it was heavily beating pants off from anything M$ was able to
cobble together on nominally much faster machines.

   Michal
-
To unsubscribe from this list: send the line unsubscribe linux-kernel in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/



Minor "cleanup" patches for 2.4.5-ac kernels

2001-06-12 Thread Michal Jaegermann

Here are some small, but in times important, "gotchas" in current
2.4.5-ac kernels.

When compiling SMP 'udelay' in current drivers/pci/quirks.c expands to:

   __udelay((15), cpu_data[(current->processor)]...

and a type for 'current' is not known, at least on alpha, so
the following seems to be in order:

--- linux-2.4.5ac/drivers/pci/quirks.c~ Tue Jun 12 16:31:12 2001
+++ linux-2.4.5ac/drivers/pci/quirks.c  Tue Jun 12 17:13:18 2001
@@ -18,6 +18,7 @@
 #include 
 #include 
 #include 
+#include 
 
 #undef DEBUG
 
There is no problem if SMP is not configured.

This one is replacing a symbol in sg.c to one which is exported
so 'sg.o' can be compiled as a valid module.

--- linux-2.4.5ac/drivers/scsi/sg.c~Tue May 29 17:52:09 2001
+++ linux-2.4.5ac/drivers/scsi/sg.c Tue May 29 18:40:17 2001
@@ -2603,7 +2603,7 @@
 num = (count < 10) ? count : 10;
 copy_from_user(buff, buffer, num);
 buff[num] = '\0';
-sg_allow_dio = simple_strtol(buff, 0, 10) ? 1 : 0;
+sg_allow_dio = simple_strtoul(buff, 0, 10) ? 1 : 0;
 return count;
 }
 
 
And this one, proposed already some few times by Ivan Kokshaysky,

--- 2.4.5-ac11/include/linux/binfmts.h  Mon Jun  4 14:19:00 2001
+++ linux/include/linux/binfmts.h   Mon Jun  4 20:24:50 2001
@@ -32,6 +32,9 @@ struct linux_binprm{
unsigned long loader, exec;
 };
 
+/* Forward declaration */
+struct mm_struct;
+
 /*
  * This structure defines the functions that are used to load the binary formats that
  * linux accepts.

kills a flood of warnings (at least on Alpha) about 'mm_struct'
defined on a parameter list.

Are there any reasons which would make any of those "bad"?

  Michal
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/



Minor cleanup patches for 2.4.5-ac kernels

2001-06-12 Thread Michal Jaegermann

Here are some small, but in times important, gotchas in current
2.4.5-ac kernels.

When compiling SMP 'udelay' in current drivers/pci/quirks.c expands to:

   __udelay((15), cpu_data[(current-processor)]...

and a type for 'current' is not known, at least on alpha, so
the following seems to be in order:

--- linux-2.4.5ac/drivers/pci/quirks.c~ Tue Jun 12 16:31:12 2001
+++ linux-2.4.5ac/drivers/pci/quirks.c  Tue Jun 12 17:13:18 2001
@@ -18,6 +18,7 @@
 #include linux/pci.h
 #include linux/init.h
 #include linux/delay.h
+#include linux/sched.h
 
 #undef DEBUG
 
There is no problem if SMP is not configured.

This one is replacing a symbol in sg.c to one which is exported
so 'sg.o' can be compiled as a valid module.

--- linux-2.4.5ac/drivers/scsi/sg.c~Tue May 29 17:52:09 2001
+++ linux-2.4.5ac/drivers/scsi/sg.c Tue May 29 18:40:17 2001
@@ -2603,7 +2603,7 @@
 num = (count  10) ? count : 10;
 copy_from_user(buff, buffer, num);
 buff[num] = '\0';
-sg_allow_dio = simple_strtol(buff, 0, 10) ? 1 : 0;
+sg_allow_dio = simple_strtoul(buff, 0, 10) ? 1 : 0;
 return count;
 }
 
 
And this one, proposed already some few times by Ivan Kokshaysky,

--- 2.4.5-ac11/include/linux/binfmts.h  Mon Jun  4 14:19:00 2001
+++ linux/include/linux/binfmts.h   Mon Jun  4 20:24:50 2001
@@ -32,6 +32,9 @@ struct linux_binprm{
unsigned long loader, exec;
 };
 
+/* Forward declaration */
+struct mm_struct;
+
 /*
  * This structure defines the functions that are used to load the binary formats that
  * linux accepts.

kills a flood of warnings (at least on Alpha) about 'mm_struct'
defined on a parameter list.

Are there any reasons which would make any of those bad?

  Michal
-
To unsubscribe from this list: send the line unsubscribe linux-kernel in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/



Tulip 0.9.15-pre3 - still no dice

2001-06-04 Thread Michal Jaegermann

A tulip driver 0.9.15-pre3, as included in 2.4.5-ac8, still does not
work for me and I have to replace it with 0.9.14d (April 3, 2001) to
get a functional network.

Trying it with 'tulip_debug=3' option I see this:


Linux Tulip driver version 0.9.15-pre3 (June 1, 2001)
00:0b.0: MWI config mwi=0, cacheline=16, csr0=00a0d000
tulip0:  EEPROM default media type Autosense.
tulip0:  Index #0 - Media 10baseT (#0) described by a 21142 Serial PHY (2) block.
tulip0:  Index #1 - Media 10baseT-FDX (#4) described by a 21142 Serial PHY (2) block.
tulip0:  Index #2 - Media 100baseTx (#3) described by a 21143 SYM PHY (4) block.
tulip0:  Index #3 - Media 100baseTx-FDX (#5) described by a 21143 SYM PHY (4) block.
eth0: Digital DS21143 Tulip rev 65 at 0x8800, 00:00:F0:51:00:72, IRQ 11.
eth0: Restarting 21143 autonegotiation, csr14=0003.
eth0: tulip_up(), irq==11.
eth0: Restarting 21143 autonegotiation, csr14=0003.
eth0: Done tulip_open(), CSR0 f8a0d000, CSR5 f016 CSR6 b2422202.
eth0: 21143 link status interrupt 50ca, CSR5 f0668010, fffb.
eth0: Autonegotiation failed, using 10baseT, link beat status 50ca.
eth0: 21143 non-MII 10baseT transceiver control 08af/0005.
eth0:  Setting CSR15 to 08af0008/00050008.
eth0: Using media type 10baseT, CSR12 is c6.
eth0:  Setting CSR6 8242/b2422002 CSR12 10c6.
eth0: 21143 negotiation status 10c6, 10baseT.
eth0: 21143 negotiation failed, status 10c6.
eth0: Testing new 21143 media 100baseTx.
eth0: The transmitter stopped.  CSR5 is f0008102, CSR6 b242, new CSR6 8386.
eth0: 21143 link status interrupt 02ca, CSR5 f0668010, fffbff7f.
eth0: 21143 100baseTx link beat failed.
eth0: Restarting 21143 autonegotiation, csr14=0003.
eth0: 21143 link status interrupt 50ca, CSR5 f0008010, fffb.
eth0: Autonegotiation failed, using 10baseT, link beat status 50ca.
.
(and a loop with at "Autonegotiation failed" until 'ifdown eth0')
.
eth0: Shutting down ethercard, status was f102.

There was a variation once. A status changed to this:

eth0: Restarting 21143 autonegotiation, csr14=0003.
eth0: 21143 link status interrupt 52ca, CSR5 f0008010, fffb.
eth0: Autonegotiation failed, using 10baseT, link beat status 52ca.

but it went back to the same loop:

This is, for a comparison, the same level of debug with 0.9.14d working 
driver.  Note different values for CSR0 and CSR5 on tulip_open().

Linux Tulip driver version 0.9.14d (April 3, 2001)
eth0: Digital DS21143 Tulip rev 65 at 0x8800, 00:00:F0:51:00:72, IRQ 11.
eth0:  EEPROM default media type Autosense.
eth0:  Index #0 - Media 10baseT (#0) described by a 21142 Serial PHY (2) block.
eth0:  Index #1 - Media 10baseT-FDX (#4) described by a 21142 Serial PHY (2) block.
eth0:  Index #2 - Media 100baseTx (#3) described by a 21143 SYM PHY (4) block.
eth0:  Index #3 - Media 100baseTx-FDX (#5) described by a 21143 SYM PHY (4) block.
eth0: Restarting 21143 autonegotiation, csr14=0003.
eth0: tulip_up(), irq==11.
eth0: Restarting 21143 autonegotiation, csr14=0003.
eth0: Done tulip_open(), CSR0 f8a0e000, CSR5 f072 CSR6 b2422202.
eth0: 21143 link status interrupt 50ca, CSR5 f0668010, fffb.
eth0: Autonegotiation failed, using 10baseT, link beat status 50ca.
eth0: 21143 non-MII 10baseT transceiver control 08af/0005.
eth0:  Setting CSR15 to 08af0008/00050008.
eth0: Using media type 10baseT, CSR12 is ca.
eth0:  Setting CSR6 8242/b2422002 CSR12 50ca.
eth0: 21143 negotiation status 50ca, 10baseT.

Here is an output of tulip-diag, as much as I can get, in a non-working
state:

tulip-diag.c:v2.06 1/8/2001 Donald Becker ([EMAIL PROTECTED])
 http://www.scyld.com/diag/index.html
Index #1: Found a Digital DS21143 Tulip adapter at 0x8800.
Digital DS21143 Tulip chip registers at 0x8800:
 0x00: f8a0d000   0efb 0efb0200 f000 b2420200 fbfffbff
 0x40: e000 cbf8   10ce 0001 fffb 8ff5
 Port selection is 10mpbs-serial, full-duplex.
 Transmit stopped, Receive stopped, full-duplex.
  The Rx process state is 'Stopped'.
  The Tx process state is 'Stopped'.
  The transmit threshold is 72.
  The NWay status register is 10ce.
EEPROM 64 words, 6 address bits.
PCI Subsystem IDs, vendor 1011, device 500b.
CardBus Information Structure at offset .
Ethernet MAC Station Address 00:00:F0:51:00:72.
EEPROM transceiver/media description table.
Leaf node at offset 65, default media type 0800 (Autosense).
 4 transceiver description blocks:
  Media 10baseT, block type 2, length 6.
   Serial transceiver for 10baseT (media type 0).
GP pin direction 08af  GP pin data 0005.
  Media 10baseT-Full Duplex, block type 2, length 6.
   Serial transceiver for 10baseT-Full Duplex (media type 4).
GP pin direction 08af  GP pin data 0005.
  Media 100baseTx, block type 4, length 8.
   SYM transceiver for 100baseTx (media type 3).
GP pin direction 08af  GP pin data 0005.
No media detection indication (command 80 61).
  Media 

Tulip 0.9.15-pre3 - still no dice

2001-06-04 Thread Michal Jaegermann

A tulip driver 0.9.15-pre3, as included in 2.4.5-ac8, still does not
work for me and I have to replace it with 0.9.14d (April 3, 2001) to
get a functional network.

Trying it with 'tulip_debug=3' option I see this:


Linux Tulip driver version 0.9.15-pre3 (June 1, 2001)
00:0b.0: MWI config mwi=0, cacheline=16, csr0=00a0d000
tulip0:  EEPROM default media type Autosense.
tulip0:  Index #0 - Media 10baseT (#0) described by a 21142 Serial PHY (2) block.
tulip0:  Index #1 - Media 10baseT-FDX (#4) described by a 21142 Serial PHY (2) block.
tulip0:  Index #2 - Media 100baseTx (#3) described by a 21143 SYM PHY (4) block.
tulip0:  Index #3 - Media 100baseTx-FDX (#5) described by a 21143 SYM PHY (4) block.
eth0: Digital DS21143 Tulip rev 65 at 0x8800, 00:00:F0:51:00:72, IRQ 11.
eth0: Restarting 21143 autonegotiation, csr14=0003.
eth0: tulip_up(), irq==11.
eth0: Restarting 21143 autonegotiation, csr14=0003.
eth0: Done tulip_open(), CSR0 f8a0d000, CSR5 f016 CSR6 b2422202.
eth0: 21143 link status interrupt 50ca, CSR5 f0668010, fffb.
eth0: Autonegotiation failed, using 10baseT, link beat status 50ca.
eth0: 21143 non-MII 10baseT transceiver control 08af/0005.
eth0:  Setting CSR15 to 08af0008/00050008.
eth0: Using media type 10baseT, CSR12 is c6.
eth0:  Setting CSR6 8242/b2422002 CSR12 10c6.
eth0: 21143 negotiation status 10c6, 10baseT.
eth0: 21143 negotiation failed, status 10c6.
eth0: Testing new 21143 media 100baseTx.
eth0: The transmitter stopped.  CSR5 is f0008102, CSR6 b242, new CSR6 8386.
eth0: 21143 link status interrupt 02ca, CSR5 f0668010, fffbff7f.
eth0: 21143 100baseTx link beat failed.
eth0: Restarting 21143 autonegotiation, csr14=0003.
eth0: 21143 link status interrupt 50ca, CSR5 f0008010, fffb.
eth0: Autonegotiation failed, using 10baseT, link beat status 50ca.
.
(and a loop with at Autonegotiation failed until 'ifdown eth0')
.
eth0: Shutting down ethercard, status was f102.

There was a variation once. A status changed to this:

eth0: Restarting 21143 autonegotiation, csr14=0003.
eth0: 21143 link status interrupt 52ca, CSR5 f0008010, fffb.
eth0: Autonegotiation failed, using 10baseT, link beat status 52ca.

but it went back to the same loop:

This is, for a comparison, the same level of debug with 0.9.14d working 
driver.  Note different values for CSR0 and CSR5 on tulip_open().

Linux Tulip driver version 0.9.14d (April 3, 2001)
eth0: Digital DS21143 Tulip rev 65 at 0x8800, 00:00:F0:51:00:72, IRQ 11.
eth0:  EEPROM default media type Autosense.
eth0:  Index #0 - Media 10baseT (#0) described by a 21142 Serial PHY (2) block.
eth0:  Index #1 - Media 10baseT-FDX (#4) described by a 21142 Serial PHY (2) block.
eth0:  Index #2 - Media 100baseTx (#3) described by a 21143 SYM PHY (4) block.
eth0:  Index #3 - Media 100baseTx-FDX (#5) described by a 21143 SYM PHY (4) block.
eth0: Restarting 21143 autonegotiation, csr14=0003.
eth0: tulip_up(), irq==11.
eth0: Restarting 21143 autonegotiation, csr14=0003.
eth0: Done tulip_open(), CSR0 f8a0e000, CSR5 f072 CSR6 b2422202.
eth0: 21143 link status interrupt 50ca, CSR5 f0668010, fffb.
eth0: Autonegotiation failed, using 10baseT, link beat status 50ca.
eth0: 21143 non-MII 10baseT transceiver control 08af/0005.
eth0:  Setting CSR15 to 08af0008/00050008.
eth0: Using media type 10baseT, CSR12 is ca.
eth0:  Setting CSR6 8242/b2422002 CSR12 50ca.
eth0: 21143 negotiation status 50ca, 10baseT.

Here is an output of tulip-diag, as much as I can get, in a non-working
state:

tulip-diag.c:v2.06 1/8/2001 Donald Becker ([EMAIL PROTECTED])
 http://www.scyld.com/diag/index.html
Index #1: Found a Digital DS21143 Tulip adapter at 0x8800.
Digital DS21143 Tulip chip registers at 0x8800:
 0x00: f8a0d000   0efb 0efb0200 f000 b2420200 fbfffbff
 0x40: e000 cbf8   10ce 0001 fffb 8ff5
 Port selection is 10mpbs-serial, full-duplex.
 Transmit stopped, Receive stopped, full-duplex.
  The Rx process state is 'Stopped'.
  The Tx process state is 'Stopped'.
  The transmit threshold is 72.
  The NWay status register is 10ce.
EEPROM 64 words, 6 address bits.
PCI Subsystem IDs, vendor 1011, device 500b.
CardBus Information Structure at offset .
Ethernet MAC Station Address 00:00:F0:51:00:72.
EEPROM transceiver/media description table.
Leaf node at offset 65, default media type 0800 (Autosense).
 4 transceiver description blocks:
  Media 10baseT, block type 2, length 6.
   Serial transceiver for 10baseT (media type 0).
GP pin direction 08af  GP pin data 0005.
  Media 10baseT-Full Duplex, block type 2, length 6.
   Serial transceiver for 10baseT-Full Duplex (media type 4).
GP pin direction 08af  GP pin data 0005.
  Media 100baseTx, block type 4, length 8.
   SYM transceiver for 100baseTx (media type 3).
GP pin direction 08af  GP pin data 0005.
No media detection indication (command 80 61).
  Media 

Current tulip driver from 2.4.5 is plain broken

2001-05-28 Thread Michal Jaegermann

I mentioned that before but this should be stated clearly.  As far as I
am concerned "Linux Tulip driver version 0.9.15-pre2 (May 16, 2001)",
as used in 2.4.5 - and other kernels - is totally buggered.  It comes up,
and ethernet interfaces can be configured, but does not matter how I am
playing with options I cannot get a single packet through.

Replacing it in 2.4.5 with "Linux Tulip driver version 0.9.14d (April 3,
2001)", which I have handy, restores sanity immediately and a network
simply works without any heroic efforts.

My NIC is "Digital DS21143 Tulip rev 65 at 0x8800".  BTW - a version
"tulip-1.1.7" from sourceforge behaves exactly like 0.9.15-pre2.

This is an output of 'tulip-diag -af' from a working setup:

tulip-diag.c:v2.06 1/8/2001 Donald Becker ([EMAIL PROTECTED])
 http://www.scyld.com/diag/index.html
Index #1: Found a Digital DS21143 Tulip adapter at 0x8800.
Digital DS21143 Tulip chip registers at 0x8800:
 0x00: f8a0e000   0d572000 0d572200 f066 b2422002 fbfffbff
 0x40: e000 fff583ff   52ca 0001 fffb 8ffd0008
 Port selection is 10mpbs-serial, half-duplex.
 Transmit started, Receive started, half-duplex.
  The Rx process state is 'Waiting for packets'.
  The Tx process state is 'Idle'.
  The transmit threshold is 72.
  The NWay status register is 52ca.
  Internal autonegotiation state is 'Negotiation complete'.

And this what shows up when I am trying to use "the-new-and-improved":

tulip-diag.c:v2.06 1/8/2001 Donald Becker ([EMAIL PROTECTED])
 http://www.scyld.com/diag/index.html
Index #1: Found a Digital DS21143 Tulip adapter at 0x8800.
Digital DS21143 Tulip chip registers at 0x8800:
 0x00: f8a0d000   0b9f4000 0b9f4200 f000 b2420200 fbfffbff
 0x40: e000 fff483ff   60ca 0001 fffb 8ffd
 Port selection is 10mpbs-serial, full-duplex.
 Transmit stopped, Receive stopped, full-duplex.
  The Rx process state is 'Stopped'.
  The Tx process state is 'Stopped'.
  The transmit threshold is 72.
  The NWay status register is 60ca.
  Internal autonegotiation state is 'Link check'.

Comments?

Michal
[EMAIL PROTECTED]

-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/



Current tulip driver from 2.4.5 is plain broken

2001-05-28 Thread Michal Jaegermann

I mentioned that before but this should be stated clearly.  As far as I
am concerned Linux Tulip driver version 0.9.15-pre2 (May 16, 2001),
as used in 2.4.5 - and other kernels - is totally buggered.  It comes up,
and ethernet interfaces can be configured, but does not matter how I am
playing with options I cannot get a single packet through.

Replacing it in 2.4.5 with Linux Tulip driver version 0.9.14d (April 3,
2001), which I have handy, restores sanity immediately and a network
simply works without any heroic efforts.

My NIC is Digital DS21143 Tulip rev 65 at 0x8800.  BTW - a version
tulip-1.1.7 from sourceforge behaves exactly like 0.9.15-pre2.

This is an output of 'tulip-diag -af' from a working setup:

tulip-diag.c:v2.06 1/8/2001 Donald Becker ([EMAIL PROTECTED])
 http://www.scyld.com/diag/index.html
Index #1: Found a Digital DS21143 Tulip adapter at 0x8800.
Digital DS21143 Tulip chip registers at 0x8800:
 0x00: f8a0e000   0d572000 0d572200 f066 b2422002 fbfffbff
 0x40: e000 fff583ff   52ca 0001 fffb 8ffd0008
 Port selection is 10mpbs-serial, half-duplex.
 Transmit started, Receive started, half-duplex.
  The Rx process state is 'Waiting for packets'.
  The Tx process state is 'Idle'.
  The transmit threshold is 72.
  The NWay status register is 52ca.
  Internal autonegotiation state is 'Negotiation complete'.

And this what shows up when I am trying to use the-new-and-improved:

tulip-diag.c:v2.06 1/8/2001 Donald Becker ([EMAIL PROTECTED])
 http://www.scyld.com/diag/index.html
Index #1: Found a Digital DS21143 Tulip adapter at 0x8800.
Digital DS21143 Tulip chip registers at 0x8800:
 0x00: f8a0d000   0b9f4000 0b9f4200 f000 b2420200 fbfffbff
 0x40: e000 fff483ff   60ca 0001 fffb 8ffd
 Port selection is 10mpbs-serial, full-duplex.
 Transmit stopped, Receive stopped, full-duplex.
  The Rx process state is 'Stopped'.
  The Tx process state is 'Stopped'.
  The transmit threshold is 72.
  The NWay status register is 60ca.
  Internal autonegotiation state is 'Link check'.

Comments?

Michal
[EMAIL PROTECTED]

-
To unsubscribe from this list: send the line unsubscribe linux-kernel in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/



Re: PROBLEM: Alpha SMP Low Outbound Bandwidth

2001-05-25 Thread Michal Jaegermann

On Fri, May 25, 2001 at 02:50:07PM -0700, Jay Thorne wrote:
> [1.] One line summary of the problem:
> Kernel 2.4.4 ac15

> Using a quad 400Mhz Dodge/Rawhide machine with Tulip or VIARhine cards,

[ description of a slowdown skipped ].

Well, it looks that you have at least something to slow down.  I could
not get a single packet through my tulip on Alpha from at least
2.4.4-ac11 and up.  You can consider that an ultimate slowdown.  I tried
also a driver from http://sourceforge.net/projects/tulip/ and results
are the same.  This NIC, Digital DS21143 Tulip rev 65, works just fine
with various earlier kernels, including assorted 2.4.3 variants.
It is on 10baseT netwok - which may, or may not, be relevant here.

  Michal
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/



2.4.4-ac17 - missing in exports simple_strtol symbol

2001-05-25 Thread Michal Jaegermann

Patches to drivers/scsi/sg.c included in 2.4.4-ac17 require for
'sg.o' module to use 'simple_strtol' which is not exported in
kernel/ksyms.c.  Is this is an oversight or 'sg.o' should be actually
using something like 'simple_strtoul' - which is already exported?
In either case patches are obvious.

BTW - is tulip supposed to already work in this version?  Because
it does not.

  Michal
  [EMAIL PROTECTED]
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/



2.4.4-ac17 - missing in exports simple_strtol symbol

2001-05-25 Thread Michal Jaegermann

Patches to drivers/scsi/sg.c included in 2.4.4-ac17 require for
'sg.o' module to use 'simple_strtol' which is not exported in
kernel/ksyms.c.  Is this is an oversight or 'sg.o' should be actually
using something like 'simple_strtoul' - which is already exported?
In either case patches are obvious.

BTW - is tulip supposed to already work in this version?  Because
it does not.

  Michal
  [EMAIL PROTECTED]
-
To unsubscribe from this list: send the line unsubscribe linux-kernel in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/



Re: PROBLEM: Alpha SMP Low Outbound Bandwidth

2001-05-25 Thread Michal Jaegermann

On Fri, May 25, 2001 at 02:50:07PM -0700, Jay Thorne wrote:
 [1.] One line summary of the problem:
 Kernel 2.4.4 ac15

 Using a quad 400Mhz Dodge/Rawhide machine with Tulip or VIARhine cards,

[ description of a slowdown skipped ].

Well, it looks that you have at least something to slow down.  I could
not get a single packet through my tulip on Alpha from at least
2.4.4-ac11 and up.  You can consider that an ultimate slowdown.  I tried
also a driver from http://sourceforge.net/projects/tulip/ and results
are the same.  This NIC, Digital DS21143 Tulip rev 65, works just fine
with various earlier kernels, including assorted 2.4.3 variants.
It is on 10baseT netwok - which may, or may not, be relevant here.

  Michal
-
To unsubscribe from this list: send the line unsubscribe linux-kernel in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/



Re: Compile fails an Alpha: include/asm/pci.h included from arch/alpha/kernel/setup.c

2001-05-21 Thread Michal Jaegermann

On Mon, May 21, 2001 at 05:18:55PM +0200, Jan-Benedict Glaw wrote:
> 
> Kernel 2.4.5-pre[34] don't compile on Alpha:
> 


> 152 struct pci_controller *hose = pdev->sysdata;
 ^^^

This is the problem (a type for 'pdev' is not defined).
And this is a possible fix:


--- linux-2.4.4ac/include/asm-alpha/pci.h~  Sat May 19 16:43:11 2001
+++ linux-2.4.4ac/include/asm-alpha/pci.h   Sat May 19 17:23:56 2001
@@ -6,6 +6,7 @@
 #include 
 #include 
 #include 
+#include 
 
 /*
  * The following structure is used to manage multiple PCI busses.

The patch is for 2.4.4-ac11, so offsets are possibly slightly different,
but probably not. :-)

  Michal
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/



Re: Compile fails an Alpha: include/asm/pci.h included from arch/alpha/kernel/setup.c

2001-05-21 Thread Michal Jaegermann

On Mon, May 21, 2001 at 05:18:55PM +0200, Jan-Benedict Glaw wrote:
 
 Kernel 2.4.5-pre[34] don't compile on Alpha:
 


 152 struct pci_controller *hose = pdev-sysdata;
 ^^^

This is the problem (a type for 'pdev' is not defined).
And this is a possible fix:


--- linux-2.4.4ac/include/asm-alpha/pci.h~  Sat May 19 16:43:11 2001
+++ linux-2.4.4ac/include/asm-alpha/pci.h   Sat May 19 17:23:56 2001
@@ -6,6 +6,7 @@
 #include linux/spinlock.h
 #include asm/scatterlist.h
 #include asm/machvec.h
+#include linux/pci.h
 
 /*
  * The following structure is used to manage multiple PCI busses.

The patch is for 2.4.4-ac11, so offsets are possibly slightly different,
but probably not. :-)

  Michal
-
To unsubscribe from this list: send the line unsubscribe linux-kernel in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/



Troubles with 8139too and 2.2.19

2001-05-07 Thread Michal Jaegermann

It looks like tha 8139too, at least in 2.2.19, works fine when there
is one such card around but with NICs it detects both but fails to
set the second one correctly (complaints about incorrect IRQ, memory,
... - you name it).  Does anybody has some ideas what is going on here?

This was observed on Alpha, BTW, but this tidbit is likely not relevant
and, yes, we used pairs of other NICs before on similar machines.

Alternate rtl8139 module from that kernel cannot detect D-link DFE-538
card.  A version of this driver from http://www.scyld.com seems to be ok
(at least both cards start) once you will get it to compile. :-)

  Michal
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/



Troubles with 8139too and 2.2.19

2001-05-07 Thread Michal Jaegermann

It looks like tha 8139too, at least in 2.2.19, works fine when there
is one such card around but with NICs it detects both but fails to
set the second one correctly (complaints about incorrect IRQ, memory,
... - you name it).  Does anybody has some ideas what is going on here?

This was observed on Alpha, BTW, but this tidbit is likely not relevant
and, yes, we used pairs of other NICs before on similar machines.

Alternate rtl8139 module from that kernel cannot detect D-link DFE-538
card.  A version of this driver from http://www.scyld.com seems to be ok
(at least both cards start) once you will get it to compile. :-)

  Michal
-
To unsubscribe from this list: send the line unsubscribe linux-kernel in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/



Re: Why recovering from broken configs is too hard

2001-05-03 Thread Michal Jaegermann

On Thu, May 03, 2001 at 03:47:55AM -0400, Eric S. Raymond wrote:
> OK, so you want CML2's "make oldconfig" to do something more graceful than
> simply say "Foo! You violated this constraint! Go fix it!"

After all this combinatorics I still do not know an answer to a simple
question.  With the current system I can do

   grep -v '^#' .config > config.stripped

copy 'config.stripped' back to .config, type 'make oldconfig' and hold
 for a while to get my old .config back.  This is actually
very useful, and used every day, although maybe not precisely in the
manner as above. :-)

So, the question is: can I do something something like that with CML2?
If the answer is "no" then something is very seriously missing and no
references to halting problems can cover that.  If, OTOH, the answer
is "yes" then what is a big trouble?

   Michal
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/



Re: Why recovering from broken configs is too hard

2001-05-03 Thread Michal Jaegermann

On Thu, May 03, 2001 at 03:47:55AM -0400, Eric S. Raymond wrote:
 OK, so you want CML2's make oldconfig to do something more graceful than
 simply say Foo! You violated this constraint! Go fix it!

After all this combinatorics I still do not know an answer to a simple
question.  With the current system I can do

   grep -v '^#' .config  config.stripped

copy 'config.stripped' back to .config, type 'make oldconfig' and hold
Return for a while to get my old .config back.  This is actually
very useful, and used every day, although maybe not precisely in the
manner as above. :-)

So, the question is: can I do something something like that with CML2?
If the answer is no then something is very seriously missing and no
references to halting problems can cover that.  If, OTOH, the answer
is yes then what is a big trouble?

   Michal
-
To unsubscribe from this list: send the line unsubscribe linux-kernel in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/



Re: Linux 2.4.4-ac3

2001-05-02 Thread Michal Jaegermann

On Tue, May 01, 2001 at 10:28:35PM +0100, Alan Cox wrote:
> 

> 2.4.4-ac3


Does not compile on Alpha.  I have a strange feeling that because
of this:-)
> o Fix module exception race on Alpha  (Andrea Arcangeli)

A declaration was forgotten and, comparing with i386 equivalent, this
minor correction is required:

--- linux-2.4.4-ac/arch/alpha/mm/extable.c~ Wed May  2 11:08:43 2001
+++ linux-2.4.4-ac/arch/alpha/mm/extable.c  Wed May  2 12:08:50 2001
@@ -36,6 +36,8 @@
 
 register unsigned long gp __asm__("$29");
 
+extern spinlock_t modlist_lock;
+
 static unsigned
 search_exception_table_without_gp(unsigned long addr)
 {

Michal

-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/



Re: Linux 2.4.4-ac3

2001-05-02 Thread Michal Jaegermann

On Tue, May 01, 2001 at 10:28:35PM +0100, Alan Cox wrote:
 

 2.4.4-ac3


Does not compile on Alpha.  I have a strange feeling that because
of this:-)
 o Fix module exception race on Alpha  (Andrea Arcangeli)

A declaration was forgotten and, comparing with i386 equivalent, this
minor correction is required:

--- linux-2.4.4-ac/arch/alpha/mm/extable.c~ Wed May  2 11:08:43 2001
+++ linux-2.4.4-ac/arch/alpha/mm/extable.c  Wed May  2 12:08:50 2001
@@ -36,6 +36,8 @@
 
 register unsigned long gp __asm__($29);
 
+extern spinlock_t modlist_lock;
+
 static unsigned
 search_exception_table_without_gp(unsigned long addr)
 {

Michal

-
To unsubscribe from this list: send the line unsubscribe linux-kernel in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/



Re: BUG: Global FPU corruption in 2.2

2001-04-24 Thread Michal Jaegermann

On Tue, Apr 24, 2001 at 06:56:32PM +0200, Christian Ehrhardt wrote:
> On Tue, Apr 24, 2001 at 09:10:07AM -0700, Linus Torvalds wrote:
> > ptrace only operates on processes that are stopped. So there are no
> > locking issues - we've synchronized on a much higher level than a
> > spinlock or semaphore.
> 
> This is only true for requests other than PTRACE_ATTACH and
> PTRACE_ATTACH is exactly what I'm worried about.

May I remind everybody that at the beginning of this thread I posted
another example, from an SMP Alpha, of FPU problems.  It certainly
was not exactly like the one under discussion but it looked that
it had a similar "smell" to it.

It looks like that to reproduce this Alpha example one needs processors
with a rather fast clock and this hardware version is not yet very
widely available.

  Michal
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/



Re: BUG: Global FPU corruption in 2.2

2001-04-24 Thread Michal Jaegermann

On Tue, Apr 24, 2001 at 06:56:32PM +0200, Christian Ehrhardt wrote:
 On Tue, Apr 24, 2001 at 09:10:07AM -0700, Linus Torvalds wrote:
  ptrace only operates on processes that are stopped. So there are no
  locking issues - we've synchronized on a much higher level than a
  spinlock or semaphore.
 
 This is only true for requests other than PTRACE_ATTACH and
 PTRACE_ATTACH is exactly what I'm worried about.

May I remind everybody that at the beginning of this thread I posted
another example, from an SMP Alpha, of FPU problems.  It certainly
was not exactly like the one under discussion but it looked that
it had a similar smell to it.

It looks like that to reproduce this Alpha example one needs processors
with a rather fast clock and this hardware version is not yet very
widely available.

  Michal
-
To unsubscribe from this list: send the line unsubscribe linux-kernel in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/



Re: BUG: Global FPU corruption in 2.2

2001-04-19 Thread Michal Jaegermann

On Thu, Apr 19, 2001 at 11:05:03AM -0500, Victor Zandy wrote:
> 
> We have found that one of our programs can cause system-wide
> corruption of the x86 FPU under 2.2.16 and 2.2.17.

> 
> We see this problem on dual 550MHz Xeons with 1GB RAM.

Hm, I started to wonder if this is not somewhat related to a recent
report I got.  "The victim" was running 2.2.19 (basically) on an SMP
Alpha UP2000+ with two 800 MHz processors.  He managed to reduce the
problem to a rather small test case and I attach sources,  Makefile and
a "loop.sh" driver as a shar archive if you want to have a closer look.

This "loop.sh" simply fires triplets of "harry" process in a loop.
The guy hit by this gets apparently random floating point exceptions
starting with roughly sixth process and later intervals between bombs
will vary.  I have also 'strace' outputs from failing processes but
they are not telling very much.  'gdb' is also not very illuminating:

Program received signal SIGFPE, Arithmetic exception.
0x1200010a8 in vadd_ (a=0x11fff21e4, ia=0x120003294, b=0x11fff7004, 
ib=0x120003294, c=0x11fffbe20, ic=0x120003294, n=0x11c70) at vadd.f:99
99   C(CI) = A(AI) + B(BI)
Current language:  auto; currently fortran

(gdb) p *ia
$10 = 1
(gdb) p *ib
$11 = 1
(gdb) p *ic
$12 = 1
(gdb) p *n
Cannot access memory at address 0x4
(gdb) p *(0x11c70)
$13 = 1024

(gdb) info locals
n = (PTR TO -> ( integer )) 0x4
__g77_expr_0 = 10


He tells me that he is getting that on two different machines he has
around.

The trouble is that I tried to repeat that with different hardware,
kernels, compilers and libraries and I failed even on SMP; but I got an
access to a box with only 667 MHz processors.  OTOH he is running
right now 2.4.3-ac9 plus Andrea Arcangeli patches for rw semaphores
on Alpha and he reports that the problem went away (and, hopefuly,
nothing else will crop out :-).

Anybody can offer an insight what that may really be?  It may be,
of course, totally unrelated to this report from Victor Zandy.

  Michal
  [EMAIL PROTECTED]


 fpbomb.shar


Re: BUG: Global FPU corruption in 2.2

2001-04-19 Thread Michal Jaegermann

On Thu, Apr 19, 2001 at 11:05:03AM -0500, Victor Zandy wrote:
 
 We have found that one of our programs can cause system-wide
 corruption of the x86 FPU under 2.2.16 and 2.2.17.

 
 We see this problem on dual 550MHz Xeons with 1GB RAM.

Hm, I started to wonder if this is not somewhat related to a recent
report I got.  "The victim" was running 2.2.19 (basically) on an SMP
Alpha UP2000+ with two 800 MHz processors.  He managed to reduce the
problem to a rather small test case and I attach sources,  Makefile and
a "loop.sh" driver as a shar archive if you want to have a closer look.

This "loop.sh" simply fires triplets of "harry" process in a loop.
The guy hit by this gets apparently random floating point exceptions
starting with roughly sixth process and later intervals between bombs
will vary.  I have also 'strace' outputs from failing processes but
they are not telling very much.  'gdb' is also not very illuminating:

Program received signal SIGFPE, Arithmetic exception.
0x1200010a8 in vadd_ (a=0x11fff21e4, ia=0x120003294, b=0x11fff7004, 
ib=0x120003294, c=0x11fffbe20, ic=0x120003294, n=0x11c70) at vadd.f:99
99   C(CI) = A(AI) + B(BI)
Current language:  auto; currently fortran

(gdb) p *ia
$10 = 1
(gdb) p *ib
$11 = 1
(gdb) p *ic
$12 = 1
(gdb) p *n
Cannot access memory at address 0x4
(gdb) p *(0x11c70)
$13 = 1024

(gdb) info locals
n = (PTR TO - ( integer )) 0x4
__g77_expr_0 = 10


He tells me that he is getting that on two different machines he has
around.

The trouble is that I tried to repeat that with different hardware,
kernels, compilers and libraries and I failed even on SMP; but I got an
access to a box with only 667 MHz processors.  OTOH he is running
right now 2.4.3-ac9 plus Andrea Arcangeli patches for rw semaphores
on Alpha and he reports that the problem went away (and, hopefuly,
nothing else will crop out :-).

Anybody can offer an insight what that may really be?  It may be,
of course, totally unrelated to this report from Victor Zandy.

  Michal
  [EMAIL PROTECTED]


 fpbomb.shar


Re: /proc/config idea

2001-04-02 Thread Michal Jaegermann

On Mon, Apr 02, 2001 at 05:39:19PM -0700, David Lang wrote:
> 
> if the distro/sysadmin _always_ installs the kernel the 'right way' then
> the difference isn't nessasarily that large, but if you want reliability
> on any system it may be worth loosing a page or so of memory (hasn't
> someone said that the data can be compressed to <1K?)

After throwing away in a Makefile rule all "is not set" lines, as they
are trivially recoverable with 'make oldconfig', what is left for an
avarege kernel compresses to something like 500 bytes.  Quite a bit
of space left on this one page if you need more extensive .config.
'zcat /proc/config.gz' works just fine.

As most kernels around are NOT installed "the right way" I found that in
practice separating configuration information from a kernel image is not
even close to be semi-reliable on a longer run.  Those who say
"installation script", and similar things, assume that people compile
kernels for themselves.  This is undoubtely true for folks on this list;
this does not start to approximate the situation in general and, it
seems, that we really want it that way. :-)

BTW - /sbin/installkernel, as seen in practice, is not even correct for
a general case with x86; not to mention other architectures.  Writing
something like /var/log/config from "init data" during a bootup could be
another solution which does not take any kernel memory and still keeps
all this information attached to a kernel image itself.  OTOH we have
all these tons of strings which show in /proc/pci output and somehow
these do not cause such huge opposition.  Yes, I know that 'lspci' was
supposed to replace that; but it did not.

  Michal
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/



Re: /proc/config idea

2001-04-02 Thread Michal Jaegermann

On Mon, Apr 02, 2001 at 05:39:19PM -0700, David Lang wrote:
 
 if the distro/sysadmin _always_ installs the kernel the 'right way' then
 the difference isn't nessasarily that large, but if you want reliability
 on any system it may be worth loosing a page or so of memory (hasn't
 someone said that the data can be compressed to 1K?)

After throwing away in a Makefile rule all "is not set" lines, as they
are trivially recoverable with 'make oldconfig', what is left for an
avarege kernel compresses to something like 500 bytes.  Quite a bit
of space left on this one page if you need more extensive .config.
'zcat /proc/config.gz' works just fine.

As most kernels around are NOT installed "the right way" I found that in
practice separating configuration information from a kernel image is not
even close to be semi-reliable on a longer run.  Those who say
"installation script", and similar things, assume that people compile
kernels for themselves.  This is undoubtely true for folks on this list;
this does not start to approximate the situation in general and, it
seems, that we really want it that way. :-)

BTW - /sbin/installkernel, as seen in practice, is not even correct for
a general case with x86; not to mention other architectures.  Writing
something like /var/log/config from "init data" during a bootup could be
another solution which does not take any kernel memory and still keeps
all this information attached to a kernel image itself.  OTOH we have
all these tons of strings which show in /proc/pci output and somehow
these do not cause such huge opposition.  Yes, I know that 'lspci' was
supposed to replace that; but it did not.

  Michal
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/



Re: Linux 2.4.2ac20

2001-03-13 Thread Michal Jaegermann

On Tue, Mar 13, 2001 at 04:36:18AM +, Alan Cox wrote:
...
> 
> 2.4.2-ac20
...
> o Fix Alpha build (Jeff Garzik)

Now I see (at least on Alpha) a constant wailing:

/linux-2.4.2ac/include/linux/binfmts.h:45: warning: `struct 
mm_struct' declared inside parameter list
/linux-2.4.2ac/include/linux/binfmts.h:45: warning: its scope is 
only this definition or declaration, which is probably not what you want

Is this somehow related?

   Michal
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/



Re: Linux 2.4.2ac20

2001-03-13 Thread Michal Jaegermann

On Tue, Mar 13, 2001 at 04:36:18AM +, Alan Cox wrote:
...
 
 2.4.2-ac20
...
 o Fix Alpha build (Jeff Garzik)

Now I see (at least on Alpha) a constant wailing:

/linux-2.4.2ac/include/linux/binfmts.h:45: warning: `struct 
mm_struct' declared inside parameter list
/linux-2.4.2ac/include/linux/binfmts.h:45: warning: its scope is 
only this definition or declaration, which is probably not what you want

Is this somehow related?

   Michal
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/



Re: quicksort for linked list

2001-03-10 Thread Michal Jaegermann

On Sat, Mar 10, 2001 at 07:50:06PM +0100, Martin Mares wrote:
> Hello!
> 
> > Well, not really in this situation, after a simple modification.  It is
> > trivial to show that using "shorter interval sorted first" approach one
> > can bound an amount of an extra memory, on stack or otherwise, and by a
> > rather small number.
> 
> By O(log N) which is in reality a small number :)

Assuming that we sort a full range of 32-bit numbers (pointers on a
32-bit CPU, for example, are numbers of that kind but usually a range
can be narrowed down quite substantially) then with a bit of a careful
programming you need, I think, something like 16 extra 4-byte words or
maybe even a bit less.  I do not remember precisely, as I was doing this
exercise a long time ago, but even if this is 32, and you need carefuly
constructed example to need them all these extra cells, I still think
that this is not a huge amount of memory.  Especially when every element
of a list you are sorting is likely quite a bit bigger.

Exponents are something which grows these numbers pretty fast. :-)

  Michal
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/



Re: quicksort for linked list

2001-03-10 Thread Michal Jaegermann

On Sat, Mar 10, 2001 at 07:50:06PM +0100, Martin Mares wrote:
 Hello!
 
  Well, not really in this situation, after a simple modification.  It is
  trivial to show that using "shorter interval sorted first" approach one
  can bound an amount of an extra memory, on stack or otherwise, and by a
  rather small number.
 
 By O(log N) which is in reality a small number :)

Assuming that we sort a full range of 32-bit numbers (pointers on a
32-bit CPU, for example, are numbers of that kind but usually a range
can be narrowed down quite substantially) then with a bit of a careful
programming you need, I think, something like 16 extra 4-byte words or
maybe even a bit less.  I do not remember precisely, as I was doing this
exercise a long time ago, but even if this is 32, and you need carefuly
constructed example to need them all these extra cells, I still think
that this is not a huge amount of memory.  Especially when every element
of a list you are sorting is likely quite a bit bigger.

Exponents are something which grows these numbers pretty fast. :-)

  Michal
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/



Re: quicksort for linked list

2001-03-09 Thread Michal Jaegermann

On Fri, Mar 09, 2001 at 12:52:22PM +0100, Rogier Wolff wrote:
> 
> Quicksort however is an algorithm that is recursive. This means that
> it can use unbounded amounts of stack -> This is not for the kernel.

Well, not really in this situation, after a simple modification.  It is
trivial to show that using "shorter interval sorted first" approach one
can bound an amount of an extra memory, on stack or otherwise, and by a
rather small number.  This assumes that one knows what one is sorting -
which is obviously the case here.

Also my copy of Reingold, Nivergelt, Deo from 1977 presents a
"non-recursive" variant of quicksort as a kind of an "old hat" solution.
One would think that this piece of information would spread during those
years. :-)  It is a simple exercise anyway.

> Quicksort has a very bad "worst case": quadratic sort-time. Are you
> sure this won't happen?

This is much more serious objection.  You can nearly guarantee in an
itended application that somebody will find a way to feed you packets
which will ensure the worst case behaviour.  The same gotcha will
probably kill quite a few other ways to sort here.

  Michal
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/



Re: quicksort for linked list

2001-03-09 Thread Michal Jaegermann

On Fri, Mar 09, 2001 at 12:52:22PM +0100, Rogier Wolff wrote:
 
 Quicksort however is an algorithm that is recursive. This means that
 it can use unbounded amounts of stack - This is not for the kernel.

Well, not really in this situation, after a simple modification.  It is
trivial to show that using "shorter interval sorted first" approach one
can bound an amount of an extra memory, on stack or otherwise, and by a
rather small number.  This assumes that one knows what one is sorting -
which is obviously the case here.

Also my copy of Reingold, Nivergelt, Deo from 1977 presents a
"non-recursive" variant of quicksort as a kind of an "old hat" solution.
One would think that this piece of information would spread during those
years. :-)  It is a simple exercise anyway.

 Quicksort has a very bad "worst case": quadratic sort-time. Are you
 sure this won't happen?

This is much more serious objection.  You can nearly guarantee in an
itended application that somebody will find a way to feed you packets
which will ensure the worst case behaviour.  The same gotcha will
probably kill quite a few other ways to sort here.

  Michal
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/



Re: [Slightly OT] x86 PROM project

2001-03-04 Thread Michal Jaegermann

On Sun, Mar 04, 2001 at 02:02:38PM -0600, Matthew Fredrickson wrote:
> On Sun, Mar 04, 2001 at 08:08:32PM +0100, Erik Mouw wrote:
> > Have a look at OpenBIOS:
> > 
> >   http://www.freiburg.linux.de/OpenBIOS/
> > 
> > The project wants to create an IEEE 1275-1994 compliant firmware, like
> > used by SUN (for example).
> 
> I don't want to appear to be offensive in regards to this project
> considering I have no prior knowledge about it other than what I've seen
> at the web site just now, but it appears that there is a lot more talk
> than coding occuring at this project.

Ok, so what about this one?

http://www.acl.lanl.gov/linuxbios/

The code is on sourceforge.

   Michal
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/



Re: [Slightly OT] x86 PROM project

2001-03-04 Thread Michal Jaegermann

On Sun, Mar 04, 2001 at 02:02:38PM -0600, Matthew Fredrickson wrote:
 On Sun, Mar 04, 2001 at 08:08:32PM +0100, Erik Mouw wrote:
  Have a look at OpenBIOS:
  
http://www.freiburg.linux.de/OpenBIOS/
  
  The project wants to create an IEEE 1275-1994 compliant firmware, like
  used by SUN (for example).
 
 I don't want to appear to be offensive in regards to this project
 considering I have no prior knowledge about it other than what I've seen
 at the web site just now, but it appears that there is a lot more talk
 than coding occuring at this project.

Ok, so what about this one?

http://www.acl.lanl.gov/linuxbios/

The code is on sourceforge.

   Michal
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/



Re: 2.4 kernels - "attempt to access beyond end of device"

2001-02-28 Thread Michal Jaegermann

On Wed, Feb 28, 2001 at 10:54:15PM +, Petr Vandrovec wrote:
> On 28 Feb 01 at 13:46, Michal Jaegermann wrote:
> > I think that I found what gives me a hell with this box and it
> > looks like that this not Linux at all.  Once again, this is Athlon
> > K6 on Asus AV7 mobo and "Award Advanced ACPI BIOS" version 1005C.
> 
> K7 on A7V, I believe...

Maybe.  'cat /proc/cpuinfo' says "cpu family: 6" and "model :4"
(and "stepping: 2").  I possibly misinterpreted that.  Do not believe
me when I am talking about x86 chips. :-)

> > I have more checks to make before I will be fully satisfied but
> > this looks like it.
> ...
> > System Performance Setting [Optimal, Normal]
> ...
> 
> Try BIOS 1006. AFAIK 1005D changed some VIA values for 'optimal'.

Is that important here?  IDE drives in question were not connected to
on-board controller but the Promise one.  Results seem to indicate
that this 'optimal' was important here anyway.

> And 1006 contains newer Promise BIOS - but I did not notice any difference:
> Windows98 still do not boot if I connect harddisk to /dev/hdh :-(

There is at this moment Windows98 installation on /dev/hde1 and it boots
so far.   It got installed and it was booting regardless with these
"other" BIOS seetings.

> But Linux works fine...

Hope so

  Michal
  [EMAIL PROTECTED]
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/



Re: 2.4 kernels - "attempt to access beyond end of device"

2001-02-28 Thread Michal Jaegermann

I think that I found what gives me a hell with this box and it
looks like that this not Linux at all.  Once again, this is Athlon
K6 on Asus AV7 mobo and "Award Advanced ACPI BIOS" version 1005C.
I have more checks to make before I will be fully satisfied but
this looks like it.

In this BIOS setup there are two "advanced" options:

System Performance Setting [Optimal, Normal]
USB Legacy Support [Auto, Enabled, Disabled]

If the first one is set to "Normal" and the second one to "Disabled"
then the whole system becomes stable.  I copied from various file
systems to a directory+on ext2 around 1.2 GB of files without any ill
effects and run succesfully 'diff -r' between two directories 475 MB 
each.  If BIOS options are any other way then one should expect
spectacular blowups with corrupted file systems and other nasty effects
after the first oops.  It survives up to something between 130 to 150
MB of data moved, does not matter which kernel, and that is it.

It is difficult to know what is "System Performance Setting" as it
always shows "Optimal" regardless of a status on the last save.  But a
system behaviour depends on how it was set so it seems to change even if
a display, on the next visit, does not.  How "USB Legacy Support" comes
into the picture I cannot even imagine.

I did try with 2.2.19pre and 2.4 kernels and the picture does not
change.  Any rational explanation beyond that BIOS is doing something
really nasty?

  Cheers,
  Michal

-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/



Re: 2.4 kernels - attempt to access beyond end of device

2001-02-28 Thread Michal Jaegermann

I think that I found what gives me a hell with this box and it
looks like that this not Linux at all.  Once again, this is Athlon
K6 on Asus AV7 mobo and "Award Advanced ACPI BIOS" version 1005C.
I have more checks to make before I will be fully satisfied but
this looks like it.

In this BIOS setup there are two "advanced" options:

System Performance Setting [Optimal, Normal]
USB Legacy Support [Auto, Enabled, Disabled]

If the first one is set to "Normal" and the second one to "Disabled"
then the whole system becomes stable.  I copied from various file
systems to a directory+on ext2 around 1.2 GB of files without any ill
effects and run succesfully 'diff -r' between two directories 475 MB 
each.  If BIOS options are any other way then one should expect
spectacular blowups with corrupted file systems and other nasty effects
after the first oops.  It survives up to something between 130 to 150
MB of data moved, does not matter which kernel, and that is it.

It is difficult to know what is "System Performance Setting" as it
always shows "Optimal" regardless of a status on the last save.  But a
system behaviour depends on how it was set so it seems to change even if
a display, on the next visit, does not.  How "USB Legacy Support" comes
into the picture I cannot even imagine.

I did try with 2.2.19pre and 2.4 kernels and the picture does not
change.  Any rational explanation beyond that BIOS is doing something
really nasty?

  Cheers,
  Michal

-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/



Re: 2.4 kernels - attempt to access beyond end of device

2001-02-28 Thread Michal Jaegermann

On Wed, Feb 28, 2001 at 10:54:15PM +, Petr Vandrovec wrote:
 On 28 Feb 01 at 13:46, Michal Jaegermann wrote:
  I think that I found what gives me a hell with this box and it
  looks like that this not Linux at all.  Once again, this is Athlon
  K6 on Asus AV7 mobo and "Award Advanced ACPI BIOS" version 1005C.
 
 K7 on A7V, I believe...

Maybe.  'cat /proc/cpuinfo' says "cpu family: 6" and "model :4"
(and "stepping: 2").  I possibly misinterpreted that.  Do not believe
me when I am talking about x86 chips. :-)

  I have more checks to make before I will be fully satisfied but
  this looks like it.
 ...
  System Performance Setting [Optimal, Normal]
 ...
 
 Try BIOS 1006. AFAIK 1005D changed some VIA values for 'optimal'.

Is that important here?  IDE drives in question were not connected to
on-board controller but the Promise one.  Results seem to indicate
that this 'optimal' was important here anyway.

 And 1006 contains newer Promise BIOS - but I did not notice any difference:
 Windows98 still do not boot if I connect harddisk to /dev/hdh :-(

There is at this moment Windows98 installation on /dev/hde1 and it boots
so far.   It got installed and it was booting regardless with these
"other" BIOS seetings.

 But Linux works fine...

Hope so

  Michal
  [EMAIL PROTECTED]
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/



Via-rhine is not finding its interrupts under 2.2.19pre14

2001-02-27 Thread Michal Jaegermann


After I booted 2.2.19pre14 on a system with two via-rhine cards I see the
following:

via-rhine.c:v1.08b-LK1.0.0 12/14/2000  Written by Donald Becker
  http://www.scyld.com/network/via-rhine.html
eth0: VIA VT3043 Rhine at 0x9400, 00:50:ba:c1:64:d9, IRQ 0.
eth0: MII PHY found at address 8, status 0x7809 advertising 05e1 Link .
eth1: VIA VT3043 Rhine at 0x8800, 00:50:ba:ab:60:64, IRQ 0.
eth1: MII PHY found at address 8, status 0x782d advertising 05e1 Link .

and a network does not work due to these IRQ 0, I guess.

In contrast when I will boot on the same hardware 2.4.2-ac5 then I get,
among other things,

via-rhine.c:v1.08b-LK1.1.7  8/9/2000  Written by Donald Becker
  http://www.scyld.com/network/via-rhine.html
PCI: Enabling device 00:09.0 (0094 -> 0097)
PCI: Found IRQ 9 for device 00:09.0
PCI: The same IRQ used for device 00:04.2
PCI: The same IRQ used for device 00:04.3
PCI: The same IRQ used for device 00:0d.0
eth0: VIA VT3043 Rhine at 0x9400, 00:50:ba:c1:64:d9, IRQ 9.
eth0: MII PHY found at address 8, status 0x7809 advertising 05e1 Link .
PCI: Enabling device 00:0c.0 (0094 -> 0097)
PCI: Found IRQ 11 for device 00:0c.0
eth1: VIA VT3043 Rhine at 0x8800, 00:50:ba:ab:60:64, IRQ 11.
eth1: MII PHY found at address 8, status 0x782d advertising 05e1 Link .


Devices in question can be seen here:

-[00]-+-00.0  VIA Technologies, Inc. VT8363/8365 [KT133/KM133]
  +-01.0-[01]00.0  ATI Technologies Inc Rage 128 RF
  +-04.0  VIA Technologies, Inc. VT82C686 [Apollo Super South]
  +-04.1  VIA Technologies, Inc. Bus Master IDE
  +-04.2  VIA Technologies, Inc. UHCI USB
  +-04.3  VIA Technologies, Inc. UHCI USB
  +-04.4  VIA Technologies, Inc. VT82C686 [Apollo Super ACPI]
  +-09.0  VIA Technologies, Inc. VT86C100A [Rhine 10/100]
  +-0a.0  Symbios Logic Inc. (formerly NCR) 53c810
  +-0c.0  VIA Technologies, Inc. VT86C100A [Rhine 10/100]
  +-0d.0  Ensoniq CT5880 [AudioPCI]
  \-11.0  Promise Technology, Inc. 20265


I do not have turned on a support for USB or audio in either of these
two kernels.  They are actually configured pretty similar (within a
reason :-).  But network is operational with 2.4.2-ac5.  Hm...

Even with these IRQ conflicts for eth0 one would think that eth1 should
get its interrupt.  It does not conflict with anything.  Forgotten
'pci_enable()' somewhere?

  Michal

-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/



Re: 2.4 kernels - "attempt to access beyond end of device"

2001-02-27 Thread Michal Jaegermann


To add to my report about troubles with disk activity on a system with
PDC20265 IDE controller (this is on Asus AV7 mobo, BTW) I tried the
same experiments with 2.2.19pre14 patched with ide patches to get a
support for Promise.

I got similar results - i.e. problems after some 130-150 megabytes
was copied.  On different occasions I got things like that:

file_cluster badly computed!!! 0 <> 536870911
file_cluster badly computed!!! 1 <> 0

practically immediately and followed by a period of a lively disk
activity and a crash.

Whoops: end_buffer_io_async: b_count != 1 on async io.

after which 'cp' process hanged in a "D" state.

attempt to access beyond end of device
21:01: rw=0, want=537238629, limit=4506201
dev 21:01 blksize=512 blocknr=1074477258 sector=1074477258 size=512 count=1
...
(and more of these)  terminated with oops decoded below.

To take 'vfat' out of picture I also tried 'cp' from ext2 partitions
(I had to collect number of things as I do not have enough of data on
this system yet) to an ext2 partition while using 2.4.2-ac5.  This resulted
in:

EXT2-fs error (device ide3(34,9)): ext2_readdir: bad entry in
directory #16584: inode out of bounds - offset=0, inode=134234312,
rec_len=12, name_len=1
EXT2-fs error (device ide3(34,9)): ext2_readdir: bad entry in
directory #131542: inode out of bounds - offset=0, inode=134349270,
rec_len=12, name_len=1
EXT2-fs error (device ide3(34,9)): ext2_readdir: bad entry in
directory #82294: inode out of bounds - offset=0, inode=134300022,
rec_len=12, name_len=1
EXT2-fs error (device ide3(34,9)): ext2_readdir: bad entry in
directory #164456: inode out of bounds - offset=0, inode=134382184,
rec_len=12, name_len=1
EXT2-fs error (device ide3(34,9)): ext2_readdir: bad entry in
directory #98872: inode out of bounds - offset=0, inode=134316600,
rec_len=12, name_len=1
22:09: rw=0, want=537530884, limit=1574338
attempt to access beyond end of device
22:09: rw=0, want=537530884, limit=1574338
.
punctuated by oops.

Here is a decoded oops from 2.2.19pre14

Unable to handle kernel paging request at virtual address 0800
current->tss.cr3 = 1f052000, %cr3 = 1f052000
*pde = 1f67b067
Oops: 
CPU:0
EIP:0010:[]
Using defaults from ksymoops -t elf32-i386 -a i386
EFLAGS: 00010206
eax: 0800   ebx: 0007   ecx: 00053d24   edx: 0800
esi: 000d   edi: 2202   ebp: 0004906a   esp: de955e7c
ds: 0018   es: 0018   ss: 0018
Process cp (pid: 573, process nr: 19, stackpage=de955000)
Stack: 0004906a 2202 00053d24 c01277e8 2202 0004906a 0200  
   c0127b9a 2202 0004906a 0200  0004906a  ded6a200 
    c0145398 2202 0004906a 0200 c014a0db ded6a200 0004906a 
Call Trace: [] [] [] [] [] 
[] [] 
   [] [] [] 
Code: 8b 00 39 6a 04 75 15 8b 4c 24 20 39 4a 08 75 0c 66 39 7a 0c 

>>EIP; c01277a8<=
Trace; c01277e8 
Trace; c0127b9a 
Trace; c0145398 
Trace; c014a0db 
Trace; c0145b6c 
Trace; c014a26f 
Trace; c014784e 
Trace; c01262d9 
Trace; c01476d0 
Trace; c0109534 
Code;  c01277a8 
 <_EIP>:
Code;  c01277a8<=
   0:   8b 00 mov(%eax),%eax   <=
Code;  c01277aa 
   2:   39 6a 04  cmp%ebp,0x4(%edx)
Code;  c01277ad 
   5:   75 15 jne1c <_EIP+0x1c> c01277c4 
Code;  c01277af 
   7:   8b 4c 24 20   mov0x20(%esp,1),%ecx
Code;  c01277b3 
   b:   39 4a 08  cmp%ecx,0x8(%edx)
Code;  c01277b6 
   e:   75 0c jne1c <_EIP+0x1c> c01277c4 
Code;  c01277b8 
  10:   66 39 7a 0c   cmp%di,0xc(%edx)


And here is the one from ext2 to ext2 copy under 2.4.2-ac5

Unable to handle kernel paging request at virtual address ea096084
c0128edf
*pde = 
Oops: 
CPU:0
EIP:0010:[]
Using defaults from ksymoops -t elf32-i386 -a i386
EFLAGS: 00010003
eax: 081b   ebx: 081b   ecx: 0282   edx: ca096000
esi: dffd7cdc   edi:    ebp: 1000   esp: dec6de38
ds: 0018   es: 0018   ss: 0018
Process cp (pid: 543, stackpage=dec6d000)
Stack: 220a 0001220a 0286 0010 c17cb2e0  c0133314 dffd7cdc 
   0003 c17cb2e0  c01333d2 0001 0007ccfc  c0121be9 
   220a c17cb2e0 c17cb2e0 220a  c0133687 c17cb2e0 1000 
Call Trace: [] [] [] [] [] 
[] [] 
   [] [] [] [] [] [] 
Code: 8b 44 82 18 0f af 5e 0c 89 42 14 03 5a 0c 40 75 05 8b 02 89 

>>EIP; c0128edf<=
Trace; c0133314 
Trace; c01333d2 
Trace; c0121be9 
Trace; c0133687 
Trace; c01339ab <__block_prepare_write+4b/2b0>
Trace; c0123e8c 
Trace; c013423d 
Trace; c014ffb0 
Trace; c0126886 
Trace; c014ffb0 
Trace; c0124a70 
Trace; c0131468 
Trace; c01090a3 
Code;  c0128edf 
 <_EIP>:
Code;  c0128edf<=
   0:   8b 44 82 18   mov0x18(%edx,%eax,4),%eax   <=
Code;  c0128ee3 
   4:   0f af 5e 0c   imul   0xc(%esi),%ebx
Code;  c0128ee7 
   8:   89 42 14  mov%eax,0x14(%edx)

Re: 2.4 kernels - attempt to access beyond end of device

2001-02-27 Thread Michal Jaegermann


To add to my report about troubles with disk activity on a system with
PDC20265 IDE controller (this is on Asus AV7 mobo, BTW) I tried the
same experiments with 2.2.19pre14 patched with ide patches to get a
support for Promise.

I got similar results - i.e. problems after some 130-150 megabytes
was copied.  On different occasions I got things like that:

file_cluster badly computed!!! 0  536870911
file_cluster badly computed!!! 1  0

practically immediately and followed by a period of a lively disk
activity and a crash.

Whoops: end_buffer_io_async: b_count != 1 on async io.

after which 'cp' process hanged in a "D" state.

attempt to access beyond end of device
21:01: rw=0, want=537238629, limit=4506201
dev 21:01 blksize=512 blocknr=1074477258 sector=1074477258 size=512 count=1
...
(and more of these)  terminated with oops decoded below.

To take 'vfat' out of picture I also tried 'cp' from ext2 partitions
(I had to collect number of things as I do not have enough of data on
this system yet) to an ext2 partition while using 2.4.2-ac5.  This resulted
in:

EXT2-fs error (device ide3(34,9)): ext2_readdir: bad entry in
directory #16584: inode out of bounds - offset=0, inode=134234312,
rec_len=12, name_len=1
EXT2-fs error (device ide3(34,9)): ext2_readdir: bad entry in
directory #131542: inode out of bounds - offset=0, inode=134349270,
rec_len=12, name_len=1
EXT2-fs error (device ide3(34,9)): ext2_readdir: bad entry in
directory #82294: inode out of bounds - offset=0, inode=134300022,
rec_len=12, name_len=1
EXT2-fs error (device ide3(34,9)): ext2_readdir: bad entry in
directory #164456: inode out of bounds - offset=0, inode=134382184,
rec_len=12, name_len=1
EXT2-fs error (device ide3(34,9)): ext2_readdir: bad entry in
directory #98872: inode out of bounds - offset=0, inode=134316600,
rec_len=12, name_len=1
22:09: rw=0, want=537530884, limit=1574338
attempt to access beyond end of device
22:09: rw=0, want=537530884, limit=1574338
.
punctuated by oops.

Here is a decoded oops from 2.2.19pre14

Unable to handle kernel paging request at virtual address 0800
current-tss.cr3 = 1f052000, %cr3 = 1f052000
*pde = 1f67b067
Oops: 
CPU:0
EIP:0010:[c01277a8]
Using defaults from ksymoops -t elf32-i386 -a i386
EFLAGS: 00010206
eax: 0800   ebx: 0007   ecx: 00053d24   edx: 0800
esi: 000d   edi: 2202   ebp: 0004906a   esp: de955e7c
ds: 0018   es: 0018   ss: 0018
Process cp (pid: 573, process nr: 19, stackpage=de955000)
Stack: 0004906a 2202 00053d24 c01277e8 2202 0004906a 0200  
   c0127b9a 2202 0004906a 0200  0004906a  ded6a200 
    c0145398 2202 0004906a 0200 c014a0db ded6a200 0004906a 
Call Trace: [c01277e8] [c0127b9a] [c0145398] [c014a0db] [c0145b6c] 
[c014a26f] [c014784e] 
   [c01262d9] [c01476d0] [c0109534] 
Code: 8b 00 39 6a 04 75 15 8b 4c 24 20 39 4a 08 75 0c 66 39 7a 0c 

EIP; c01277a8 find_buffer+68/90   =
Trace; c01277e8 get_hash_table+18/48
Trace; c0127b9a getblk+1e/144
Trace; c0145398 fat_getblk+3c/4c
Trace; c014a0db fat_add_cluster1+243/3cc
Trace; c0145b6c fat_get_cluster+58/98
Trace; c014a26f fat_add_cluster+b/2c
Trace; c014784e fat_file_write+17e/4ac
Trace; c01262d9 sys_write+e5/118
Trace; c01476d0 fat_file_write+0/4ac
Trace; c0109534 system_call+34/38
Code;  c01277a8 find_buffer+68/90
 _EIP:
Code;  c01277a8 find_buffer+68/90   =
   0:   8b 00 mov(%eax),%eax   =
Code;  c01277aa find_buffer+6a/90
   2:   39 6a 04  cmp%ebp,0x4(%edx)
Code;  c01277ad find_buffer+6d/90
   5:   75 15 jne1c _EIP+0x1c c01277c4 find_buffer+84/90
Code;  c01277af find_buffer+6f/90
   7:   8b 4c 24 20   mov0x20(%esp,1),%ecx
Code;  c01277b3 find_buffer+73/90
   b:   39 4a 08  cmp%ecx,0x8(%edx)
Code;  c01277b6 find_buffer+76/90
   e:   75 0c jne1c _EIP+0x1c c01277c4 find_buffer+84/90
Code;  c01277b8 find_buffer+78/90
  10:   66 39 7a 0c   cmp%di,0xc(%edx)


And here is the one from ext2 to ext2 copy under 2.4.2-ac5

Unable to handle kernel paging request at virtual address ea096084
c0128edf
*pde = 
Oops: 
CPU:0
EIP:0010:[c0128edf]
Using defaults from ksymoops -t elf32-i386 -a i386
EFLAGS: 00010003
eax: 081b   ebx: 081b   ecx: 0282   edx: ca096000
esi: dffd7cdc   edi:    ebp: 1000   esp: dec6de38
ds: 0018   es: 0018   ss: 0018
Process cp (pid: 543, stackpage=dec6d000)
Stack: 220a 0001220a 0286 0010 c17cb2e0  c0133314 dffd7cdc 
   0003 c17cb2e0  c01333d2 0001 0007ccfc  c0121be9 
   220a c17cb2e0 c17cb2e0 220a  c0133687 c17cb2e0 1000 
Call Trace: [c0133314] [c01333d2] [c0121be9] [c0133687] [c01339ab] 
[c0123e8c] [c013423d] 
   [c014ffb0] [c0126886] [c014ffb0] [c0124a70] [c0131468] [c01090a3] 
Code: 8b 44 82 18 0f af 5e 0c 89 42 14 03 5a 0c 40 75 05 8b 02 

Via-rhine is not finding its interrupts under 2.2.19pre14

2001-02-27 Thread Michal Jaegermann


After I booted 2.2.19pre14 on a system with two via-rhine cards I see the
following:

via-rhine.c:v1.08b-LK1.0.0 12/14/2000  Written by Donald Becker
  http://www.scyld.com/network/via-rhine.html
eth0: VIA VT3043 Rhine at 0x9400, 00:50:ba:c1:64:d9, IRQ 0.
eth0: MII PHY found at address 8, status 0x7809 advertising 05e1 Link .
eth1: VIA VT3043 Rhine at 0x8800, 00:50:ba:ab:60:64, IRQ 0.
eth1: MII PHY found at address 8, status 0x782d advertising 05e1 Link .

and a network does not work due to these IRQ 0, I guess.

In contrast when I will boot on the same hardware 2.4.2-ac5 then I get,
among other things,

via-rhine.c:v1.08b-LK1.1.7  8/9/2000  Written by Donald Becker
  http://www.scyld.com/network/via-rhine.html
PCI: Enabling device 00:09.0 (0094 - 0097)
PCI: Found IRQ 9 for device 00:09.0
PCI: The same IRQ used for device 00:04.2
PCI: The same IRQ used for device 00:04.3
PCI: The same IRQ used for device 00:0d.0
eth0: VIA VT3043 Rhine at 0x9400, 00:50:ba:c1:64:d9, IRQ 9.
eth0: MII PHY found at address 8, status 0x7809 advertising 05e1 Link .
PCI: Enabling device 00:0c.0 (0094 - 0097)
PCI: Found IRQ 11 for device 00:0c.0
eth1: VIA VT3043 Rhine at 0x8800, 00:50:ba:ab:60:64, IRQ 11.
eth1: MII PHY found at address 8, status 0x782d advertising 05e1 Link .


Devices in question can be seen here:

-[00]-+-00.0  VIA Technologies, Inc. VT8363/8365 [KT133/KM133]
  +-01.0-[01]00.0  ATI Technologies Inc Rage 128 RF
  +-04.0  VIA Technologies, Inc. VT82C686 [Apollo Super South]
  +-04.1  VIA Technologies, Inc. Bus Master IDE
  +-04.2  VIA Technologies, Inc. UHCI USB
  +-04.3  VIA Technologies, Inc. UHCI USB
  +-04.4  VIA Technologies, Inc. VT82C686 [Apollo Super ACPI]
  +-09.0  VIA Technologies, Inc. VT86C100A [Rhine 10/100]
  +-0a.0  Symbios Logic Inc. (formerly NCR) 53c810
  +-0c.0  VIA Technologies, Inc. VT86C100A [Rhine 10/100]
  +-0d.0  Ensoniq CT5880 [AudioPCI]
  \-11.0  Promise Technology, Inc. 20265


I do not have turned on a support for USB or audio in either of these
two kernels.  They are actually configured pretty similar (within a
reason :-).  But network is operational with 2.4.2-ac5.  Hm...

Even with these IRQ conflicts for eth0 one would think that eth1 should
get its interrupt.  It does not conflict with anything.  Forgotten
'pci_enable()' somewhere?

  Michal

-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/



2.4 kernels - "attempt to access beyond end of device"

2001-02-26 Thread Michal Jaegermann

I have right now on hands a system with PDC20265 controller, not used
as "raid", and it gives me a hard time.  It looks like that after some
number of megabytes copied to a disk, where "number" seems to be
somewhere between 100 and 150, something in a kernel internal structures
get overwritten and the whole thing just blows up.  After an oops mostly
anything will end up with errors so even a clean reboot will likely
be not possible.

In particular this prevents me from installing the recent Red Hat public
beta with its kernel based on 2.4.1.  I tried also some other variants
of 2.4 kernels and so far results are the same.  If there is something
left in logs then I see messages of that sort (21:01 is /dev/hde1).

21:01: rw=0, want=536992869, limit=4506201
attempt to access beyond end of device
21:01: rw=0, want=536992870, limit=4506201
attempt to access beyond end of device

The following log file is for 2.4.2-ac5.  It has less extraneous
stuff like LVM, and RAID, and USB support, and whatever...
These were effects of an attempt to copy from one vfat to another
vfat file system.  Below is also decoded oops.


Linux version 2.4.2-ac5 ([EMAIL PROTECTED]) (gcc version 2.96 2731 (Red Hat 
Linux 7.0)) #4 Mon Feb 26 18:11:13 MST 2001
BIOS-provided physical RAM map:
 BIOS-e820: 0009dc00 @  (usable)
 BIOS-e820: 2400 @ 0009dc00 (reserved)
 BIOS-e820: 0001 @ 000f (reserved)
 BIOS-e820: 1feec000 @ 0010 (usable)
 BIOS-e820: 3000 @ 1ffec000 (ACPI data)
 BIOS-e820: 0001 @ 1ffef000 (reserved)
 BIOS-e820: 1000 @ 1000 (ACPI NVS)
 BIOS-e820: 0001 @  (reserved)
On node 0 totalpages: 131052
zone(0): 4096 pages.
zone(1): 126956 pages.
zone(2): 0 pages.
Kernel command line: initrd=initrd.img root=/dev/hdg3 BOOT_IMAGE=vmlinuz auto
Initializing CPU#0
Detected 1109.899 MHz processor.
Console: colour VGA+ 80x25
Calibrating delay loop... 2215.11 BogoMIPS
Memory: 513140k/524208k available (920k kernel code, 10672k reserved, 351k data, 176k 
init, 0k highmem)
Dentry-cache hash table entries: 65536 (order: 7, 524288 bytes)
Buffer-cache hash table entries: 32768 (order: 5, 131072 bytes)
Page-cache hash table entries: 131072 (order: 7, 524288 bytes)
Inode-cache hash table entries: 32768 (order: 6, 262144 bytes)
CPU: Before vendor init, caps: 0183f9ff c1c7f9ff , vendor = 2
CPU: L1 I Cache: 64K (64 bytes/line), D cache 64K (64 bytes/line)
CPU: L2 Cache: 256K (64 bytes/line)
CPU: After vendor init, caps: 0183f9ff c1c7f9ff  
CPU: After generic, caps: 0183f9ff c1c7f9ff  
CPU: Common caps: 0183f9ff c1c7f9ff  
CPU: AMD Athlon(tm) Processor stepping 02
Enabling fast FPU save and restore... done.
Checking 'hlt' instruction... OK.
POSIX conformance testing by UNIFIX
mtrr: v1.37 (20001109) Richard Gooch ([EMAIL PROTECTED])
mtrr: detected mtrr type: Intel
PCI: PCI BIOS revision 2.10 entry at 0xf1150, last bus=1
PCI: Using configuration type 1
PCI: Probing PCI hardware
PCI: Using IRQ router VIA [1106/0686] at 00:04.0
PCI: Found IRQ 9 for device 00:09.0
PCI: The same IRQ used for device 00:04.2
PCI: The same IRQ used for device 00:04.3
PCI: The same IRQ used for device 00:0d.0
PCI: Found IRQ 11 for device 00:0c.0
Linux NET4.0 for Linux 2.4
Based upon Swansea University Computer Society NET3.039
Starting kswapd v1.8
pty: 256 Unix98 ptys configured
block: queued sectors max/low 341080kB/210008kB, 1024 slots per queue
RAMDISK driver initialized: 16 RAM disks of 8192K size 1024 blocksize
Uniform Multi-Platform E-IDE driver Revision: 6.31
ide: Assuming 33MHz system bus speed for PIO modes; override with idebus=xx
VP_IDE: IDE controller on PCI bus 00 dev 21
VP_IDE: chipset revision 16
VP_IDE: not 100% native mode: will probe irqs later
ide: Assuming 33MHz system bus speed for PIO modes; override with idebus=xx
VP_IDE: VIA vt82c686a (rev 22) IDE UDMA66 controller on pci00:04.1
ide0: BM-DMA at 0xb800-0xb807, BIOS settings: hda:DMA, hdb:pio
ide1: BM-DMA at 0xb808-0xb80f, BIOS settings: hdc:pio, hdd:pio
PDC20265: IDE controller on PCI bus 00 dev 88
PCI: Found IRQ 10 for device 00:11.0
PDC20265: chipset revision 2
PDC20265: not 100% native mode: will probe irqs later
PDC20265: (U)DMA Burst Bit ENABLED Primary PCI Mode Secondary PCI Mode.
ide2: BM-DMA at 0x6800-0x6807, BIOS settings: hde:DMA, hdf:pio
ide3: BM-DMA at 0x6808-0x680f, BIOS settings: hdg:DMA, hdh:pio
hda: CREATIVE CD5233E, ATAPI CD/DVD-ROM drive
hde: IBM-DTLA-307045, ATA DISK drive
hdg: IBM-DTLA-307045, ATA DISK drive
ide0 at 0x1f0-0x1f7,0x3f6 on irq 14
ide2 at 0x8000-0x8007,0x7802 on irq 10
ide3 at 0x7400-0x7407,0x7002 on irq 10
hde: 90069840 sectors (46116 MB) w/1916KiB Cache, CHS=89355/16/63, UDMA(100)
hdg: 90069840 sectors (46116 MB) w/1916KiB Cache, CHS=89355/16/63, UDMA(100)
hda: ATAPI 52X CD-ROM drive, 128kB Cache, DMA
Uniform CD-ROM driver 

2.4 kernels - attempt to access beyond end of device

2001-02-26 Thread Michal Jaegermann

I have right now on hands a system with PDC20265 controller, not used
as "raid", and it gives me a hard time.  It looks like that after some
number of megabytes copied to a disk, where "number" seems to be
somewhere between 100 and 150, something in a kernel internal structures
get overwritten and the whole thing just blows up.  After an oops mostly
anything will end up with errors so even a clean reboot will likely
be not possible.

In particular this prevents me from installing the recent Red Hat public
beta with its kernel based on 2.4.1.  I tried also some other variants
of 2.4 kernels and so far results are the same.  If there is something
left in logs then I see messages of that sort (21:01 is /dev/hde1).

21:01: rw=0, want=536992869, limit=4506201
attempt to access beyond end of device
21:01: rw=0, want=536992870, limit=4506201
attempt to access beyond end of device

The following log file is for 2.4.2-ac5.  It has less extraneous
stuff like LVM, and RAID, and USB support, and whatever...
These were effects of an attempt to copy from one vfat to another
vfat file system.  Below is also decoded oops.


Linux version 2.4.2-ac5 ([EMAIL PROTECTED]) (gcc version 2.96 2731 (Red Hat 
Linux 7.0)) #4 Mon Feb 26 18:11:13 MST 2001
BIOS-provided physical RAM map:
 BIOS-e820: 0009dc00 @  (usable)
 BIOS-e820: 2400 @ 0009dc00 (reserved)
 BIOS-e820: 0001 @ 000f (reserved)
 BIOS-e820: 1feec000 @ 0010 (usable)
 BIOS-e820: 3000 @ 1ffec000 (ACPI data)
 BIOS-e820: 0001 @ 1ffef000 (reserved)
 BIOS-e820: 1000 @ 1000 (ACPI NVS)
 BIOS-e820: 0001 @  (reserved)
On node 0 totalpages: 131052
zone(0): 4096 pages.
zone(1): 126956 pages.
zone(2): 0 pages.
Kernel command line: initrd=initrd.img root=/dev/hdg3 BOOT_IMAGE=vmlinuz auto
Initializing CPU#0
Detected 1109.899 MHz processor.
Console: colour VGA+ 80x25
Calibrating delay loop... 2215.11 BogoMIPS
Memory: 513140k/524208k available (920k kernel code, 10672k reserved, 351k data, 176k 
init, 0k highmem)
Dentry-cache hash table entries: 65536 (order: 7, 524288 bytes)
Buffer-cache hash table entries: 32768 (order: 5, 131072 bytes)
Page-cache hash table entries: 131072 (order: 7, 524288 bytes)
Inode-cache hash table entries: 32768 (order: 6, 262144 bytes)
CPU: Before vendor init, caps: 0183f9ff c1c7f9ff , vendor = 2
CPU: L1 I Cache: 64K (64 bytes/line), D cache 64K (64 bytes/line)
CPU: L2 Cache: 256K (64 bytes/line)
CPU: After vendor init, caps: 0183f9ff c1c7f9ff  
CPU: After generic, caps: 0183f9ff c1c7f9ff  
CPU: Common caps: 0183f9ff c1c7f9ff  
CPU: AMD Athlon(tm) Processor stepping 02
Enabling fast FPU save and restore... done.
Checking 'hlt' instruction... OK.
POSIX conformance testing by UNIFIX
mtrr: v1.37 (20001109) Richard Gooch ([EMAIL PROTECTED])
mtrr: detected mtrr type: Intel
PCI: PCI BIOS revision 2.10 entry at 0xf1150, last bus=1
PCI: Using configuration type 1
PCI: Probing PCI hardware
PCI: Using IRQ router VIA [1106/0686] at 00:04.0
PCI: Found IRQ 9 for device 00:09.0
PCI: The same IRQ used for device 00:04.2
PCI: The same IRQ used for device 00:04.3
PCI: The same IRQ used for device 00:0d.0
PCI: Found IRQ 11 for device 00:0c.0
Linux NET4.0 for Linux 2.4
Based upon Swansea University Computer Society NET3.039
Starting kswapd v1.8
pty: 256 Unix98 ptys configured
block: queued sectors max/low 341080kB/210008kB, 1024 slots per queue
RAMDISK driver initialized: 16 RAM disks of 8192K size 1024 blocksize
Uniform Multi-Platform E-IDE driver Revision: 6.31
ide: Assuming 33MHz system bus speed for PIO modes; override with idebus=xx
VP_IDE: IDE controller on PCI bus 00 dev 21
VP_IDE: chipset revision 16
VP_IDE: not 100% native mode: will probe irqs later
ide: Assuming 33MHz system bus speed for PIO modes; override with idebus=xx
VP_IDE: VIA vt82c686a (rev 22) IDE UDMA66 controller on pci00:04.1
ide0: BM-DMA at 0xb800-0xb807, BIOS settings: hda:DMA, hdb:pio
ide1: BM-DMA at 0xb808-0xb80f, BIOS settings: hdc:pio, hdd:pio
PDC20265: IDE controller on PCI bus 00 dev 88
PCI: Found IRQ 10 for device 00:11.0
PDC20265: chipset revision 2
PDC20265: not 100% native mode: will probe irqs later
PDC20265: (U)DMA Burst Bit ENABLED Primary PCI Mode Secondary PCI Mode.
ide2: BM-DMA at 0x6800-0x6807, BIOS settings: hde:DMA, hdf:pio
ide3: BM-DMA at 0x6808-0x680f, BIOS settings: hdg:DMA, hdh:pio
hda: CREATIVE CD5233E, ATAPI CD/DVD-ROM drive
hde: IBM-DTLA-307045, ATA DISK drive
hdg: IBM-DTLA-307045, ATA DISK drive
ide0 at 0x1f0-0x1f7,0x3f6 on irq 14
ide2 at 0x8000-0x8007,0x7802 on irq 10
ide3 at 0x7400-0x7407,0x7002 on irq 10
hde: 90069840 sectors (46116 MB) w/1916KiB Cache, CHS=89355/16/63, UDMA(100)
hdg: 90069840 sectors (46116 MB) w/1916KiB Cache, CHS=89355/16/63, UDMA(100)
hda: ATAPI 52X CD-ROM drive, 128kB Cache, DMA
Uniform CD-ROM driver 

Re: 2.4.x/alpha/ALI chipset/IDE problems summary Re: 2.4.1 not fully sane on Alpha - file systems

2001-02-15 Thread Michal Jaegermann

On Thu, Feb 15, 2001 at 03:15:01PM -0500, John Jasen wrote:
> 
> I retried the mysticism and incantations (d -l 801feac d) at the srm
> prompt, and the machine locked on fsck, under kernel 2.4.1-ac13.

Like I wrote - I did not get to locks on fsck but then stuff was weird
and if I would press sufficiently long maybe I would.  I still had some
use for my file systems so I did not try hard enough.  Maybe we need
black hens on the top of the magic quoted above?

> I don't care about X on this system, all that much, honestly.

"Technicolor" thingy seems to be side effect of your particular
graphics card, no?

  Michal
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/



Re: 2.4.x/alpha/ALI chipset/IDE problems summary Re: 2.4.1 not fully sane on Alpha - file systems

2001-02-15 Thread Michal Jaegermann

On Thu, Feb 15, 2001 at 12:49:29PM -0500, John Jasen wrote:
> 
> Well, the situation is improving, I suppose ...
> 
> Under kernel 2.4.0 and 2.4.1, a dd of about 1 4k blocks would cause
> the system to go technicolor and lock up.

On UP1100 which I have here somehow this looks a bit different _after_
I put on it the latest SRM and used this "magic incantation" from
Hyung Min SEO ('d -l 801feac d' at SRM prompt to modify firmware).
I copied from disk to disk directory tries with some 150 MB of data
in these and no ill effects.

OTOH things are still wobbly.  This shows up in this that trying to
run e2fsck on a dirty file system while booting one 2.4.1 is likely
to come up with all kind of errors in a file sytstem requiring manual
interactions.  If one breaks this process and repeats an exercise
on the same file system, but booting this time 2.2.18, then things
check out without any incidents.  Once clean file systems can be
used with 2.4.1 again and no problems are reported.

I really do not see any kernel problems with 2.2 series kernels and IDE
patches.

> Now, under 2.4.1-ac13, at about 11000 blocks, it goes technicolor, but
> doesn't lock up until somewhere between 13000 and 2.

I got various lockups but no "technicolor" on any occasion.  Recently
I even got a picture with X and G450 Matrox card although one can
be very careful not to look at it a wrong angle or a power button will
be the only way out.

  Michal
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/



Re: 2.4.x/alpha/ALI chipset/IDE problems summary Re: 2.4.1 not fully sane on Alpha - file systems

2001-02-15 Thread Michal Jaegermann

On Thu, Feb 15, 2001 at 12:49:29PM -0500, John Jasen wrote:
 
 Well, the situation is improving, I suppose ...
 
 Under kernel 2.4.0 and 2.4.1, a dd of about 1 4k blocks would cause
 the system to go technicolor and lock up.

On UP1100 which I have here somehow this looks a bit different _after_
I put on it the latest SRM and used this "magic incantation" from
Hyung Min SEO ('d -l 801feac d' at SRM prompt to modify firmware).
I copied from disk to disk directory tries with some 150 MB of data
in these and no ill effects.

OTOH things are still wobbly.  This shows up in this that trying to
run e2fsck on a dirty file system while booting one 2.4.1 is likely
to come up with all kind of errors in a file sytstem requiring manual
interactions.  If one breaks this process and repeats an exercise
on the same file system, but booting this time 2.2.18, then things
check out without any incidents.  Once clean file systems can be
used with 2.4.1 again and no problems are reported.

I really do not see any kernel problems with 2.2 series kernels and IDE
patches.

 Now, under 2.4.1-ac13, at about 11000 blocks, it goes technicolor, but
 doesn't lock up until somewhere between 13000 and 2.

I got various lockups but no "technicolor" on any occasion.  Recently
I even got a picture with X and G450 Matrox card although one can
be very careful not to look at it a wrong angle or a power button will
be the only way out.

  Michal
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/



Re: 2.4.x/alpha/ALI chipset/IDE problems summary Re: 2.4.1 not fully sane on Alpha - file systems

2001-02-15 Thread Michal Jaegermann

On Thu, Feb 15, 2001 at 03:15:01PM -0500, John Jasen wrote:
 
 I retried the mysticism and incantations (d -l 801feac d) at the srm
 prompt, and the machine locked on fsck, under kernel 2.4.1-ac13.

Like I wrote - I did not get to locks on fsck but then stuff was weird
and if I would press sufficiently long maybe I would.  I still had some
use for my file systems so I did not try hard enough.  Maybe we need
black hens on the top of the magic quoted above?

 I don't care about X on this system, all that much, honestly.

"Technicolor" thingy seems to be side effect of your particular
graphics card, no?

  Michal
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/



Re: 2.4.1 not fully sane on Alpha - file systems

2001-02-01 Thread Michal Jaegermann

To follow my own message about lockups on UP1100.  This time I tried
to boot 2.4.1-ac1.  Results are really the same but this time
an attempt to copy kernel source from a partition on a SCSI drive
to another one on an IDE drive brought different message.  I include
it below.

When trying to immediatly reboot with this kernel a machine locks
up in the middle of fsck.  Luckily 2.2.18 does not have problems with
that or other disk operations for that matter.

Here is what I collected in logs this time before a machine went "poof".


kernel: EXT2-fs error (device ide0(3,1)): ext2_new_block: Allocating block in system 
zone - block = 753664
kernel: EXT2-fs error (device ide0(3,1)): ext2_new_block: Allocating block in system 
zone - block = 753664
kernel: EXT2-fs error (device ide0(3,1)): ext2_free_blocks: Freeing blocks in system 
zones - Block = 753665, count = 1
kernel: EXT2-fs error (device ide0(3,1)): ext2_new_block: Allocating block in system 
zone - block = 753683
kernel: EXT2-fs error (device ide0(3,1)): ext2_free_blocks: Freeing blocks in system 
zones - Block = 753686, count = 5
kernel: EXT2-fs error (device ide0(3,1)): ext2_add_entry: bad entry in directory 
#379108: inode out of bounds - offset=0, inode=4049125, rec_len=12, name_len=1
last message repeated 10 times
kernel: EXT2-fs error (device ide0(3,1)): ext2_new_block: Allocating block in system 
zone - block = 753686
kernel: EXT2-fs error (device ide0(3,1)): ext2_new_block: Allocating block in system 
zone - block = 753687
kernel: EXT2-fs error (device ide0(3,1)): ext2_free_blocks: Freeing blocks in system 
zones - Block = 753688, count = 7
kernel: EXT2-fs error (device ide0(3,1)): ext2_new_block: Allocating block in system 
zone - block = 753688
kernel: EXT2-fs error (device ide0(3,1)): ext2_free_blocks: Freeing blocks in system 
zones - Block = 753689, count = 7
kernel: EXT2-fs error (device ide0(3,1)): ext2_new_block: Allocating block in system 
zone - block = 753700
kernel: EXT2-fs error (device ide0(3,1)): ext2_free_blocks: Freeing blocks in system 
zones - Block = 753702, count = 6
kernel: EXT2-fs error (device ide0(3,1)): ext2_new_block: Allocating block in system 
zone - block = 753702
kernel: EXT2-fs error (device ide0(3,1)): ext2_free_blocks: Freeing blocks in system 
zones - Block = 753705, count = 5
kernel: EXT2-fs error (device ide0(3,1)): ext2_new_block: Allocating block in system 
zone - block = 753734
kernel: EXT2-fs error (device ide0(3,1)): ext2_free_blocks: Freeing blocks in system 
zones - Block = 753739, count = 3
kernel: EXT2-fs error (device ide0(3,1)): ext2_new_block: Allocating block in system 
zone - block = 753739
kernel: EXT2-fs error (device ide0(3,1)): ext2_free_blocks: Freeing blocks in system 
zones - Block = 753746, count = 1
kernel: EXT2-fs error (device ide0(3,1)): ext2_new_block: Allocating block in system 
zone - block = 753746
kernel: EXT2-fs error (device ide0(3,1)): ext2_free_blocks: Freeing blocks in system 
zones - Block = 753747, count = 7
kernel: EXT2-fs error (device ide0(3,1)): ext2_new_block: Allocating block in system 
zone - block = 753747

BTW - on a target disk there are no traces that somebody attempted to
copy something.

  Michal
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
Please read the FAQ at http://www.tux.org/lkml/



Re: 2.4.1 not fully sane on Alpha - file systems

2001-02-01 Thread Michal Jaegermann

On Thu, Feb 01, 2001 at 10:46:12AM -0500, John Jasen wrote:
> On Wed, 31 Jan 2001, Michal Jaegermann wrote:
> 
> > I just tried to boot 2.4.1 kernel on Alpha UP1100.  This machine
> > happens to have two SCSI disks on sym53c875 controller and two IDE
> > drives hooked to a builtin "Acer Laboratories Inc. [ALi] M5229 IDE".
> 
> ALI M1535D pci-ide bridge, isn't it? That's what the specs on
> API's webpage seem to indicate.

'lspci' claims that this is:

"07.0  Acer Laboratories Inc. [ALi] M1533 PCI to ISA Bridge [Aladdin IV]"

> 
> Try this for fun: dd if=/dev/hda of=/dev/null bs=4096, and see if it
> cronks out.

Probably.

> In my case, any serious I/O on the IDE drives quickly results in pretty
> technicolor on the VGA screen, and then a hard lockup.

No, no technicolor or other sounds effects.  The whole thing just
locks up with a power switch as the only option.

> Furthermore, after power-reset, 2.4.x, x=0 or 1, cannot successfully fsck
> the drives.  It hangs after about the 2nd-3rd partition, again in a hard
> lockup.

My box is much healtier than that.  Regardless if I booted into a file
system on a SCSI drive or on an IDE drive (I happen to have those
options although I prefer IDE - I have there something which I can loose
without any real pain :-) I can still fsck drives healthy after the
crash but I did NOT risk fsck under 2.4.1.  Things looks way too screwy
for this.

> 
> My WAG is that there are problems in the ALI driver.

Possibly, but I crashed the whole thing without mounting anything from
IDE drives at all.  There are still there but unused.  I simply managed
to get something in logs for the case described.  Note that errors
I quoted are from a device 08:05, i.e. SCSI driver (/dev/sda5 to be
more precise).  When my compiler went bonkers and started to read
clearly some random stuff instead of sources then the whole action was
happening on a SCSI drive.

 Michal
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
Please read the FAQ at http://www.tux.org/lkml/



Re: 2.4.1 not fully sane on Alpha - file systems

2001-02-01 Thread Michal Jaegermann

On Thu, Feb 01, 2001 at 10:46:12AM -0500, John Jasen wrote:
 On Wed, 31 Jan 2001, Michal Jaegermann wrote:
 
  I just tried to boot 2.4.1 kernel on Alpha UP1100.  This machine
  happens to have two SCSI disks on sym53c875 controller and two IDE
  drives hooked to a builtin "Acer Laboratories Inc. [ALi] M5229 IDE".
 
 ALI M1535D pci-ide bridge, isn't it? That's what the specs on
 API's webpage seem to indicate.

'lspci' claims that this is:

"07.0  Acer Laboratories Inc. [ALi] M1533 PCI to ISA Bridge [Aladdin IV]"

 
 Try this for fun: dd if=/dev/hda of=/dev/null bs=4096, and see if it
 cronks out.

Probably.

 In my case, any serious I/O on the IDE drives quickly results in pretty
 technicolor on the VGA screen, and then a hard lockup.

No, no technicolor or other sounds effects.  The whole thing just
locks up with a power switch as the only option.

 Furthermore, after power-reset, 2.4.x, x=0 or 1, cannot successfully fsck
 the drives.  It hangs after about the 2nd-3rd partition, again in a hard
 lockup.

My box is much healtier than that.  Regardless if I booted into a file
system on a SCSI drive or on an IDE drive (I happen to have those
options although I prefer IDE - I have there something which I can loose
without any real pain :-) I can still fsck drives healthy after the
crash but I did NOT risk fsck under 2.4.1.  Things looks way too screwy
for this.

 
 My WAG is that there are problems in the ALI driver.

Possibly, but I crashed the whole thing without mounting anything from
IDE drives at all.  There are still there but unused.  I simply managed
to get something in logs for the case described.  Note that errors
I quoted are from a device 08:05, i.e. SCSI driver (/dev/sda5 to be
more precise).  When my compiler went bonkers and started to read
clearly some random stuff instead of sources then the whole action was
happening on a SCSI drive.

 Michal
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
Please read the FAQ at http://www.tux.org/lkml/



Re: 2.4.1 not fully sane on Alpha - file systems

2001-02-01 Thread Michal Jaegermann

To follow my own message about lockups on UP1100.  This time I tried
to boot 2.4.1-ac1.  Results are really the same but this time
an attempt to copy kernel source from a partition on a SCSI drive
to another one on an IDE drive brought different message.  I include
it below.

When trying to immediatly reboot with this kernel a machine locks
up in the middle of fsck.  Luckily 2.2.18 does not have problems with
that or other disk operations for that matter.

Here is what I collected in logs this time before a machine went "poof".


kernel: EXT2-fs error (device ide0(3,1)): ext2_new_block: Allocating block in system 
zone - block = 753664
kernel: EXT2-fs error (device ide0(3,1)): ext2_new_block: Allocating block in system 
zone - block = 753664
kernel: EXT2-fs error (device ide0(3,1)): ext2_free_blocks: Freeing blocks in system 
zones - Block = 753665, count = 1
kernel: EXT2-fs error (device ide0(3,1)): ext2_new_block: Allocating block in system 
zone - block = 753683
kernel: EXT2-fs error (device ide0(3,1)): ext2_free_blocks: Freeing blocks in system 
zones - Block = 753686, count = 5
kernel: EXT2-fs error (device ide0(3,1)): ext2_add_entry: bad entry in directory 
#379108: inode out of bounds - offset=0, inode=4049125, rec_len=12, name_len=1
last message repeated 10 times
kernel: EXT2-fs error (device ide0(3,1)): ext2_new_block: Allocating block in system 
zone - block = 753686
kernel: EXT2-fs error (device ide0(3,1)): ext2_new_block: Allocating block in system 
zone - block = 753687
kernel: EXT2-fs error (device ide0(3,1)): ext2_free_blocks: Freeing blocks in system 
zones - Block = 753688, count = 7
kernel: EXT2-fs error (device ide0(3,1)): ext2_new_block: Allocating block in system 
zone - block = 753688
kernel: EXT2-fs error (device ide0(3,1)): ext2_free_blocks: Freeing blocks in system 
zones - Block = 753689, count = 7
kernel: EXT2-fs error (device ide0(3,1)): ext2_new_block: Allocating block in system 
zone - block = 753700
kernel: EXT2-fs error (device ide0(3,1)): ext2_free_blocks: Freeing blocks in system 
zones - Block = 753702, count = 6
kernel: EXT2-fs error (device ide0(3,1)): ext2_new_block: Allocating block in system 
zone - block = 753702
kernel: EXT2-fs error (device ide0(3,1)): ext2_free_blocks: Freeing blocks in system 
zones - Block = 753705, count = 5
kernel: EXT2-fs error (device ide0(3,1)): ext2_new_block: Allocating block in system 
zone - block = 753734
kernel: EXT2-fs error (device ide0(3,1)): ext2_free_blocks: Freeing blocks in system 
zones - Block = 753739, count = 3
kernel: EXT2-fs error (device ide0(3,1)): ext2_new_block: Allocating block in system 
zone - block = 753739
kernel: EXT2-fs error (device ide0(3,1)): ext2_free_blocks: Freeing blocks in system 
zones - Block = 753746, count = 1
kernel: EXT2-fs error (device ide0(3,1)): ext2_new_block: Allocating block in system 
zone - block = 753746
kernel: EXT2-fs error (device ide0(3,1)): ext2_free_blocks: Freeing blocks in system 
zones - Block = 753747, count = 7
kernel: EXT2-fs error (device ide0(3,1)): ext2_new_block: Allocating block in system 
zone - block = 753747

BTW - on a target disk there are no traces that somebody attempted to
copy something.

  Michal
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
Please read the FAQ at http://www.tux.org/lkml/



2.4.1 not fully sane on Alpha - file systems

2001-01-31 Thread Michal Jaegermann

I just tried to boot 2.4.1 kernel on Alpha UP1100.  This machine
happens to have two SCSI disks on sym53c875 controller and two IDE
drives hooked to a builtin "Acer Laboratories Inc. [ALi] M5229 IDE".
It boots and in the first moment makes even a pretty good impression
of beeing healthy.  But an attempt to compile something causes the
whole setup to start behaving weird, with a compiler obviously unable
to find both itself and the right sources, and the whole thing ends in
a silent lockup.

On the second boot I tried to copy kernel sources from a SCSI to an
IDE drive.  This time I got something in my logs and the same stuff
was printed on my screen before everything lockded up really tight
again (no sysrq).  Here it is:

 kernel: attempt to access beyond end of device
 kernel: 08:05: rw=0, want=198500353, limit=5779456
 kernel: attempt to access beyond end of device
 kernel: 08:05: rw=0, want=4294934529, limit=5779456
 kernel: attempt to access beyond end of device
 kernel: 08:05: rw=0, want=198500353, limit=5779456
 kernel: EXT2-fs error (device sd(8,5)): ext2_readdir: bad entry in
 directory #250255: directory entry across blocks - offset=0,
 inode=198505472, rec_len=32768, name_len=255

(and the machine dies at this point).

There is nothing wrong with this device and a file system on it.
Copying the same way, or compiling the same sources, but when booted
with 2.2.18 does not present a whiff of trouble and e2fsck, luckily
enough, finds my file systems still in place.  One should be grateful
for small favours.

Anybody have seen something similar?

  Michal
  [EMAIL PROTECTED]

p.s. I find a bit humorous the fact that the code required to
recognize that one has _some_ partition table (I happen to have two
kinds at the moment) is billed in a config file as ADVANCED.
It did the job anyway. :-)
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
Please read the FAQ at http://www.tux.org/lkml/



2.4.1 not fully sane on Alpha - file systems

2001-01-31 Thread Michal Jaegermann

I just tried to boot 2.4.1 kernel on Alpha UP1100.  This machine
happens to have two SCSI disks on sym53c875 controller and two IDE
drives hooked to a builtin "Acer Laboratories Inc. [ALi] M5229 IDE".
It boots and in the first moment makes even a pretty good impression
of beeing healthy.  But an attempt to compile something causes the
whole setup to start behaving weird, with a compiler obviously unable
to find both itself and the right sources, and the whole thing ends in
a silent lockup.

On the second boot I tried to copy kernel sources from a SCSI to an
IDE drive.  This time I got something in my logs and the same stuff
was printed on my screen before everything lockded up really tight
again (no sysrq).  Here it is:

 kernel: attempt to access beyond end of device
 kernel: 08:05: rw=0, want=198500353, limit=5779456
 kernel: attempt to access beyond end of device
 kernel: 08:05: rw=0, want=4294934529, limit=5779456
 kernel: attempt to access beyond end of device
 kernel: 08:05: rw=0, want=198500353, limit=5779456
 kernel: EXT2-fs error (device sd(8,5)): ext2_readdir: bad entry in
 directory #250255: directory entry across blocks - offset=0,
 inode=198505472, rec_len=32768, name_len=255

(and the machine dies at this point).

There is nothing wrong with this device and a file system on it.
Copying the same way, or compiling the same sources, but when booted
with 2.2.18 does not present a whiff of trouble and e2fsck, luckily
enough, finds my file systems still in place.  One should be grateful
for small favours.

Anybody have seen something similar?

  Michal
  [EMAIL PROTECTED]

p.s. I find a bit humorous the fact that the code required to
recognize that one has _some_ partition table (I happen to have two
kinds at the moment) is billed in a config file as ADVANCED.
It did the job anyway. :-)
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
Please read the FAQ at http://www.tux.org/lkml/



2.2.18pre24 - forgotten symbols

2000-12-05 Thread Michal Jaegermann

With an SMP kernel one gets, in particular,

depmod: *** Unresolved symbols in /lib/modules/2.2.18pre24/misc/agpgart.o
depmod: smp_call_function
depmod: smp_num_cpus

The machine affected is actually Alpha but likely this is not relevant.

   Michal
   [EMAIL PROTECTED]
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
Please read the FAQ at http://www.tux.org/lkml/



2.2.18pre24 - forgotten symbols

2000-12-05 Thread Michal Jaegermann

With an SMP kernel one gets, in particular,

depmod: *** Unresolved symbols in /lib/modules/2.2.18pre24/misc/agpgart.o
depmod: smp_call_function
depmod: smp_num_cpus

The machine affected is actually Alpha but likely this is not relevant.

   Michal
   [EMAIL PROTECTED]
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
Please read the FAQ at http://www.tux.org/lkml/



vm in 2.2.18pre23 - behaving worse

2000-11-23 Thread Michal Jaegermann

I was busy with other things and did not track 2.2.18pre kernels very
carefuly, but now I tried 2.2.18pre23 on Alpha and got an impression
that a situation with a virtual memory handling is worse than it was,
say, in 2.2.18pre15.  I can now see in /var/log/messages entries like
"VM: killing process sh" or "VM: killing process emacs" because I was
compiling a kernel.  This does not happen consistently, predictably, or
very often but it does happen and I should not be oom.  Nothing else
crashes, or leaves any traces in log files, and a machine continues
apparently not disturbed, but a process is gone.

Applying patches from Andrea and the one from Rik, posted some week
ago under "blindingly stupid 2.2 VM bug" subject, does not seem
to help very much.  Sigh!

Am I isolated in this experience?

  Michal

-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
Please read the FAQ at http://www.tux.org/lkml/



vm in 2.2.18pre23 - behaving worse

2000-11-23 Thread Michal Jaegermann

I was busy with other things and did not track 2.2.18pre kernels very
carefuly, but now I tried 2.2.18pre23 on Alpha and got an impression
that a situation with a virtual memory handling is worse than it was,
say, in 2.2.18pre15.  I can now see in /var/log/messages entries like
"VM: killing process sh" or "VM: killing process emacs" because I was
compiling a kernel.  This does not happen consistently, predictably, or
very often but it does happen and I should not be oom.  Nothing else
crashes, or leaves any traces in log files, and a machine continues
apparently not disturbed, but a process is gone.

Applying patches from Andrea and the one from Rik, posted some week
ago under "blindingly stupid 2.2 VM bug" subject, does not seem
to help very much.  Sigh!

Am I isolated in this experience?

  Michal

-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
Please read the FAQ at http://www.tux.org/lkml/



Re: ux164 (ruffian) fixes

2000-11-21 Thread Michal Jaegermann

On Tue, Nov 21, 2000 at 10:47:20AM -0800, Richard Henderson wrote:
> On Tue, Nov 21, 2000 at 06:46:09PM +0300, Ivan Kokshaysky wrote:
> >Interesting, other pyxis machines do not seem to be so sensitive,
> >so I guess some design problems with ux164 motherboard - all this
> >looks pretty much like timing issues.
> 
> Wow.  Thanks for following through on this.

I can now confirm that I can boot using SCSI disks (the fact that
this was possible for a while into IDE was a life-saver here :-)
a Ruffian (pyxis) Alpha using 2.4.0-test11 kernel and two patches
posted by Ivan (bridges-2.4.0t11.gz and extra ruffian fixes).

Here are fragments from 'dmesg':


Booting on Ruffian using machine vector Ruffian from MILO
Command line: bootdevice=sda2 bootfile=/vml240o11.gz root=/dev/sda2

SCSI subsystem driver Revision: 1.00
sym53c8xx: at PCI bus 1, device 13, function 0
sym53c8xx: 53c875 detected 
sym53c875-0: rev 0x3 on pci bus 1 device 13 function 0 irq 20
sym53c875-0: ID 7, Fast-20, Parity Checking
sym53c875-0: on-chip RAM at 0xa101000
sym53c875-0: restart (scsi reset).
sym53c875-0: Downloading SCSI SCRIPTS.
scsi0 : sym53c8xx - version 1.6b
  Vendor: IBM   Model: DDRS-39130D   Rev: DC1B
  Type:   Direct-Access  ANSI SCSI revision: 02
  Vendor: TOSHIBA   Model: CD-ROM XM-6201TA  Rev: 1037
  Type:   CD-ROM ANSI SCSI revision: 02
  Vendor: IBM   Model: DDRS-34560D   Rev: DC1B
  Type:   Direct-Access  ANSI SCSI revision: 02
sym53c875-0-<2,0>: tagged command queue depth set to 8
sym53c875-0-<10,0>: tagged command queue depth set to 8
Detected scsi disk sda at scsi0, channel 0, id 2, lun 0
Detected scsi disk sdb at scsi0, channel 0, id 10, lun 0

VFS: Mounted root (ext2 filesystem) readonly.

Those who posted "me too" could you please test that this is not
only a fluke on my particular machine?

Thanks a bunch, Ivan. Also thanks are extended to Gerard Roudier who
provided a crucial hint in the moment when we appeared to be completly
stuck. :-)

  Michal

-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
Please read the FAQ at http://www.tux.org/lkml/



Re: ux164 (ruffian) fixes

2000-11-21 Thread Michal Jaegermann

On Tue, Nov 21, 2000 at 10:47:20AM -0800, Richard Henderson wrote:
 On Tue, Nov 21, 2000 at 06:46:09PM +0300, Ivan Kokshaysky wrote:
 Interesting, other pyxis machines do not seem to be so sensitive,
 so I guess some design problems with ux164 motherboard - all this
 looks pretty much like timing issues.
 
 Wow.  Thanks for following through on this.

I can now confirm that I can boot using SCSI disks (the fact that
this was possible for a while into IDE was a life-saver here :-)
a Ruffian (pyxis) Alpha using 2.4.0-test11 kernel and two patches
posted by Ivan (bridges-2.4.0t11.gz and extra ruffian fixes).

Here are fragments from 'dmesg':


Booting on Ruffian using machine vector Ruffian from MILO
Command line: bootdevice=sda2 bootfile=/vml240o11.gz root=/dev/sda2

SCSI subsystem driver Revision: 1.00
sym53c8xx: at PCI bus 1, device 13, function 0
sym53c8xx: 53c875 detected 
sym53c875-0: rev 0x3 on pci bus 1 device 13 function 0 irq 20
sym53c875-0: ID 7, Fast-20, Parity Checking
sym53c875-0: on-chip RAM at 0xa101000
sym53c875-0: restart (scsi reset).
sym53c875-0: Downloading SCSI SCRIPTS.
scsi0 : sym53c8xx - version 1.6b
  Vendor: IBM   Model: DDRS-39130D   Rev: DC1B
  Type:   Direct-Access  ANSI SCSI revision: 02
  Vendor: TOSHIBA   Model: CD-ROM XM-6201TA  Rev: 1037
  Type:   CD-ROM ANSI SCSI revision: 02
  Vendor: IBM   Model: DDRS-34560D   Rev: DC1B
  Type:   Direct-Access  ANSI SCSI revision: 02
sym53c875-0-2,0: tagged command queue depth set to 8
sym53c875-0-10,0: tagged command queue depth set to 8
Detected scsi disk sda at scsi0, channel 0, id 2, lun 0
Detected scsi disk sdb at scsi0, channel 0, id 10, lun 0

VFS: Mounted root (ext2 filesystem) readonly.

Those who posted "me too" could you please test that this is not
only a fluke on my particular machine?

Thanks a bunch, Ivan. Also thanks are extended to Gerard Roudier who
provided a crucial hint in the moment when we appeared to be completly
stuck. :-)

  Michal

-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
Please read the FAQ at http://www.tux.org/lkml/



Re: easy-to-fix bug in /dev/null driver

2000-11-20 Thread Michal Jaegermann

On Tue, Nov 21, 2000 at 12:53:04AM +1030, Alan Kennington wrote:
> 
> I still think that write_null() should be rewritten as:
> 
> ===
> static ssize_t write_null(struct file * file, const char * buf,
>   size_t count, loff_t *ppos)
> {
> return (count <= 2147483647) ? count : 2147483647;
> }
> === 
> 
> This would fix the problem without introducing any new errors.
> (Unless someone change the definitions of ssize_t and size_t!!)

Someone already did.  Or, more precisely, there are platforms where
values used in 'return' above are bogus.

Shifting 1 up by sizeof(ssize_t) * 8 - 1 and subtracting one from
the result, in order to derive the maximal allowable value, might work.
I do not think that we have anything with non-8 bit bytes yet.

  Michal
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
Please read the FAQ at http://www.tux.org/lkml/



Re: easy-to-fix bug in /dev/null driver

2000-11-20 Thread Michal Jaegermann

On Tue, Nov 21, 2000 at 12:53:04AM +1030, Alan Kennington wrote:
 
 I still think that write_null() should be rewritten as:
 
 ===
 static ssize_t write_null(struct file * file, const char * buf,
   size_t count, loff_t *ppos)
 {
 return (count = 2147483647) ? count : 2147483647;
 }
 === 
 
 This would fix the problem without introducing any new errors.
 (Unless someone change the definitions of ssize_t and size_t!!)

Someone already did.  Or, more precisely, there are platforms where
values used in 'return' above are bogus.

Shifting 1 up by sizeof(ssize_t) * 8 - 1 and subtracting one from
the result, in order to derive the maximal allowable value, might work.
I do not think that we have anything with non-8 bit bytes yet.

  Michal
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
Please read the FAQ at http://www.tux.org/lkml/



[PATCH] __builtin_expect in 2.4.0-test11pre4

2000-11-14 Thread Michal Jaegermann

At least on Alpha, and possibly other architectures, the following
minor correction is needed:

--- linux-2.4.0p11p/include/asm-alpha/semaphore.h~  Mon Nov 13 14:01:10 2000
+++ linux-2.4.0p11p/include/asm-alpha/semaphore.h   Mon Nov 13 14:03:44 2000
@@ -11,6 +11,7 @@
 #include 
 #include 
 #include 
+#include 
 
 #define DEBUG_SEMAPHORE 0
 #define DEBUG_RW_SEMAPHORE 0

or various version of a compiler are blowing a fuse on a missing
__builtin_expect prototype.

  Michal

-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
Please read the FAQ at http://www.tux.org/lkml/



[PATCH] __builtin_expect in 2.4.0-test11pre4

2000-11-14 Thread Michal Jaegermann

At least on Alpha, and possibly other architectures, the following
minor correction is needed:

--- linux-2.4.0p11p/include/asm-alpha/semaphore.h~  Mon Nov 13 14:01:10 2000
+++ linux-2.4.0p11p/include/asm-alpha/semaphore.h   Mon Nov 13 14:03:44 2000
@@ -11,6 +11,7 @@
 #include asm/current.h
 #include asm/system.h
 #include asm/atomic.h
+#include asm/compiler.h
 
 #define DEBUG_SEMAPHORE 0
 #define DEBUG_RW_SEMAPHORE 0

or various version of a compiler are blowing a fuse on a missing
__builtin_expect prototype.

  Michal

-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
Please read the FAQ at http://www.tux.org/lkml/



Re: Linux 2.4 Status/TODO page (test11-pre3)

2000-11-13 Thread Michal Jaegermann

On Sun, Nov 12, 2000 at 10:09:39PM -0500, Jeff Garzik wrote:
> [EMAIL PROTECTED] wrote:
> > 4. Boot Time Failures
> 
> >  * Various Alpha's don't boot under 2.4.0-test9 (PCI-PCI bridges are
> >not configured correctly Michal Jaegermann; Richard Henderson may
> >have an idea what's failing.)
> 
> Move to patch-exists-but-not-merged.  rth has patches, co-developed with
> Ivan Kokshaysky

Would be nice, wouldn't it.  Unfortunately this is not the case.
rth has patches which help to boot his machine, and few others, but
this still does not work in general.

Ivan works now pretty hard, with my involvment into this, trying to
identify and fix the problem.

  Michal
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
Please read the FAQ at http://www.tux.org/lkml/



Re: Linux 2.4 Status/TODO page (test11-pre3)

2000-11-13 Thread Michal Jaegermann

On Sun, Nov 12, 2000 at 10:09:39PM -0500, Jeff Garzik wrote:
 [EMAIL PROTECTED] wrote:
  4. Boot Time Failures
 
   * Various Alpha's don't boot under 2.4.0-test9 (PCI-PCI bridges are
 not configured correctly Michal Jaegermann; Richard Henderson may
 have an idea what's failing.)
 
 Move to patch-exists-but-not-merged.  rth has patches, co-developed with
 Ivan Kokshaysky

Would be nice, wouldn't it.  Unfortunately this is not the case.
rth has patches which help to boot his machine, and few others, but
this still does not work in general.

Ivan works now pretty hard, with my involvment into this, trying to
identify and fix the problem.

  Michal
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
Please read the FAQ at http://www.tux.org/lkml/



Re: PCI-PCI bridges mess in 2.4.x

2000-11-09 Thread Michal Jaegermann

On Thu, Nov 09, 2000 at 11:33:47AM -0500, Wakko Warner wrote:
> > It was posted to lkml, so no link (except if you want to dig through
> > lkml mail archives).
> 
> It booted but then it oops'ed before userland I belive.  I tried it this
> morning and didn't have much time.  It did find the scsi controller (which
> is across the bridge) and the drives attached so it does appear to be
> working.

Looks so far that I am the worst off.  If I am trying to boot with
a root on a SCSI device then either a controller is misdetected,
or goes into an infinite "abort/reset" loop, or it does not initialize
properly and disks are not found.  This is a non-exclusive, logical,
"or". :-)

Booting to an IDE device makes difference only in that that if I can
boot then SCSI disks will be simply ignored.  If somebody is interested
in a collection of dmesg outputs, with DEBUG printks, from such attempts
then I am game.  Ivan was getting these pretty consistently. :-)

   Michal
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
Please read the FAQ at http://www.tux.org/lkml/



Re: PCI-PCI bridges mess in 2.4.x

2000-11-09 Thread Michal Jaegermann

On Thu, Nov 09, 2000 at 11:33:47AM -0500, Wakko Warner wrote:
  It was posted to lkml, so no link (except if you want to dig through
  lkml mail archives).
 
 It booted but then it oops'ed before userland I belive.  I tried it this
 morning and didn't have much time.  It did find the scsi controller (which
 is across the bridge) and the drives attached so it does appear to be
 working.

Looks so far that I am the worst off.  If I am trying to boot with
a root on a SCSI device then either a controller is misdetected,
or goes into an infinite "abort/reset" loop, or it does not initialize
properly and disks are not found.  This is a non-exclusive, logical,
"or". :-)

Booting to an IDE device makes difference only in that that if I can
boot then SCSI disks will be simply ignored.  If somebody is interested
in a collection of dmesg outputs, with DEBUG printks, from such attempts
then I am game.  Ivan was getting these pretty consistently. :-)

   Michal
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
Please read the FAQ at http://www.tux.org/lkml/



FIX: Nice oops from agpgart - 2.2 kernels and Alpha

2000-10-17 Thread Michal Jaegermann

I complained few days ago that 'agpgart.o' module from 2.2.18pre
is causing a kernel oops.  The problem turned out to be an apparent
assumption that PCI memory <-> memory mapping is an identity and
this is not always the case.

Here is a patch applicable to all 2.2.18pre kernels with agpgart
support:

--- linux-2.2.18px/drivers/char/agp/agp.h~  Tue Oct  3 16:03:13 2000
+++ linux-2.2.18px/drivers/char/agp/agp.h   Tue Oct 17 10:23:14 2000
@@ -27,6 +27,8 @@
 #ifndef _AGP_BACKEND_PRIV_H
 #define _AGP_BACKEND_PRIV_H 1
 
+#include 
+
 enum aper_size_type {
U8_APER_SIZE,
U16_APER_SIZE,
@@ -119,13 +121,13 @@
void (*free_by_type) (agp_memory *);
 };
 
-#define OUTREG32(mmap, addr, val)   *(volatile u32 *)(mmap + (addr)) = (val)
-#define OUTREG16(mmap, addr, val)   *(volatile u16 *)(mmap + (addr)) = (val)
-#define OUTREG8 (mmap, addr, val)   *(volatile u8 *) (mmap + (addr)) = (val)
-
-#define INREG32(mmap, addr) *(volatile u32 *)(mmap + (addr))
-#define INREG16(mmap, addr) *(volatile u16 *)(mmap + (addr))
-#define INREG8 (mmap, addr) *(volatile u8 *) (mmap + (addr))
+#define OUTREG32(mmap, addr, val)   writel((val),(mmap + (addr)))
+#define OUTREG16(mmap, addr, val)   writew((val),(mmap + (addr)))
+#define OUTREG8 (mmap, addr, val)   writeb((val),(mmap + (addr)))
+
+#define INREG32(mmap, addr) readl(mmap + (addr))
+#define INREG16(mmap, addr) readw(mmap + (addr))
+#define INREG8 (mmap, addr) readb(mmap + (addr))
 
 #define CACHE_FLUSHagp_bridge.cache_flush
 #define A_SIZE_8(x)((aper_size_info_8 *) x)

With this patch 'agpgart.o' module compiled on x86 machine is
byte-by-byte identical as the one without it.  OTOH on UP1100 Alpha
with an Irongate after the fixed module is inserted instead of
crashing prints this in dmesg:

Linux agpgart interface v0.99 (c) Jeff Hartmann
agpgart: Maximum main memory to use for agp memory: 204M
agpgart: Detected AMD Irongate chipset
agpgart: AGP aperture is 512M @ 0x4000

I have no idea if this is really useful on Alpha and if there are no
other address mapping problems still there but the patch above should
at least be a start. :-)

A peek at agpgart sources from 2.4 reveals that this mapping problem
is absent there as INREG... and OUTREG... macros are defined with
a help of __raw_read... and __raw_write... which function the same
like the stuff above from 2.2.

On x86 machines 'agpgart' appears to be really doing something. :-)
A small test program 'testgart' (got in mail from somebody) reports
a memory benchmark speed in 580-590 mb/s range before I started my
X server and some 720+ mb/s after X was started and exited.  Don't
ask me why. :-)

The same 'testgart' on Alpha, after obviously x86 specific or not
applicable stuff was edited out, is causing a spectacular crash
with mulitple oopses, a total lockup and zilch in log files.  So
not everything is healthy yet but not bombing out for a start
seems like a good beginning.

  Michal
  [EMAIL PROTECTED]

-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
Please read the FAQ at http://www.tux.org/lkml/



FIX: Nice oops from agpgart - 2.2 kernels and Alpha

2000-10-17 Thread Michal Jaegermann

I complained few days ago that 'agpgart.o' module from 2.2.18pre
is causing a kernel oops.  The problem turned out to be an apparent
assumption that PCI memory - memory mapping is an identity and
this is not always the case.

Here is a patch applicable to all 2.2.18pre kernels with agpgart
support:

--- linux-2.2.18px/drivers/char/agp/agp.h~  Tue Oct  3 16:03:13 2000
+++ linux-2.2.18px/drivers/char/agp/agp.h   Tue Oct 17 10:23:14 2000
@@ -27,6 +27,8 @@
 #ifndef _AGP_BACKEND_PRIV_H
 #define _AGP_BACKEND_PRIV_H 1
 
+#include asm/io.h
+
 enum aper_size_type {
U8_APER_SIZE,
U16_APER_SIZE,
@@ -119,13 +121,13 @@
void (*free_by_type) (agp_memory *);
 };
 
-#define OUTREG32(mmap, addr, val)   *(volatile u32 *)(mmap + (addr)) = (val)
-#define OUTREG16(mmap, addr, val)   *(volatile u16 *)(mmap + (addr)) = (val)
-#define OUTREG8 (mmap, addr, val)   *(volatile u8 *) (mmap + (addr)) = (val)
-
-#define INREG32(mmap, addr) *(volatile u32 *)(mmap + (addr))
-#define INREG16(mmap, addr) *(volatile u16 *)(mmap + (addr))
-#define INREG8 (mmap, addr) *(volatile u8 *) (mmap + (addr))
+#define OUTREG32(mmap, addr, val)   writel((val),(mmap + (addr)))
+#define OUTREG16(mmap, addr, val)   writew((val),(mmap + (addr)))
+#define OUTREG8 (mmap, addr, val)   writeb((val),(mmap + (addr)))
+
+#define INREG32(mmap, addr) readl(mmap + (addr))
+#define INREG16(mmap, addr) readw(mmap + (addr))
+#define INREG8 (mmap, addr) readb(mmap + (addr))
 
 #define CACHE_FLUSHagp_bridge.cache_flush
 #define A_SIZE_8(x)((aper_size_info_8 *) x)

With this patch 'agpgart.o' module compiled on x86 machine is
byte-by-byte identical as the one without it.  OTOH on UP1100 Alpha
with an Irongate after the fixed module is inserted instead of
crashing prints this in dmesg:

Linux agpgart interface v0.99 (c) Jeff Hartmann
agpgart: Maximum main memory to use for agp memory: 204M
agpgart: Detected AMD Irongate chipset
agpgart: AGP aperture is 512M @ 0x4000

I have no idea if this is really useful on Alpha and if there are no
other address mapping problems still there but the patch above should
at least be a start. :-)

A peek at agpgart sources from 2.4 reveals that this mapping problem
is absent there as INREG... and OUTREG... macros are defined with
a help of __raw_read... and __raw_write... which function the same
like the stuff above from 2.2.

On x86 machines 'agpgart' appears to be really doing something. :-)
A small test program 'testgart' (got in mail from somebody) reports
a memory benchmark speed in 580-590 mb/s range before I started my
X server and some 720+ mb/s after X was started and exited.  Don't
ask me why. :-)

The same 'testgart' on Alpha, after obviously x86 specific or not
applicable stuff was edited out, is causing a spectacular crash
with mulitple oopses, a total lockup and zilch in log files.  So
not everything is healthy yet but not bombing out for a start
seems like a good beginning.

  Michal
  [EMAIL PROTECTED]

-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
Please read the FAQ at http://www.tux.org/lkml/



Nice oops from agpgart - 2.2 kernels and Alpha

2000-10-13 Thread Michal Jaegermann


On UP1100 Alpha with an AGP slot and "Advanced Micro Devices [AMD]
AMD-751 [Irongate] System Controller" an attempt to use 'agpgart'
support ends up with an oops. I tried 2.2.17 and 2.2.18pre15 kernels.
With CONFIG_AGP=y and CONFIG_AGP_AMD=y resulting kernel gets stuck
after oops and does not boot.  If CONFIG_AGP=m is used then an attempt
to insert 'agpgart' module ends up with the following oops (after
a run through 'ksymoops'):

ksymoops 2.3.4 on alpha 2.2.18pre15x.  Options used
 -V (default)
 -k /proc/ksyms (default)
 -l /proc/modules (default)
 -o /lib/modules/2.2.18pre15x/ (default)
 -m /boot/System.map-2.2.18pre15x (specified)

Unable to handle kernel paging request at virtual address 6204
insmod(1048): Oops 1
pc = []  ra = []  ps = 
Using defaults from ksymoops -t elf64-alpha -a alpha
v0 =   t0 = 6200  t1 = 6200
t2 = 0c322000  t3 = fc802000  t4 = fe85
t5 = fe85  t6 = fffe  t7 = fc000c2a
s0 = fe844e50  s1 = fe844f50  s2 = fe844cb0
s3 = 0001  s4 = 0001  s5 = fe844e50
s6 = fe840090  a0 = fd01fe14  a1 = 00b0
a2 = 0080  a3 = fc569310  a4 = fc000c2a3d60
a5 = fc000c2a3d58  t8 = 0001  t9 = fc523078
t10=   t11= fc523070  pv = fc31d640
 47f01412  or zero,128,a2
 4441f102  andnot t1,15,t1
 44420401  or t1,t1,t0
 b05e0020  stl t1,32(sp)
 4821f621  zapnot t0,15,t0
 b42a  stq t0,0(s1)
 a6090028  ldq a0,40(s0)
Trace: 332014 33bc18 310cf8 42fe80 
Warning (Oops_read): Code line not seen, dumping what data is available

>>PC;  fe841ba8 <[agpgart]amd_irongate_configure+68/140>   <=
Trace; 00332014 Before first symbol
Trace; 0033bc18 Before first symbol
Trace; 00310cf8 Before first symbol
Trace; 0042fe80 Before first symbol

1 warning issued.  Results may not be reliable.

followed by a "Segmentation fault" from insmod.  Unfortunately option
-m produces an empty symbol map for the module; also later the module
is not removable with the following output from lsmod:

agpgart21312   1  (initializing)

This bomb happens on this code

/* Write out the address of the gatt table */
OUTREG32(amd_irongate_private.registers, AMD_ATTBASE,
 agp_bridge.gatt_bus_addr);

from amd_irongate_configure() in drivers/char/agp/agpgart_be.c.
I dropped few printk's there like that:

/* Get the memory mapped registers */
pci_read_config_dword(agp_bridge.dev, AMD_MMBASE, );
printk(KERN_INFO PFX "Read irongate temp %x\n", temp);
temp = (temp & PCI_BASE_ADDRESS_MEM_MASK);
printk(KERN_INFO PFX "Masked temp to %x\n", temp);
amd_irongate_private.registers = (volatile u8 *) ioremap(temp, 4096);
printk(KERN_INFO PFX "Updated private registers to %x\n",
   amd_irongate_private.registers);

and results are as follows:

Linux agpgart interface v0.99 (c) Jeff Hartmann
agpgart: Maximum main memory to use for agp memory: 204M
agpgart: Detected AMD Irongate chipset
agpgart: Read irongate temp 6208
agpgart: Masked temp to 6200
agpgart: Updated private registers to 6200
Unable to handle kernel paging request at virtual address 6204

Any ideas what is wrong with this picture?

BTW - despite the following commment in drivers/char/drm/drm.h
"Dec 1999, Richard Henderson <[EMAIL PROTECTED]>, move to generic cmpxchg."
an attempt to compile 'drm' module includes some x86 specific code
from drivers/char/drm/drmP.h, like this:

__asm__ __volatile__(LOCK_PREFIX "cmpxchgb %b1,%2"
 : "=a"(prev)
 : "q"(new), "m"(*__xg(ptr)), "0"(old)
 : "memory");

and, as you can guess, does not compile on Alpha.  But that is the
next problem after 'agpgart'. :-)

  Michal
  [EMAIL PROTECTED]
  [EMAIL PROTECTED]
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
Please read the FAQ at http://www.tux.org/lkml/



Re: want tool to open RPM package on Window 95

2000-10-11 Thread Michal Jaegermann

> Somewhere floating around there is a perl version of rpm2cpio.

This is what I wrote one day a long time ago:

#!/usr/bin/perl -w
use strict;

my ($buffer, $pos, $gzmagic);
$gzmagic = "\037\213";
open OUT, "| gunzip" or die "cannot find gunzip; $!\n";
while(1) {
  exit 1 unless defined($pos = read STDIN, $buffer, 8192) and $pos > 0;
  next unless ($pos = index $buffer, $gzmagic) >= 0;
  print OUT substr $buffer, $pos;
  last;
}
print OUT ;
exit 0;

Yes, I know that I should not mix 'read' with stdio but it worked
every time I tried the above. :-)

Can we go back now to kernel issues?

  Michal

-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
Please read the FAQ at http://www.tux.org/lkml/



Re: want tool to open RPM package on Window 95

2000-10-11 Thread Michal Jaegermann

 Somewhere floating around there is a perl version of rpm2cpio.

This is what I wrote one day a long time ago:

#!/usr/bin/perl -w
use strict;

my ($buffer, $pos, $gzmagic);
$gzmagic = "\037\213";
open OUT, "| gunzip" or die "cannot find gunzip; $!\n";
while(1) {
  exit 1 unless defined($pos = read STDIN, $buffer, 8192) and $pos  0;
  next unless ($pos = index $buffer, $gzmagic) = 0;
  print OUT substr $buffer, $pos;
  last;
}
print OUT STDIN;
exit 0;

Yes, I know that I should not mix 'read' with stdio but it worked
every time I tried the above. :-)

Can we go back now to kernel issues?

  Michal

-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
Please read the FAQ at http://www.tux.org/lkml/



Trident soundcard on Alpha/UX

2000-10-04 Thread Michal Jaegermann

An attempt to use on Alpha UX a 'trident' module on 2.2.18pre15 (and
2.2.17, and 2.2.16 with patches) with a sound card present ends up
with the following lines in 'dmesg':

Trident 4DWave/SiS 7018/Ali 5451 PCI Audio, version 0.14.6a, 23:57:52 Oct  3 2000
trident: Trident 4DWave DX found at IO 0x9800, IRQ 27
ac97_codec: AC97 Audio codec, vendor id1: 0x4352, id2: 0x5903 (Cirrus Logic CS4297)
trident: DMA buffer beyond 1 GB; bus address = 0x48158000, size = 32768

and that's it.  A presence or absence of 'bigmem" patches from Andrea
does not seem to have effects.   Any ideas how to prevent that
from happening?

  Michal

-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
Please read the FAQ at http://www.tux.org/lkml/



Trident soundcard on Alpha/UX

2000-10-04 Thread Michal Jaegermann

An attempt to use on Alpha UX a 'trident' module on 2.2.18pre15 (and
2.2.17, and 2.2.16 with patches) with a sound card present ends up
with the following lines in 'dmesg':

Trident 4DWave/SiS 7018/Ali 5451 PCI Audio, version 0.14.6a, 23:57:52 Oct  3 2000
trident: Trident 4DWave DX found at IO 0x9800, IRQ 27
ac97_codec: AC97 Audio codec, vendor id1: 0x4352, id2: 0x5903 (Cirrus Logic CS4297)
trident: DMA buffer beyond 1 GB; bus address = 0x48158000, size = 32768

and that's it.  A presence or absence of 'bigmem" patches from Andrea
does not seem to have effects.   Any ideas how to prevent that
from happening?

  Michal

-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
Please read the FAQ at http://www.tux.org/lkml/



2.4 kernels do not boot on UX (Alpha)

2000-09-22 Thread Michal Jaegermann

I tried to boot recent 2.4.0 kernels (the last one 2.4.0-test9-pre5)
on Ruffian type of Alpha, also known as UX, with a notable lack
of success.  So far I had not a single succesful boot.  My test
machine ran without any hiccups various 2.2 kernels, patched and
upatched, and before that a long list of 2.0 kernels.

Here are excerpts from boot messages, transcribed from a screen,
which seem to be relevant to the problem:


Booting GENERIC on Ruffian using machine vector Ruffian from MILO

Calibrating delay loop... 1191.18 BogoMips

pci: cia revision 1 (pyxis)
...
pci: passed tb register update test
pci: passed passed sg loopback i/o read test
pci: passed tbia test
pci: passed pte write cache snoop test
pci: failed valid tad invalid pte reload test (mcheck; workaround avialable)
pci: passed pci machine check test
PCI: Failed to allocate resource 8 for Digital Equipment Corporation
   DECchip 21052
PCI: Failed to allocate resource 1 for Trident Microsystems 4DWave DX
PCI: Failed to allocate resource 1 for Symbios Logic Inc. (formerly
   NCR) 53c875
Activating ISA DMA hang workarounds
.
hda: IBM-DJNA=370910, ATA DISK driver
ide0 at 0x1f0-0x1f7,0x3f6 on irq 14
hda: 17803440 sectors (9115 MB) w/1966KiB Cache, CHS=17662/16/63, (U)DMA
Partition check:
 hda: unknown partition table

sym53c8xx: at PCI bus 1, device 13, function 0
sym53c8xx: 53c875 detected 
sym53c876-0: rev 0xff on pci bus 1 device 13 function 0 irq 20
sym53c876-0: ID 7, Fast-20, Parity Checking, Differential
sym53c876-0: on-chip RAM at 0xa00
CACHE TEST FAILED: script execution failed.
start=4fea4dd8, pc=. end=4fea4df8
CACHE INCORRECTLY CONFIGURED.
sym53c876-0: giving up ...


and a kernel panic because my root file system on /dev/sda2 cannot be
mounted.  The machine is dead with even a keyboard gone.

There is also another failure mode when some messages are scrolling
up the screen too fast to be of any use because a screen is flooded
with repeated messages:
pc_keyb: controller jammed (0xFF).
followed by:
keyboard timed out [1].
and a kernel hangs the moment it should print 
hda: IBM-DJNA-370910, ATA DISK drive
line. The second failure mode happens apparently less often but
apparently at random.

For comparison here are similar lines from a correct boot of a kernel
from 2.2 series on the same machine:

Booting GENERIC on Ruffian using machine vector Ruffian from MILO

Calibrating delay loop... 595.59 BogoMIPS
hda: IBM-DJNA-370910, ATA DISK drive
ide0 at 0x1f0-0x1f7,0x3f6 on irq 14
hda: IBM-DJNA-370910, 8693MB w/1966kB Cache, CHS=17662/16/63, (U)DMA
...
sym53c8xx: at PCI bus 32, device 13, function 0
sym53c8xx: 53c875 detected 
sym53c875-0: rev 0x3 on pci bus 32 device 13 function 0 irq 20
sym53c875-0: ID 7, Fast-20, Parity Checking
scsi0 : sym53c8xx-1.7.0-2709
scsi : 1 host.
  Vendor: IBM   Model: DDRS-39130D   Rev: DC1B
  Type:   Direct-Access  ANSI SCSI revision: 02
Detected scsi disk sda at scsi0, channel 0, id 2, lun 0
  Vendor: TOSHIBA   Model: CD-ROM XM-6201TA  Rev: 1037
  Type:   CD-ROM ANSI SCSI revision: 02
Detected scsi CD-ROM sr0 at scsi0, channel 0, id 6, lun 0
  Vendor: IBM   Model: DDRS-34560D   Rev: DC1B
  Type:   Direct-Access  ANSI SCSI revision: 02
Detected scsi disk sdb at scsi0, channel 0, id 10, lun 0
scsi : detected 1 SCSI cdrom 2 SCSI disks total.
sym53c875-0-<6,*>: FAST-10 SCSI 10.0 MB/s (100 ns, offset 16)
Partition check:
 sda: sda1 sda2 sda3 sda4 < sda5 sda6 >
 sdb: sdb1 sdb2 sdb3 < sdb5 sdb6 sdb7 sdb8 sdb9 sdb10 sdb11 >
 hda: hda1 hda2 hda3 hda4
VFS: Mounted root (ext2 filesystem) readonly.


In particular note differences in a detected SCSI controller and lack
of problems on a partition check. I also do not know from where
"DECchip 21052" is coming with 2.4 kernel.  The real ethernet
controller is DS21143 Tulip rev. 48.

This is a PCI layout as reported by 'lspci -tv':

-[00]-+-0d.0-[01]--+-0a.0  Trident Microsystems 4DWave DX
  |\-0d.0  Symbios Logic Inc. (formerly NCR) 53c875
  +-0e.0  Intel Corporation 82371SB PIIX3 ISA [Natoma/Triton II]
  +-0e.1  Intel Corporation 82371SB PIIX3 IDE [Natoma/Triton II]
  +-0f.0  Digital Equipment Corporation DECchip 21142/43
  \-11.0  Texas Instruments TVP4020 [Permedia 2]

Any ideas what gives here?

  Michal
  [EMAIL PROTECTED]
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
Please read the FAQ at http://www.tux.org/lkml/



2.4 kernels do not boot on UX (Alpha)

2000-09-22 Thread Michal Jaegermann

I tried to boot recent 2.4.0 kernels (the last one 2.4.0-test9-pre5)
on Ruffian type of Alpha, also known as UX, with a notable lack
of success.  So far I had not a single succesful boot.  My test
machine ran without any hiccups various 2.2 kernels, patched and
upatched, and before that a long list of 2.0 kernels.

Here are excerpts from boot messages, transcribed from a screen,
which seem to be relevant to the problem:


Booting GENERIC on Ruffian using machine vector Ruffian from MILO

Calibrating delay loop... 1191.18 BogoMips

pci: cia revision 1 (pyxis)
...
pci: passed tb register update test
pci: passed passed sg loopback i/o read test
pci: passed tbia test
pci: passed pte write cache snoop test
pci: failed valid tad invalid pte reload test (mcheck; workaround avialable)
pci: passed pci machine check test
PCI: Failed to allocate resource 8 for Digital Equipment Corporation
   DECchip 21052
PCI: Failed to allocate resource 1 for Trident Microsystems 4DWave DX
PCI: Failed to allocate resource 1 for Symbios Logic Inc. (formerly
   NCR) 53c875
Activating ISA DMA hang workarounds
.
hda: IBM-DJNA=370910, ATA DISK driver
ide0 at 0x1f0-0x1f7,0x3f6 on irq 14
hda: 17803440 sectors (9115 MB) w/1966KiB Cache, CHS=17662/16/63, (U)DMA
Partition check:
 hda: unknown partition table

sym53c8xx: at PCI bus 1, device 13, function 0
sym53c8xx: 53c875 detected 
sym53c876-0: rev 0xff on pci bus 1 device 13 function 0 irq 20
sym53c876-0: ID 7, Fast-20, Parity Checking, Differential
sym53c876-0: on-chip RAM at 0xa00
CACHE TEST FAILED: script execution failed.
start=4fea4dd8, pc=. end=4fea4df8
CACHE INCORRECTLY CONFIGURED.
sym53c876-0: giving up ...


and a kernel panic because my root file system on /dev/sda2 cannot be
mounted.  The machine is dead with even a keyboard gone.

There is also another failure mode when some messages are scrolling
up the screen too fast to be of any use because a screen is flooded
with repeated messages:
pc_keyb: controller jammed (0xFF).
followed by:
keyboard timed out [1].
and a kernel hangs the moment it should print 
hda: IBM-DJNA-370910, ATA DISK drive
line. The second failure mode happens apparently less often but
apparently at random.

For comparison here are similar lines from a correct boot of a kernel
from 2.2 series on the same machine:

Booting GENERIC on Ruffian using machine vector Ruffian from MILO

Calibrating delay loop... 595.59 BogoMIPS
hda: IBM-DJNA-370910, ATA DISK drive
ide0 at 0x1f0-0x1f7,0x3f6 on irq 14
hda: IBM-DJNA-370910, 8693MB w/1966kB Cache, CHS=17662/16/63, (U)DMA
...
sym53c8xx: at PCI bus 32, device 13, function 0
sym53c8xx: 53c875 detected 
sym53c875-0: rev 0x3 on pci bus 32 device 13 function 0 irq 20
sym53c875-0: ID 7, Fast-20, Parity Checking
scsi0 : sym53c8xx-1.7.0-2709
scsi : 1 host.
  Vendor: IBM   Model: DDRS-39130D   Rev: DC1B
  Type:   Direct-Access  ANSI SCSI revision: 02
Detected scsi disk sda at scsi0, channel 0, id 2, lun 0
  Vendor: TOSHIBA   Model: CD-ROM XM-6201TA  Rev: 1037
  Type:   CD-ROM ANSI SCSI revision: 02
Detected scsi CD-ROM sr0 at scsi0, channel 0, id 6, lun 0
  Vendor: IBM   Model: DDRS-34560D   Rev: DC1B
  Type:   Direct-Access  ANSI SCSI revision: 02
Detected scsi disk sdb at scsi0, channel 0, id 10, lun 0
scsi : detected 1 SCSI cdrom 2 SCSI disks total.
sym53c875-0-6,*: FAST-10 SCSI 10.0 MB/s (100 ns, offset 16)
Partition check:
 sda: sda1 sda2 sda3 sda4  sda5 sda6 
 sdb: sdb1 sdb2 sdb3  sdb5 sdb6 sdb7 sdb8 sdb9 sdb10 sdb11 
 hda: hda1 hda2 hda3 hda4
VFS: Mounted root (ext2 filesystem) readonly.


In particular note differences in a detected SCSI controller and lack
of problems on a partition check. I also do not know from where
"DECchip 21052" is coming with 2.4 kernel.  The real ethernet
controller is DS21143 Tulip rev. 48.

This is a PCI layout as reported by 'lspci -tv':

-[00]-+-0d.0-[01]--+-0a.0  Trident Microsystems 4DWave DX
  |\-0d.0  Symbios Logic Inc. (formerly NCR) 53c875
  +-0e.0  Intel Corporation 82371SB PIIX3 ISA [Natoma/Triton II]
  +-0e.1  Intel Corporation 82371SB PIIX3 IDE [Natoma/Triton II]
  +-0f.0  Digital Equipment Corporation DECchip 21142/43
  \-11.0  Texas Instruments TVP4020 [Permedia 2]

Any ideas what gives here?

  Michal
  [EMAIL PROTECTED]
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
Please read the FAQ at http://www.tux.org/lkml/



Fix for non-booting Alpha with BRIDGE_OTHER device

2000-09-13 Thread Michal Jaegermann

I found by an experiment that an Alpha with a device which provides
its own PCI bridge, and if that device is on a PCI bus 0, will stop
booting right after "Partition check" messages.  The only way out is
through a power switch.  One can still boot if the device in question
is plugged in a slot on a PCI bus 1.

That particular hardware which revealed the problem is in fact SCI
board from Scali ( http://www.scali.com ) used in a cluster
communication and Scali provides a fix which makes machines bootable
again.

Here it is recreated against linux-2.2.18pre6.  It will actually
apply to a wide range of kernels but with a possible line offsets
noise.

--- linux-2.2.18p/arch/alpha/kernel/bios32.c.orig   Wed Jun  7 15:26:42 2000
+++ linux-2.2.18p/arch/alpha/kernel/bios32.cWed Sep 13 14:08:05 2000
@@ -828,6 +828,7 @@
 
for (dev = bus->devices; dev; dev = dev->sibling) {
if ((dev->class >> 16 != PCI_BASE_CLASS_BRIDGE) ||
+   (dev->class >> 8 == PCI_CLASS_BRIDGE_OTHER) ||
(dev->class >> 8 == PCI_CLASS_BRIDGE_PCMCIA)) {
disable_dev(dev);
}
@@ -840,6 +841,7 @@
 
for (dev = bus->devices; dev; dev = dev->sibling) {
if ((dev->class >> 16 != PCI_BASE_CLASS_BRIDGE) ||
+   (dev->class >> 8 == PCI_CLASS_BRIDGE_OTHER) ||
(dev->class >> 8 == PCI_CLASS_BRIDGE_PCMCIA)) {
layout_dev(dev);
}
@@ -1081,6 +1083,7 @@
 */
for (dev = pci_devices; dev; dev = dev->next) {
if ((dev->class >> 16 == PCI_BASE_CLASS_BRIDGE) &&
+   (dev->class >> 8 != PCI_CLASS_BRIDGE_OTHER) &&
(dev->class >> 8 != PCI_CLASS_BRIDGE_PCMCIA))
continue;
 
It indeed makes possible to boot regardles of a slot with SCI in it and
in tests on various Alphas _without_ that device does not seem to harm
anything - which is not a big surprise as otherwise PCMCIA would likely
be a problem too.:-)  Any comments from those who sleep with PCI specs
under pillows?  Should not that be included into standart kernels?

In arch/sparc64/kernel/psycho.c exists already a code which seems to
be somewhat related.

  Michal
  [EMAIL PROTECTED]

-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
Please read the FAQ at http://www.tux.org/lkml/



Fix for non-booting Alpha with BRIDGE_OTHER device

2000-09-13 Thread Michal Jaegermann

I found by an experiment that an Alpha with a device which provides
its own PCI bridge, and if that device is on a PCI bus 0, will stop
booting right after "Partition check" messages.  The only way out is
through a power switch.  One can still boot if the device in question
is plugged in a slot on a PCI bus 1.

That particular hardware which revealed the problem is in fact SCI
board from Scali ( http://www.scali.com ) used in a cluster
communication and Scali provides a fix which makes machines bootable
again.

Here it is recreated against linux-2.2.18pre6.  It will actually
apply to a wide range of kernels but with a possible line offsets
noise.

--- linux-2.2.18p/arch/alpha/kernel/bios32.c.orig   Wed Jun  7 15:26:42 2000
+++ linux-2.2.18p/arch/alpha/kernel/bios32.cWed Sep 13 14:08:05 2000
@@ -828,6 +828,7 @@
 
for (dev = bus-devices; dev; dev = dev-sibling) {
if ((dev-class  16 != PCI_BASE_CLASS_BRIDGE) ||
+   (dev-class  8 == PCI_CLASS_BRIDGE_OTHER) ||
(dev-class  8 == PCI_CLASS_BRIDGE_PCMCIA)) {
disable_dev(dev);
}
@@ -840,6 +841,7 @@
 
for (dev = bus-devices; dev; dev = dev-sibling) {
if ((dev-class  16 != PCI_BASE_CLASS_BRIDGE) ||
+   (dev-class  8 == PCI_CLASS_BRIDGE_OTHER) ||
(dev-class  8 == PCI_CLASS_BRIDGE_PCMCIA)) {
layout_dev(dev);
}
@@ -1081,6 +1083,7 @@
 */
for (dev = pci_devices; dev; dev = dev-next) {
if ((dev-class  16 == PCI_BASE_CLASS_BRIDGE) 
+   (dev-class  8 != PCI_CLASS_BRIDGE_OTHER) 
(dev-class  8 != PCI_CLASS_BRIDGE_PCMCIA))
continue;
 
It indeed makes possible to boot regardles of a slot with SCI in it and
in tests on various Alphas _without_ that device does not seem to harm
anything - which is not a big surprise as otherwise PCMCIA would likely
be a problem too.:-)  Any comments from those who sleep with PCI specs
under pillows?  Should not that be included into standart kernels?

In arch/sparc64/kernel/psycho.c exists already a code which seems to
be somewhat related.

  Michal
  [EMAIL PROTECTED]

-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
Please read the FAQ at http://www.tux.org/lkml/



Re: [OT] Re: Availability of kdb

2000-09-07 Thread Michal Jaegermann

Jamie Lokier wrote:
> World Domination is my hobby too :-)

Now, that is THE T-shirt!  What should be added?  A flock of penguins
in an attack mode. :-)

  --mj
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
Please read the FAQ at http://www.tux.org/lkml/



  1   2   >