Re: test12: innd bug came back?

2000-12-18 Thread Jorg de Jong

Alexander Viro wrote:
> 
> On Sun, 17 Dec 2000, Jorg de Jong wrote:
> 
> > > >On 13 Dec 2000, Henrik [ISO-8859-1] Strner wrote:
> > > >
> > > >> Just to add a "me too" on this. I didn't report when I saw it last week,
> >
> > I'd like to second that. ME TOO !
> > Since I switched to 2.4.0.test12 I again have the innd bug.
> > ( well at least the same symptoms !)
> 
> I.e. old contents resurfacing in active?

I tryed your test program and got correct results, a file with bytes 11-16385 being 
zero.

I will try to give a description of my problems:
after a reboot inn is 're-using' existing messages to store new
messages. It seems that after a renumber command the active file 
is correced again. I have not checked to see if the active file
was corrutped before. 
I am using a plain stock kernel, no other patches what so ever,
but am using LVM. 

The blocksize the ext2 filesystem is using is 1024.

-- 
Jorg de Jong
Work : mailto:[EMAIL PROTECTED] 
Play : mailto:[EMAIL PROTECTED]
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
Please read the FAQ at http://www.tux.org/lkml/



Re: test12: innd bug came back?

2000-12-18 Thread Chris Mason



On 17 Dec 2000, Henrik [ISO-8859-1] Størner wrote:

> In <[EMAIL PROTECTED]> Alexander Viro 
><[EMAIL PROTECTED]> writes:
> 
> >On Sun, 17 Dec 2000, Jorg de Jong wrote:
> 
> >> > >On 13 Dec 2000, Henrik [ISO-8859-1] Størner wrote:
> >> > >
> >> > >> Just to add a "me too" on this. I didn't report when I saw it last week
> 
> >> I'd like to second that. ME TOO !
> >> Since I switched to 2.4.0.test12 I again have the innd bug.
> >> ( well at least the same symptoms !)
> 
> >Guys, what blocksize are you using?
> 
> I am using Reiserfs, and I hear it has some problems with the changes
> introduced in pre12. So I will report back once the Reiserfs guys get 
> this settled.

Ok, the reiserfs patches for test12 are on ftp.reiserfs.org/pub/2.4/beta,
please let me know if they work for you.

I just reran the test case on test12, with tails on and off and got the
correct results.  There might be some interaction with the new O_SYNC code
I'm missing that is causing innd problems though (reiserfs still isn't
using the new sync stuff, workin on it).

-chris


-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
Please read the FAQ at http://www.tux.org/lkml/



Re: test12: innd bug came back?

2000-12-18 Thread Jorg de Jong

Alexander Viro wrote:
 
 On Sun, 17 Dec 2000, Jorg de Jong wrote:
 
   On 13 Dec 2000, Henrik [ISO-8859-1] Strner wrote:
   
Just to add a "me too" on this. I didn't report when I saw it last week,
 
  I'd like to second that. ME TOO !
  Since I switched to 2.4.0.test12 I again have the innd bug.
  ( well at least the same symptoms !)
 
 I.e. old contents resurfacing in active?

I tryed your test program and got correct results, a file with bytes 11-16385 being 
zero.

I will try to give a description of my problems:
after a reboot inn is 're-using' existing messages to store new
messages. It seems that after a renumber command the active file 
is correced again. I have not checked to see if the active file
was corrutped before. 
I am using a plain stock kernel, no other patches what so ever,
but am using LVM. 

The blocksize the ext2 filesystem is using is 1024.

-- 
Jorg de Jong
Work : mailto:[EMAIL PROTECTED] 
Play : mailto:[EMAIL PROTECTED]
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
Please read the FAQ at http://www.tux.org/lkml/



Re: test12: innd bug came back?

2000-12-17 Thread Henrik Størner

In <[EMAIL PROTECTED]> Alexander Viro 
<[EMAIL PROTECTED]> writes:

>On Sun, 17 Dec 2000, Jorg de Jong wrote:

>> > >On 13 Dec 2000, Henrik [ISO-8859-1] Størner wrote:
>> > >
>> > >> Just to add a "me too" on this. I didn't report when I saw it last week

>> I'd like to second that. ME TOO !
>> Since I switched to 2.4.0.test12 I again have the innd bug.
>> ( well at least the same symptoms !)

>Guys, what blocksize are you using?

I am using Reiserfs, and I hear it has some problems with the changes
introduced in pre12. So I will report back once the Reiserfs guys get 
this settled.
-- 
Henrik Storner  | "Crackers thrive on code secrecy. Cockcroaches breed 
<[EMAIL PROTECTED]> |  in the dark. It's time to let the sunlight in."
|  
|  Eric S. Raymond, re. the Frontpage backdoor
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
Please read the FAQ at http://www.tux.org/lkml/



Re: test12: innd bug came back?

2000-12-17 Thread Alexander Viro



On Sun, 17 Dec 2000, Jorg de Jong wrote:

> > >On 13 Dec 2000, Henrik [ISO-8859-1] Størner wrote:
> > >
> > >> Just to add a "me too" on this. I didn't report when I saw it last week,
> 
> I'd like to second that. ME TOO !
> Since I switched to 2.4.0.test12 I again have the innd bug.
> ( well at least the same symptoms !)

Guys, what blocksize are you using? BTW, old testcase was
cat >foo.c <
main(argc,argv)
int argc;
char **argv;
{
int fd;
char c=0;
truncate(argv[1], 10);
fd = open(argv[1], 1);
lseek(fd, 16384, 0);
write(fd, , 1);
close(fd);
}
EOF
gcc foo.c
./a.out /tmp/something_old
od -c http://www.tux.org/lkml/



Re: test12: innd bug came back?

2000-12-17 Thread Jorg de Jong

> >On 13 Dec 2000, Henrik [ISO-8859-1] Størner wrote:
> >
> >> Just to add a "me too" on this. I didn't report when I saw it last week,

I'd like to second that. ME TOO !
Since I switched to 2.4.0.test12 I again have the innd bug.
( well at least the same symptoms !)

No problems with test10 and test11. I have not used any pre kernels.

regards


-- 
Jorg de Jong
Work : mailto:[EMAIL PROTECTED] 
Play : mailto:[EMAIL PROTECTED]
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
Please read the FAQ at http://www.tux.org/lkml/



Re: test12: innd bug came back?

2000-12-17 Thread Jorg de Jong

 On 13 Dec 2000, Henrik [ISO-8859-1] Størner wrote:
 
  Just to add a "me too" on this. I didn't report when I saw it last week,

I'd like to second that. ME TOO !
Since I switched to 2.4.0.test12 I again have the innd bug.
( well at least the same symptoms !)

No problems with test10 and test11. I have not used any pre kernels.

regards


-- 
Jorg de Jong
Work : mailto:[EMAIL PROTECTED] 
Play : mailto:[EMAIL PROTECTED]
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
Please read the FAQ at http://www.tux.org/lkml/



Re: test12: innd bug came back?

2000-12-17 Thread Alexander Viro



On Sun, 17 Dec 2000, Jorg de Jong wrote:

  On 13 Dec 2000, Henrik [ISO-8859-1] Størner wrote:
  
   Just to add a "me too" on this. I didn't report when I saw it last week,
 
 I'd like to second that. ME TOO !
 Since I switched to 2.4.0.test12 I again have the innd bug.
 ( well at least the same symptoms !)

Guys, what blocksize are you using? BTW, old testcase was
cat foo.c EOF
#include unistd.h
main(argc,argv)
int argc;
char **argv;
{
int fd;
char c=0;
truncate(argv[1], 10);
fd = open(argv[1], 1);
lseek(fd, 16384, 0);
write(fd, c, 1);
close(fd);
}
EOF
gcc foo.c
./a.out /tmp/something_old
od -c /tmp/something_old
where something_old would be something not touched for long (i.e.
completely out of cache). Buggy kernels would leave much more than
10 non-zero bytes. Correct result is a file with bytes 11-16385 being zero.
I doubt that it would be the same beast, though...

-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
Please read the FAQ at http://www.tux.org/lkml/



Re: test12: innd bug came back?

2000-12-17 Thread Henrik Størner

In [EMAIL PROTECTED] Alexander Viro 
[EMAIL PROTECTED] writes:

On Sun, 17 Dec 2000, Jorg de Jong wrote:

  On 13 Dec 2000, Henrik [ISO-8859-1] Størner wrote:
  
   Just to add a "me too" on this. I didn't report when I saw it last week

 I'd like to second that. ME TOO !
 Since I switched to 2.4.0.test12 I again have the innd bug.
 ( well at least the same symptoms !)

Guys, what blocksize are you using?

I am using Reiserfs, and I hear it has some problems with the changes
introduced in pre12. So I will report back once the Reiserfs guys get 
this settled.
-- 
Henrik Storner  | "Crackers thrive on code secrecy. Cockcroaches breed 
[EMAIL PROTECTED] |  in the dark. It's time to let the sunlight in."
|  
|  Eric S. Raymond, re. the Frontpage backdoor
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
Please read the FAQ at http://www.tux.org/lkml/



Re: test12: innd bug came back?

2000-12-13 Thread Albert Cranford

And the problem started with pre8 not final.
currently investigating difference pre7-pre8
Albert
Linus Torvalds wrote:
> 
> In article <[EMAIL PROTECTED]>,
> Alexander Viro  <[EMAIL PROTECTED]> wrote:
> >
> >
> >On 13 Dec 2000, Henrik [ISO-8859-1] Størner wrote:
> >
> >> Just to add a "me too" on this. I didn't report when I saw it last week,
> >> because I was uncertain of exactly what might have caused it - I was
> >> booting several different kernels at the time, including one from a
> >> rescue disk (I was trying to salvage bits of a Win9x disk at the time -
> >> don't ask for details!)
> >>
> >> Alas, I lost the test program someone wrote to test for the truncate
> >> problem, and due to moving I will not be able to test anything until
> >> next Monday. But if needed, I can do some testing then. Something
> >> definitely went wrong with innd during the test12 pre-patches.
> >
> >It may be a side effect of removing partial_clear() in test12-final.
> 
> No. If you read the code, partial_clear() has been a no-op for the
> longest time (the "start & ~PAGE_MASK" thing could never trigger, as
> "start" has been page-aligned for a long long while now.
> 
> So it must be something else.
> 
> Linus
> -
> To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
> the body of a message to [EMAIL PROTECTED]
> Please read the FAQ at http://www.tux.org/lkml/

-- 
Albert Cranford Deerfield Beach FL USA
[EMAIL PROTECTED]
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
Please read the FAQ at http://www.tux.org/lkml/



Re: test12: innd bug came back?

2000-12-13 Thread Linus Torvalds

In article <[EMAIL PROTECTED]>,
Alexander Viro  <[EMAIL PROTECTED]> wrote:
>
>
>On 13 Dec 2000, Henrik [ISO-8859-1] Størner wrote:
>
>> Just to add a "me too" on this. I didn't report when I saw it last week,
>> because I was uncertain of exactly what might have caused it - I was
>> booting several different kernels at the time, including one from a
>> rescue disk (I was trying to salvage bits of a Win9x disk at the time -
>> don't ask for details!)
>> 
>> Alas, I lost the test program someone wrote to test for the truncate
>> problem, and due to moving I will not be able to test anything until 
>> next Monday. But if needed, I can do some testing then. Something 
>> definitely went wrong with innd during the test12 pre-patches.
>
>It may be a side effect of removing partial_clear() in test12-final.

No. If you read the code, partial_clear() has been a no-op for the
longest time (the "start & ~PAGE_MASK" thing could never trigger, as
"start" has been page-aligned for a long long while now.

So it must be something else.

Linus
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
Please read the FAQ at http://www.tux.org/lkml/



Re: test12: innd bug came back?

2000-12-13 Thread Alexander Viro



On 13 Dec 2000, Henrik [ISO-8859-1] Størner wrote:

> Just to add a "me too" on this. I didn't report when I saw it last week,
> because I was uncertain of exactly what might have caused it - I was
> booting several different kernels at the time, including one from a
> rescue disk (I was trying to salvage bits of a Win9x disk at the time -
> don't ask for details!)
> 
> Alas, I lost the test program someone wrote to test for the truncate
> problem, and due to moving I will not be able to test anything until 
> next Monday. But if needed, I can do some testing then. Something 
> definitely went wrong with innd during the test12 pre-patches.

It may be a side effect of removing partial_clear() in test12-final.
Relevant chunk (in mm/memory.c):
@@ -953,10 +914,6 @@
/* Ok, partially affected.. */
start += diff << PAGE_SHIFT;
len = (len - diff) << PAGE_SHIFT;
-   if (start & ~PAGE_MASK) {
-   partial_clear(mpnt, start);
-   start = (start + ~PAGE_MASK) & PAGE_MASK;
-   }
flush_cache_range(mm, start, end);
zap_page_range(mm, start, len);
flush_tlb_range(mm, start, end);
should actually be
@@ -954,7 +915,6 @@
start += diff << PAGE_SHIFT;
len = (len - diff) << PAGE_SHIFT;
if (start & ~PAGE_MASK) {
-   partial_clear(mpnt, start);
start = (start + ~PAGE_MASK) & PAGE_MASK;
}
flush_cache_range(mm, start, end);

IOW, we have off-by-one when calling zap_page_range() and friends.
Cheers,
Al

-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
Please read the FAQ at http://www.tux.org/lkml/



Re: test12: innd bug came back?

2000-12-13 Thread Henrik Størner

In <[EMAIL PROTECTED]> Anton Petrusevich 
<[EMAIL PROTECTED]> writes:

>Today I saw well-known "innd bug"(truncate(tm)), and my brother said
>he had seen it with -test12-pre7. I don't know about -test12-pre3,
>neither I nor my brother hadn't noticed it since -test10. But we could
>miss it with -test12-pre3, and I didn't try any -test11 kernels. Thus
>possibly that was introduced changes between -test12-pre3 and
>-test12-pre7, but I can definitly say it present in -test12-final.

Just to add a "me too" on this. I didn't report when I saw it last week,
because I was uncertain of exactly what might have caused it - I was
booting several different kernels at the time, including one from a
rescue disk (I was trying to salvage bits of a Win9x disk at the time -
don't ask for details!)

Alas, I lost the test program someone wrote to test for the truncate
problem, and due to moving I will not be able to test anything until 
next Monday. But if needed, I can do some testing then. Something 
definitely went wrong with innd during the test12 pre-patches.
-- 
Henrik Storner  | "Crackers thrive on code secrecy. Cockcroaches breed 
<[EMAIL PROTECTED]> |  in the dark. It's time to let the sunlight in."
|  
|  Eric S. Raymond, re. the Frontpage backdoor
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
Please read the FAQ at http://www.tux.org/lkml/



Re: test12: innd bug came back?

2000-12-13 Thread Henrik Størner

In [EMAIL PROTECTED] Anton Petrusevich 
[EMAIL PROTECTED] writes:

Today I saw well-known "innd bug"(truncate(tm)), and my brother said
he had seen it with -test12-pre7. I don't know about -test12-pre3,
neither I nor my brother hadn't noticed it since -test10. But we could
miss it with -test12-pre3, and I didn't try any -test11 kernels. Thus
possibly that was introduced changes between -test12-pre3 and
-test12-pre7, but I can definitly say it present in -test12-final.

Just to add a "me too" on this. I didn't report when I saw it last week,
because I was uncertain of exactly what might have caused it - I was
booting several different kernels at the time, including one from a
rescue disk (I was trying to salvage bits of a Win9x disk at the time -
don't ask for details!)

Alas, I lost the test program someone wrote to test for the truncate
problem, and due to moving I will not be able to test anything until 
next Monday. But if needed, I can do some testing then. Something 
definitely went wrong with innd during the test12 pre-patches.
-- 
Henrik Storner  | "Crackers thrive on code secrecy. Cockcroaches breed 
[EMAIL PROTECTED] |  in the dark. It's time to let the sunlight in."
|  
|  Eric S. Raymond, re. the Frontpage backdoor
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
Please read the FAQ at http://www.tux.org/lkml/



Re: test12: innd bug came back?

2000-12-13 Thread Alexander Viro



On 13 Dec 2000, Henrik [ISO-8859-1] Størner wrote:

 Just to add a "me too" on this. I didn't report when I saw it last week,
 because I was uncertain of exactly what might have caused it - I was
 booting several different kernels at the time, including one from a
 rescue disk (I was trying to salvage bits of a Win9x disk at the time -
 don't ask for details!)
 
 Alas, I lost the test program someone wrote to test for the truncate
 problem, and due to moving I will not be able to test anything until 
 next Monday. But if needed, I can do some testing then. Something 
 definitely went wrong with innd during the test12 pre-patches.

It may be a side effect of removing partial_clear() in test12-final.
Relevant chunk (in mm/memory.c):
@@ -953,10 +914,6 @@
/* Ok, partially affected.. */
start += diff  PAGE_SHIFT;
len = (len - diff)  PAGE_SHIFT;
-   if (start  ~PAGE_MASK) {
-   partial_clear(mpnt, start);
-   start = (start + ~PAGE_MASK)  PAGE_MASK;
-   }
flush_cache_range(mm, start, end);
zap_page_range(mm, start, len);
flush_tlb_range(mm, start, end);
should actually be
@@ -954,7 +915,6 @@
start += diff  PAGE_SHIFT;
len = (len - diff)  PAGE_SHIFT;
if (start  ~PAGE_MASK) {
-   partial_clear(mpnt, start);
start = (start + ~PAGE_MASK)  PAGE_MASK;
}
flush_cache_range(mm, start, end);

IOW, we have off-by-one when calling zap_page_range() and friends.
Cheers,
Al

-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
Please read the FAQ at http://www.tux.org/lkml/



Re: test12: innd bug came back?

2000-12-13 Thread Linus Torvalds

In article [EMAIL PROTECTED],
Alexander Viro  [EMAIL PROTECTED] wrote:


On 13 Dec 2000, Henrik [ISO-8859-1] Størner wrote:

 Just to add a "me too" on this. I didn't report when I saw it last week,
 because I was uncertain of exactly what might have caused it - I was
 booting several different kernels at the time, including one from a
 rescue disk (I was trying to salvage bits of a Win9x disk at the time -
 don't ask for details!)
 
 Alas, I lost the test program someone wrote to test for the truncate
 problem, and due to moving I will not be able to test anything until 
 next Monday. But if needed, I can do some testing then. Something 
 definitely went wrong with innd during the test12 pre-patches.

It may be a side effect of removing partial_clear() in test12-final.

No. If you read the code, partial_clear() has been a no-op for the
longest time (the "start  ~PAGE_MASK" thing could never trigger, as
"start" has been page-aligned for a long long while now.

So it must be something else.

Linus
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
Please read the FAQ at http://www.tux.org/lkml/



Re: test12: innd bug came back?

2000-12-13 Thread Albert Cranford

And the problem started with pre8 not final.
currently investigating difference pre7-pre8
Albert
Linus Torvalds wrote:
 
 In article [EMAIL PROTECTED],
 Alexander Viro  [EMAIL PROTECTED] wrote:
 
 
 On 13 Dec 2000, Henrik [ISO-8859-1] Størner wrote:
 
  Just to add a "me too" on this. I didn't report when I saw it last week,
  because I was uncertain of exactly what might have caused it - I was
  booting several different kernels at the time, including one from a
  rescue disk (I was trying to salvage bits of a Win9x disk at the time -
  don't ask for details!)
 
  Alas, I lost the test program someone wrote to test for the truncate
  problem, and due to moving I will not be able to test anything until
  next Monday. But if needed, I can do some testing then. Something
  definitely went wrong with innd during the test12 pre-patches.
 
 It may be a side effect of removing partial_clear() in test12-final.
 
 No. If you read the code, partial_clear() has been a no-op for the
 longest time (the "start  ~PAGE_MASK" thing could never trigger, as
 "start" has been page-aligned for a long long while now.
 
 So it must be something else.
 
 Linus
 -
 To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
 the body of a message to [EMAIL PROTECTED]
 Please read the FAQ at http://www.tux.org/lkml/

-- 
Albert Cranford Deerfield Beach FL USA
[EMAIL PROTECTED]
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
Please read the FAQ at http://www.tux.org/lkml/