Re: Reiser FS will not boot after crash

2006-09-11 Thread John M Harrison

Hi,

   Sorry to take so long giving you a response. I did connect to the box
as it crashed [via ssh]. There were no terminal messages [tail -f
/var/log/messages]. This latest time an initial window for camorama was
starting to open. 

   You ask, "Am I willing to help?" debug the problem. Absolutely. [Linux
has given so much to me!!]

   I reloaded camorama, so I should be able to reproduce the problem. The
problem (not being able to boot) occurs frequently though not every time I
run camorama. So my simple solution has been just to not run that program.

   This box has LILO on the master boot block. This chains to the Knoppix
partition which has GRUB as the partition boot loader. As mentioned GRUB
starts but then claims it finds an inconsistent filesystem.  This repeats
over and over until fsck.reiser is run on this partition from a different
OS on a different partition. fsck.reiser replays the journal at first and
then examines the filesystem and finds nothing wrong. I assume I am using
the "stock"  GRUB which Knoppix 5.0.1 installed since I haven't knowingly
changed it? Perhaps the "problem" is simply that Knoppix installed an
older version of GRUB ? 

john

On Tue, 5 Sep 2006, Vladimir V. Saveliev wrote:

> Hello
> 
> On Tuesday 05 September 2006 04:10, John M Harrison wrote:
> > Hi,
> >
> >You make some good points. I wonder what the "right" fix is?
> >
> >I certainly think the user should not be stopped from booting and then
> > get the inconsistent filesystem message over and over again as I do.
> >
> >Perhaps GRUB should offer to 'replay' the filesystem after discovering
> > that the filesystem is "inconsistent". I am not sure what other choices
> > there are since the kernel and the initial boot filesystem are presumably
> > not loadable?
> >
> 
> I looked at grub sources. It looks like it takes journal into account. It may 
> have a bug, though.
> What version of grub do you have?
> Would you like to help to debug the problem?
> 
> >john
> >
> > On Tue, 5 Sep 2006, Vladimir V. Saveliev wrote:
> > > Hello
> > >
> > > On Tuesday 05 September 2006 00:30, [EMAIL PROTECTED] wrote:
> > > > On Mon, 04 Sep 2006 23:33:27 +0400, "Vladimir V. Saveliev" said:
> > > > > after unclean shutdown journal reply is necessary to return reiserfs
> > > > > to consistent state. Maybe GRUB did not do that?
> > > >
> > > > A case can be made that GRUB should be keeping its grubby little paws
> > > > off the filesystem journal.  It's a *bootloader*.  It's only purpose in
> > > > life is to load other code that can make intelligent decisions about
> > > > things like how (or even whether) to replay a filesystem journal.
> > >
> > > Yes, I did not say that grub has to replay a journal, I just tried to
> > > guess why grub failed to boot and why things went ok after fsck.
> 



Mt Knox Datacenter in Franklin TN

2006-09-11 Thread dfrench

Can you please help me to contact your Information
Technology Director?  My company has just opened a new
computer datacenter in Franklin, TN for the purpose of
providing disaster recovery, business continuity, and
outsourcing services to small and medium size businesses
in the Nashville area.  This facility can also aid
companies such as yours in compliance with HIPAA and
Sarbanes/Oxley.

If you can provide me with contact information for your
IT director, it would be greatly appreciated.


Regards,


Dana French  615.556.0456
President,   Mt Knox Availability Services
[EMAIL PROTECTED]   http://www.mtknox.com



Re: reiser4: mount -o remount,ro / causes error on reboot

2006-09-11 Thread Peter
On Sun, 10 Sep 2006 17:01:18 +, Peter wrote:

> Using: gentoo
> kernel 2.6.17.11 with beyond patchset
> reiser patch 2.6.17-3
> reiser4progs 1.0.5
> 
update...
Transferring / to a reiser3 partition removes this problem. Shutdown and
startup proceed normally. I am using util-linux-2.12r with gentoo patches
-r4. This was updated on 9/4/06. I am thinking I will downgrade to -r3 and
see if that removes the problem.

-- 
Peter
+
Do not reply to this email, it is a spam trap and not monitored.
I can be reached via this list, or via 
jabber: pete4abw at jabber.org
ICQ: 73676357



Re: reiser4: mount -o remount,ro / causes error on reboot

2006-09-11 Thread Peter
On Mon, 11 Sep 2006 11:30:39 +0400, Vladimir V. Saveliev wrote:

snip...
> Sorry, I am confused. In the first mail you said:
> "On reboot or after a poweroff, root does not mount properly, and after
> some modules are loaded, there are segfaults when running init scripts."
> 
> This looks like  you have problems on startup. Would you, please, describe 
> the 
> sequence of operations which leads to the problem with more details.
> 
I should have written that it occurs always after a normal shutdown or
reboot. On initial startup, the error occurs. Then, after CTRL-D, the
system reboots and all is fine. Then, after the day, normal shutdown, then
abnormal startup.

Some more information, I looked at the output from the final mount -v -n -o
remont,ro / command, it appears perfectly normal. However, I am not sure
it is working normally!

-- 
Peter
+
Do not reply to this email, it is a spam trap and not monitored.
I can be reached via this list, or via 
jabber: pete4abw at jabber.org
ICQ: 73676357



Re: reiser4: mount -o remount,ro / causes error on reboot

2006-09-11 Thread Peter
On Mon, 11 Sep 2006 13:10:54 +0200, Sander Sweers wrote:

snip...

> There was a bug in baselayout which caused partition (except /) not to
> remount ro properly. The bug number is 131001 [1], is this your problem?
> 
> Greets
> Sander
> 
> 1: http://bugs.gentoo.org/show_bug.cgi?id=131001

Thank you, I read that. My version of baselayout has that fix, but that
does not appear to be the problem. I think it has more specifically to do
with the way the remount option is affecting the reiser4 fs. And, only /
is left when the remount,ro part comes anyway. Everything else is
unmounted by that time.

-- 
Peter
+
Do not reply to this email, it is a spam trap and not monitored.
I can be reached via this list, or via 
jabber: pete4abw at jabber.org
ICQ: 73676357



Re: reiser4: mount -o remount,ro / causes error on reboot

2006-09-11 Thread Sander Sweers

On 11/09/06, Peter <[EMAIL PROTECTED]> wrote:
[]

>> >> At the end of the gentoo shutdown script is a short function which
>> >> remounts / as ro.
>> >
>> > There's also one in the Gentoo startup script, which attempts to remount
>> > / ro, then remount it rw.  I commented that out, because it was causing
>> > similar problems.  I figure if it runs sync when it shuts down, that's
>> > good enough.
>>
>> The errors I note only occur on shutdown (halt.sh) not startup.


There was a bug in baselayout which caused partition (except /) not to
remount ro properly. The bug number is 131001 [1], is this your
problem?

Greets
Sander

1: http://bugs.gentoo.org/show_bug.cgi?id=131001


Re: reiser4: mount -o remount,ro / causes error on reboot

2006-09-11 Thread Peter
On Mon, 11 Sep 2006 11:30:39 +0400, Vladimir V. Saveliev wrote:

> Hello
> 
> On Monday 11 September 2006 01:26, Peter wrote:
>> On Sun, 10 Sep 2006 15:12:00 -0500, David Masover wrote:
>> > Peter wrote:
>> >> Using: gentoo
>> >> kernel 2.6.17.11 with beyond patchset
>> >> reiser patch 2.6.17-3
>> >> reiser4progs 1.0.5
>> >>
>> >> At the end of the gentoo shutdown script is a short function which
>> >> remounts / as ro.
>> >
>> > There's also one in the Gentoo startup script, which attempts to remount
>> > / ro, then remount it rw.  I commented that out, because it was causing
>> > similar problems.  I figure if it runs sync when it shuts down, that's
>> > good enough.
>>
>> The errors I note only occur on shutdown (halt.sh) not startup. 
> 
> Sorry, I am confused. In the first mail you said:
> "On reboot or after a poweroff, root does not mount properly, and after
> some modules are loaded, there are segfaults when running init scripts."
> 
> This looks like  you have problems on startup. Would you, please, describe 
> the 
> sequence of operations which leads to the problem with more details.
> 
Yes, it is confusing. I may be incorrect. It is my BELIEF that the error
is caused by an improper shutdown, not an improper startup. This is
because after the error, typing CTRL-D to continue merely unmounts the /
volume and reboots again. Next time the system runs fine. If it was a
startup problem, wouldn't it occur every time then? If you recall, when I
wrote about this with teh Sabayon Linux CD issue, it was not unmounting
volumes or syncing them properly which led to a similar problem on
startup. That's why I thought the mount -o remount,ro / command was the
culprit. I think the only way to nail this down will be to create a
separate partition, identical in all ways, except with reiserfs or ext3
instead. See what happens then.

-- 
Peter
+
Do not reply to this email, it is a spam trap and not monitored.
I can be reached via this list, or via 
jabber: pete4abw at jabber.org
ICQ: 73676357



Re: reiser4: mount -o remount,ro / causes error on reboot

2006-09-11 Thread Vladimir V. Saveliev
Hello

On Monday 11 September 2006 01:26, Peter wrote:
> On Sun, 10 Sep 2006 15:12:00 -0500, David Masover wrote:
> > Peter wrote:
> >> Using: gentoo
> >> kernel 2.6.17.11 with beyond patchset
> >> reiser patch 2.6.17-3
> >> reiser4progs 1.0.5
> >>
> >> At the end of the gentoo shutdown script is a short function which
> >> remounts / as ro.
> >
> > There's also one in the Gentoo startup script, which attempts to remount
> > / ro, then remount it rw.  I commented that out, because it was causing
> > similar problems.  I figure if it runs sync when it shuts down, that's
> > good enough.
>
> The errors I note only occur on shutdown (halt.sh) not startup. 

Sorry, I am confused. In the first mail you said:
"On reboot or after a poweroff, root does not mount properly, and after
some modules are loaded, there are segfaults when running init scripts."

This looks like  you have problems on startup. Would you, please, describe the 
sequence of operations which leads to the problem with more details.


> Do you 
> think it could be an IDE timing thing similar to what was described on
> another thread on this ml? What's interesting is that this problem is
> recent and I am trying to look back and see what system-level packages
> were updated recently (I just converted to the 2006.1 profile before this
> occured and that caused a lot of programs to recompile. A week earlier,
> it was gcc-4.1.1. I know base layout was updated recently). Maybe something
> in mount changed? The shutdown scripts look the same. Something is left
> unhinged somewhere. Glad I was not hallucinating! Thanks for confirming
> for me.
>
> > Still, it's an annoying problem, I think it's a kernel oops.  Namesys,
> > what kind of information would be helpful?
>
> Yes, it's annoying and disconcerting at the same time. If it was a kernel
> oop then, wouldn't it have shown itself earlier? I