Re: another error with md malloc based fs

2007-03-22 Thread M.Hirsch

Michael Schuh schrieb:

hi Chuck,
hi @list

me again, behind the scenes,
i would take over my Server by an hoster from
Linux to FreeBSD on the running system,
while the hoster takes money for pressing 2 buttons and
put a disk in my Server...so
i create now a mfs based system that can bootet from disk and resides
fully in ram. I have tryed out Colin's depenguinator,
but it fails...some times
now i would do the steps by hand

thanks

cheers

michael


Hi Michael!

Some months ago I modified the depenguinator and made it work again. But 
sorry, I didn't yet get to write down all the steps involved (I am not a 
good writer). But I recently found another much more comfortable way 
using QEMU which I documented.
Does your provider offer a linux rescue system? If yes, you can try 
this: http://www.vrdevelopers.de/content/view/45/40/1/5/lang,en/


I'll try to follow up with the modified depenguinator asap, but don't 
hold your breath...


HTH,
Manuel
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "[EMAIL PROTECTED]"


Re: weird permitions

2006-11-29 Thread M.Hirsch



Hello,

Can someone explain to me why next can happened on freebsd:
1. add 2 users in same group - user test and test-ro in group test
2. as user test: cd /home/test ; mkdir test; chmod 775 test; echo 
"asdasd" > ~/test/del.me
3. su - test-ro ; cd /home/test; vim del.me - make changes; force save 
(:x!)


ls -l
total 2
-rw-r--r--  1 test-ro  test  10 Nov 29 18:19 del.me (how is that 
possible ?)


back "su - test" and try to edit this file - impossible!

I do not know what the RFC says about it, but it is ultra weird for me
that such ownership takeover is possible.

6.2-PRERELEASE FreeBSD Fri Oct 27 19:53:30 amd64



Correct me if I'm wrong... but you obviously were editing two completely 
distinct files.

~test/del.me (logged in as "test-ro")
and
~test/test/del.me (logged in as "test")

I fail to see anything odd here.
You seem to have enabled group writable home directories though.

M.
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "[EMAIL PROTECTED]"


[OT] Thanks

2006-06-27 Thread M.Hirsch

Just wanted to say thank you for clearing up my confusion about ECC.

And also, I want to excuse for being a bit harsh in some posts.
(I am a rather cynic person, this helps me against not going crazy over 
all this stuff.)
Last night, after hours of working on the very same problem without any 
success at all, I was at the end of my powers.

Sorry, I'll try to keep back from posting in such situations in the future.

So it seems like I can not track down ram problems in software.
Thanks very much, besides my lack of understanding ECC, I wasn't aware 
of that either.

Lesson learned.

Since everything worked fine before, I guess something must have broke 
when I took the machine out of the shelf.

But I have decided now to go the easy way out and retire the hardware.
This old box isn't worth wasting more time on...

M.
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "[EMAIL PROTECTED]"


Re: FreeBSD 6.x CVSUP today crashes with zero load ...

2006-06-26 Thread M.Hirsch

Yes, the result may be correct.
'Do not take "ECC" for "equals additional security"'

So I understand what's ECC good for, other than the usual "marketing talk".

But, in FreeBSD, the function is a result of hardware-level correction. 
Something that only kicks in in _real_ _serious_ situations.
I just would like you (not specifically you, Dmitry) to aknowledge that 
broken RAM is worth a "panic" in "standard situations"- if I may call it 
like that.


If the RAM is broken for some bits, chances are great that there are 
more following soon.
... from the replies I got via PM, I feel some people don't agree with 
that


Sticks don't just break on a single bit. From my experience, a stick 
that's got any problems at all, will cause even more trouble soon...

If a hardware problem isn't worth panick'ing, what else is?
(don't answer this one please, this was a rhetorical question - to those 
who didn't get it...)


Still, I'd rather have the node break down completely than that...

M.^


___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "[EMAIL PROTECTED]"


Re: FreeBSD 6.x CVSUP today crashes with zero load ...

2006-06-26 Thread M.Hirsch

So what do I need to do to make the box panic() on an ECC error?
Is there a kernel parameter, sysctl, or what else?

Thanks,
M.

Pete French schrieb:

I am not looking for workarounds, like ECC. I want the box to break 
immediately once any single component goes wrong...
   



Uh, that *is* what ECC does (or can do). Without ECC your broken hardware
continues to run un-noticed. With ECC you can either make it break
immediatley, or log an error or continue to run.

Stop thinking of ECC as error correction and start thinking of it as
error detection. No ECC gives you no way to detect failing memory.

-pete.
 



___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "[EMAIL PROTECTED]"


Re: FreeBSD 6.x CVSUP today crashes with zero load ...

2006-06-26 Thread M.Hirsch

Wow, Steven,

you've been really helpful here...

M.

Steven Hartland schrieb:



My advice would be dont feed the troll.

   Steve



___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "[EMAIL PROTECTED]"


Re: FreeBSD 6.x CVSUP today crashes with zero load ...

2006-06-26 Thread M.Hirsch

Dmitry Pryanishnikov schrieb:

When you wrote "ECC is a way to mask broken hardware", you were plain 
wrong.
If you're using hardware w/o ECC, it just can't tell whether error 
present

or absent. So ECC _is_ the way to detect (not mask) broken hardware.


Ok, thanks. I think I understand the meaning of ECC now.
So, unlike my supplier claims, ECC is not supposed to help against 
hardware failures.

But it is the way to detect them, right?

 If you want ECC corrector to raise NMI on corrected error (as well as 
uncorrectable), just set approproate bit in control register - every

Intel's ECC-capable chipset allows it. But if we're speaking about
production environment, such behaviour (abnormal termination on 
_corrected_

error) is unacceptable.


"abnormal termination" is not only acceptable for me, it is what I am 
looking for.
Make the node crash completely, so one of the others can take over its 
task(s).


Don't get me wrong, but tracking bugs in FreeBSD is quite more of an 
effort than "just" akquiring a new box...


 I don't see connection between this sentence and ECC (which is 
hardware option).


What I wanted to say:
Looking for errors in the logs is only a few seconds.
Finding out what caused them, is hours...
Akquiring a new box is only $29,95 ;) - that's like 30 minutes, if you 
regard it from the business side. ... I rather rent 100 boxes to do the 
task of ten, than employ 100 admins to find the "real" problem.


Thanks, Dmitry. I think I know what to look for now...

M.
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "[EMAIL PROTECTED]"


Re: FreeBSD 6.x CVSUP today crashes with zero load ...

2006-06-26 Thread M.Hirsch



Ok...
Does the standard fs, UFS2, do "extra sanity checks", then?


Sorry, replying to myself...
No, this does not matter.
If the OS thinks the data is ok, UFS will write OK data...

So, let me rephrase this:
How can I make sure there is no broken hardware in my cluster?
I am not looking for workarounds, like ECC. I want the box to break 
immediately once any single component goes wrong...


___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "[EMAIL PROTECTED]"


Re: FreeBSD 6.x CVSUP today crashes with zero load ...

2006-06-26 Thread M.Hirsch

Dmitry Pryanishnikov schrieb:



Hello!

On Mon, 26 Jun 2006, M.Hirsch wrote:

ECC is a way to mask broken hardware. I rather have my hardware fail 
directly when it does first, so I can replace it _immediately_



 You got it backwards. If your data has any value to you, then you 
don't want
to miss any single-error bit in it, do you? If you're running hardware 
w/o
ECC, your single-bit error in your data will go to the disk unnoticed, 
and you'll lose your data. With ECC, hardware will correct it. In 
(rare) case of multiple-bit error ECC logic will generate NMI for you, 
so you'll notice and "replace it _immediately_" instead of two weeks 
ago when your archive wont extract.



Nope, I am right on track.
I do not want to lose any data. So I'd prefer a ECC error to raise a 
panic so I can replace the hardware ASAP.
Don't get me wrong, but tracking bugs in FreeBSD is quite more of an 
effort than "just" akquiring a new box...


What's your hardware good for if it passes a "test", but fails in 
production?



 It's the way in what RAM will manifest single-bit errors: you run 
memory test - it won't catch them, later in production you'll miss 
this error because

nothing will provide extra sanity check of your data.


Ok...
Does the standard fs, UFS2, do "extra sanity checks", then?

M.
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "[EMAIL PROTECTED]"


Re: FreeBSD 6.x CVSUP today crashes with zero load ...

2006-06-26 Thread M.Hirsch

Wilko Bulte schrieb:


You really have never seen a machine used for serious business apparantly.

 


Depends on what you define "serious business"...
Yes, I am rather new to FreeBSD (2y+)
I am just trying to setup a /stable/ cluster of six machines right now. 
For over a week straight.
4.11 works perfectly. But support is going to be dropped very soon, so 
that's a bad option for me right now.


Over all, the system is /only/ supposed to handle a few hundred hits per 
second. (but including dynamic stuff like php...)


Dunno if that (or what else) is "serious business" for you.
Which version would you suggest for "serious business"?

Anyways, my point stands: I rather have any of my nodes panic than 
carrying the risk of creating invalid data...
One in a billion can be high probability, soon... (just planning for the 
future...)



panics like that should be eradicated, adding more nonsensical panics
is not what we need.
 

uh, I would not call hardware failure "nonsensical panics". I guess I 
must have misunderstood you...


M.
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "[EMAIL PROTECTED]"


Re: FreeBSD 6.x CVSUP today crashes with zero load ...

2006-06-26 Thread M.Hirsch

Ok, sorry. Misunderstanding here.
My point was, along what has been posted here in this thread:
"An ECC error should raise a kernel panic immediately, not only a 
message in the log files."

Any hardware showing ECC errors should be replaced asap..
Make them lazy admins do what they're getting paid for...

Correct, you can't (quickly) detect this without ECC hardware, of course.
But I keep reading about "ECC" being the solution to broken RAM sticks...

Since FreeBSD panics on creating simple malloc() vnodes, it should do so 
on ECC errors first.

Different mission, I guess ;)
(And different problems with the recent fricking code...)

M.
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "[EMAIL PROTECTED]"


Re: FreeBSD 6.x CVSUP today crashes with zero load ...

2006-06-26 Thread M.Hirsch




.. So the logs are there, all that's required is a utility to read them
and, optionally, alert the administrator to the event,

 

No, I think a panic _should_ occur, even if there was a correctable 
error. Not "when there's no other option left".

Maybe make it optional via a kernel option.
There are much less-significant problems that can cause a panic.

Sure, you may be one of the few people out there who knows how to 
correctly run a _BSD_ system...

There's few of yous out there, ;)

M.
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "[EMAIL PROTECTED]"


Re: FreeBSD 6.x CVSUP today crashes with zero load ...

2006-06-26 Thread M.Hirsch

Nope,

I'd like my bank data to be stored on a system that does ECC, no question.
But please, on hard disk level (RAID; that is _permanent_), not in the 
RAM of a single node.


If memory gets corrupted, please, raise a kernel panic... Even if 
there's ECC in place.


Counter question:
Would you like your bank account data to be stored on a medium where one 
failure can be corrected, two can be detected, but three go unnoticed? 
How unlikely is that, if you've got some hardware that is really /broken/?


I know this is a rather random thing to happen.
Still, I think ECC memory is overrated. Better have it fail immediately. 
_With a kernel panic, please_


M.

Wilko Bulte schrieb:

Balderdash.  


Following your rationale you want your bank account data
silently be corrupted by hardware with bit errors?  Be my guest, give
me ECC any day.

Proper hardware will log the ECC errors, a proper OS tailored to that
hardware will log and notify the sysadmins.

That is how it should be done.

Wilko

 



___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "[EMAIL PROTECTED]"


Re: kernel can't find root filesystem

2006-06-26 Thread M.Hirsch

Sorry, doesn't help.

There is some kind of bug hiding somewhere in 6.1 where it does not 
auto-detect the root partition under certain circumstances. Can't tell 
when it worked last, as the last distro I consider "stable" was 4.X... 
(sorry for the rant...)


I am not using (and don't want to use...) boot0 at all.
Well, I tried, but it didn't help the situation anyways...

It should work with the standard MBR and boot code ("/boot/mbr" and 
"/boot/boot"), right?
i.e. fdisk -B and bsdlabel -B without further params should do the job 
to get the system bootstrapped.

But it does not.

M.


If I'm not mistaken, you could also try to (re)install the boot0 loader:

boot0cfg /dev/da0


-Proto
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "[EMAIL PROTECTED]"

 



___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "[EMAIL PROTECTED]"


Re: FreeBSD 6.x CVSUP today crashes with zero load ...

2006-06-26 Thread M.Hirsch
ECC is a way to mask broken hardware. I rather have my hardware fail 
directly when it does first, so I can replace it _immediately_
What's your hardware good for if it passes a "test", but fails in 
production?


ECC is totally overrated.

(sorry, couldn't resist...)

M.
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "[EMAIL PROTECTED]"


Re: kernel can't find root filesystem

2006-06-24 Thread M.Hirsch
I had the same problem with 6.1. But only on some occasions, not always 
(iirc).
The installations I made over the last weeks had all very different 
environments and deployment methods.
I can't tell anymore when it happens and when not because I simply added 
the below loader.conf setting to my postinstall-script.


Add "vfs.root.mountfrom=ufs:da0s1" to /boot/loader.conf to fix it.

HTH,
Manuel
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "[EMAIL PROTECTED]"


Re: Question about current rc scripts

2006-06-20 Thread M.Hirsch

Another problem solved :)

So when my script overwrote rc.conf, the file was already loaded and 
thus overwriting it had no effect.

That's why it didn't work. (not with "BEFORE: initrandom" either).

This must have changed somewhen between 6.0 and 6-STABLE because it did 
work in 6.0.


Thanks vm.

I'm going to publish a "freebsd 6 remote deployment" article on my blog 
after I got it working 100%.
There really don't seem to be any good and recent guides for that out 
there...


M.

[LoN]Kamikaze schrieb:


BTW, rc.subr only loads rc.conf if it hasn't already been loaded.

 



___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "[EMAIL PROTECTED]"


Re: Question about current rc scripts

2006-06-20 Thread M.Hirsch

Ah, ok. I didn't get that correctly before.

"It's just another shell script" - nice hack, thanks ;)

M.

Brooks Davis schrieb:


Pete is suggested that you put the code in /etc/rc.conf.  It's just
another shell script.

-- Brooks

 


___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "[EMAIL PROTECTED]"


Re: Question about current rc scripts

2006-06-20 Thread M.Hirsch
This is what I initially tried, but it didn't work due to a wrong 
"BEFORE:" tag.
"rcconf" was removed from 6-STABLE, so my scripts didn't start at all 
anymore.


Invalid dependencies seem to make rc-scripts not execute *at all* in 
6-STABLE. (note taken)


So far I think I got it working now.
Thanks for your input!

M.

Pete French schrieb:


Actually I was suggesting you overwrite rc.conf *itself* with the variable
setting code - so every script which reloads it gets the variables set.
Surely that would work ? Though it would mean your code would be run
multiple times...

-pete.
 


___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "[EMAIL PROTECTED]"


Re: Question about current rc scripts

2006-06-20 Thread M.Hirsch
As stated in my initial post, I used "BEFORE: rcconf" up to and 
including 6.0.


So what is an "appropriate BEFORE entry" for 6-STABLE?

Sorry, I really can't find any documentation on this issue, not in 
UPDATING and not in the mailing list archives either.


Thanks,
M.


Brooks Davis schrieb:


Add appropriate BEFORE entries so it is before all unparented scripts.

-- Brooks

 



___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "[EMAIL PROTECTED]"


Re: Question about current rc scripts

2006-06-20 Thread M.Hirsch

One question remains though:

How can I make my script to be the very first rc script to be executed?

M.
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "[EMAIL PROTECTED]"


Re: Question about current rc scripts

2006-06-20 Thread M.Hirsch

Yes, I think this is the solution.

It seems to depend on the filename, specifically the ".sh" extension.

CODE (in /etc/rc):
   /etc/rc.d/*.sh) # run in current shell

Thanks, I am testing this now. I'll post back as soon as I got results 
(1h or so...)


M.

[LoN]Kamikaze schrieb:


If your scripts name ends with .sh it will be sourced into the process and all 
variables set there will prevail (this feature only works in /etc/rc.d). If you 
do this you must not use the exit command anywhere in your script.
 


___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "[EMAIL PROTECTED]"


Re: Question about current rc scripts

2006-06-20 Thread M.Hirsch

I have investigated a bit more.

Setting the variables can't work. As far as I can see, rc.conf is 
sourced from rc.subr. And every single script in /etc/rc.d/ sources 
rc.subr, so they reload the rc.conf file for each call.
The "rc" scripts are being executed in a sub-shell though. So 
overwriting variables in any of them will have no effect on the 
following files. It does work for /usr/local/etc/rc.d though, but I 
really need it to execute before anything else.


I made it work by overwriting rc.diskless (again). Stupid /me, 
rc.diskless does not follow the syntax for rc scripts. It's just a 
"normal" shell script :)


Anyways, I would still be interested in the "correct" way to do it.

M.


Pete French schrieb:


I thought rc.conf was simply a script that set some variables. If
this is the case then you don't need to overwrite it - you simply need to
make your script set the appropriate variables and then drop it in as
a repplacement for rc.conf - hence no need to rewrite rc.conf at all.

-pcf.
 


___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "[EMAIL PROTECTED]"


Question about current rc scripts

2006-06-20 Thread M.Hirsch

Hello everybody,

Before FreeBSD 6.1, I was able to write a rc script which modifies the 
rc.conf file.
All rc scripts exectuted after this first one would use the new rc.conf 
then.


To make this work, I put the script in /etc/rc.d/ and "tagged" it with 
"BEFORE: rcconf".


Unfortunately, rcconf has been removed in FreeBSD 6.1 (or -STABLE?).

I have tried many different ways, like "BEFORE:NETWORKING", and others. 
But none seems to work.

I even overwrote "rc.diskless", but still no luck.

What is the correct way for 6-STABLE to achieve what I want to do?
(i.e. write the rc.conf from a rc script)

Thanks in advance,

M.
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "[EMAIL PROTECTED]"