Mersenne: Re: build-it-yourself Athlon

Brian J. Beesley Mon, 16 Apr 2001 23:38:58 -0700
On 16 Apr 2001, at 16:27, Ernst Mayer wrote:

> Fry's (a big local discount computer & electronics chain) has a great
> deal on build-it-yourself Athlons (US$ 350 for 1.2GHz CPU, MB, case,
> floppy, ethernet card, 56K modem & video card) so I'm going to take
> the plunge.

Does sound like a good bargain... Are you _sure_ this includes the 
processor? Tip, buy the best heat sink you can find, even if you have 
to throw away the one supplied. And install an extra case fan.

BTW there are now two variants of Athlon (Thunderbird) processor, 
with 200 MHz (100 MHz DDR) or 266 MHz (133 MHz DDR) front-speed bus. 
Since the multiplier is fixed, you want to make sure you have the 
correct type for your motherboard. Some motherboards are switchable 
for FSB, but, if you install a 266 MHz FSB CPU in a M/B running at 
200 MHz FSB, the system will run at only 3/4 of its rated speed. 
Which will slow it down to about the same speed as a PIII running at 
the same rated speed.

The key is the last character of the part type stamped onto the die. 
A 200 MHz FSB CPU ends with B whilst a 266 MHz FSB CPU ends with C.
Unfortunately you won't easily be able to read this if you have 
bought a system with the heat sink already stuck onto the processor.

> The one question that remains is whether to buy ECC or
> non-ECC memory? Neither is terribly expensive (256MB of
> Athlon-compatible PC133 memory goes for $45 non-ECC, $90 ECC at
> pricewatch.com), but I don't want to pay extra unless I'm sure it will
> give some definite advantage. So the question is: does ECC memory of
> this kind actually do active error correction, or merely detection?

Actually there are several questions here...

Memory described as ECC should actively correct single-bit errors and 
detect most multi-bit errors. If an uncorrected error is detected, it 
should raise non-maskable interrupt, which (depending on the OS) will 
probably cause a kernel panic (or BSOD on a Windows system).
If an error is detected and corrected, the memory should raise a 
different interrupt, which the OS may ignore, or log somewhere.

My understanding is that the correction is actually done in the DIMM 
itself, but depends on the appropriate signals being supplied by the 
chipset.

The only experience I have of a corrected memory error on a PC-type 
system was on a 486 running linux kernel 2.2.x (using old FPM SIMMS); 
the error was logged in /var/log/messages & the system sailed happily 
on for months afterwards.

Something similar should happen if you have the CPU cache ECC 
enabled.

However, whether this works or not depends on the chipset as well as 
the BIOS settings and the OS. Note that the VIA chipsets often 
supplied on Athlon motherboards do NOT support ECC; if you install 
ECC memory, it functions as non-parity memory. 

I much prefer ECC memory, but there is absolutely _no_ point in 
paying extra for ECC memory if the capability is non-functional due 
to deficiencies in the chipset. However it is definitely worth paying 
the small amount extra for faster memory (PC133 instead of PC100, 
even if the memory bus is running at 100 MHz, or CL2 instead of CL3) 
since this can and usually does significantly benefit system 
performance - though some tuning of chipset memory timings through 
BIOS may be necessary.

> If OTOH
> one only gains the ability to *detect* errors, is Mprime configured
> with this in mind, i.e. will it restart from the last savefile if a
> memory error is detected?

Not relevant - mprime will not be aware of a corrected memory error, 
whilst a kernel panic will crash the system, so mprime have to 
restart from the last savefile (once you've stood the system up 
again!) Even if the system crashes whilst mprime is writing a 
savefile you should be OK, since the savefile is renamed to something 
mprime will look for only after the file has been written 
successfully.

Even with full ECC capability, there _could_ still be an uncorrected, 
undetected error - but this would have to be multi-bit, and therefore 
rather rare. The chance of this happening without there being a high 
rate of corrected errors must be rather small.

BTW if you're looking for a x86 linux distribution, I'd strongly 
suggest you look at "Rawhide" which is the beta for RedHat 7.1. This 
comes with an apparently functional & reasonably stable kernel 
(v2.4.2 last time I looked); there are _major_ advantages in running 
a v2.4 kernel on an x86 system (support for UDMA66 & UDMA100 disks, 
built-in I2C support which is useful for CPU temperature sensing, 
etc.). But the real point is that it's a lot easier to install 
Rawhide from scratch than it is to do an install of RH 7.0 then 
upgrade the kernel. I have little experience of distributions other 
than RedHat, but I have little doubt that the same principles apply.


Regards
Brian Beesley
_________________________________________________________________________
Unsubscribe & list info -- http://www.scruz.net/~luke/signup.htm
Mersenne Prime FAQ      -- http://www.tasam.com/~lrwiman/FAQ-mers
Mersenne: Re: build-it-yourself Athlon

Reply via email to