On Tue, 23 Jan 2007, John Meacham wrote:
I think the benchmarks are flawed in an important way. I believe (but am not positive) that ARPREC uses a special factorized form of representing numbers, which makes multiplicative-type operations extremely fast but simple things like addition/subtraction quite slow.

Oh, don't worry. I've given up on ARPREC for the simple reason that *every* operation on small-size operands is very slow, to say nothing of the time it would take to initialise ARPREC every time. ARPREC does store numbers using a representation similar to floating point: essentially an array of doubles where:
        (int)(array[0]) = array_size
        (int)(array[1]) = no_mantissa_words
        array[2]        = exponent
        array[3 ..]     = mantissa
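
For concreteness, here is a minimal C sketch of decoding that header, assuming only the array-of-doubles layout above; the function name and printed fields are mine, not ARPREC's API:

    #include <stdio.h>

    /* Illustrative only: read the header of an ARPREC-style number,
     * stored as an array of doubles laid out as described above. */
    static void print_arprec_header(const double *array)
    {
        int array_size        = (int)array[0];   /* words allocated     */
        int no_mantissa_words = (int)array[1];   /* mantissa words used */
        double exponent       = array[2];
        const double *mantissa = &array[3];

        printf("size=%d mantissa_words=%d exponent=%g first_word=%g\n",
               array_size, no_mantissa_words, exponent, mantissa[0]);
    }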
I hope the tests aren't "flawed": the main purpose of performing those operations was to see mid-range performance (extreme performance being Integers with operands greater than 10,000 or 30,000 bits). The crummy graphs do show that I made a low cutoff of 256 bits, so the most common Integer use (< 128 bits, i.e. 4 uint32_t words) wasn't even tested. I probably didn't clarify why I tested larger operand sizes: GHC should have something comparable to GMP, and that means speed (and precision) for medium-large operands as well as small ones.

you are only benchmarking multiplicative or similar routines, giving
ARPREC a huge lead, when in practice it might end up being slowest, as
addition/subtraction are far more common than multiplication.

In the common case it is slowest--but not by much.

Also, how are you choosing the numbers to test with? It is possible that
some packages are using 'sparse' representations or other specialized
forms if all your numbers happen to be powers of two or something.

Random numbers--a different number for each iteration, all of the same size. I used a PRNG based on the SNOW2 stream cipher--something I wrote a while ago; it is as fast as arc4random and tests well on DIEHARD and other statistical suites. The GMP and OpenSSL random number generators were slower, and I wanted to use the same generator across libraries.
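
To give an idea of what the operand generation looked like, here is a rough C sketch; the xorshift generator below is only a stand-in for the SNOW2-based PRNG, and the names are made up:

    #include <stdint.h>
    #include <stddef.h>

    static uint32_t prng_state = 0x12345678u;   /* seed; stand-in generator */

    static uint32_t prng_next(void)             /* xorshift32, not SNOW2 */
    {
        uint32_t x = prng_state;
        x ^= x << 13;
        x ^= x >> 17;
        x ^= x << 5;
        return prng_state = x;
    }

    /* Fill op[0..nwords-1] with a fresh random operand of exactly
     * nwords * 32 bits: a new value every iteration, but always the
     * same size, as in the benchmarks above. */
    static void random_operand(uint32_t *op, size_t nwords)
    {
        for (size_t i = 0; i < nwords; i++)
            op[i] = prng_next();
        op[nwords - 1] |= 0x80000000u;   /* pin the top bit so the bit length is fixed */
    }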

also, pretty much all uses of integers will be for very small integers; we should be careful not to lose sight of speed for the common case due to pretty asymptotic bounds.

All numbers were the same size, so cases like multiplying a 1024-bit operand by a 256-bit operand weren't tested; that could be a real flaw. It's all very sloppy--just to get an idea of where things generally line up.

So, where am I now? I worked on GHC-Win off and on, then went back to making a uniform API between GHC and the replacement, and I am now re-writing the replacement. I thought I would be done several weeks ago, but of course little things take over...

One area of GHC I would really like to change is the memory use. ForeignPtr seems to work well but places the full burden of memory management on the Integer library; for SIMD use (AltiVec, SSE2), the library would require GHC to allocate memory aligned to 16 bytes. Right now, the only choice would be allocatePinned() (in rts/sm/Storage.c), which is 4-byte aligned and is, of course, pinned so the GC can't move it. Imagine the RTS memory problems you could have if you wrote a Haskell program using lots of small Integers allocated with, say, an allocatePinned_16() (16-byte aligned). The alternative would be to use normal memory but re-align the operands for the vectorized operations by peeling; o.k., doable (especially easy with AltiVec), but slower.

In the meantime I have been working through SSE2 assembler, which has no addc (add-with-carry) operation and doesn't set a flag on overflow, so I have been experimenting with a SWAR-like algorithm--essentially the same thing as GMP's nails--to make the normally 32-bit operands 31 bits wide, with the last bit reserved as a carry flag; a sketch of the idea follows below. Thorkil Naur and others have suggested writing the whole thing as small assembler operations and piecing them together in Haskell; I have been looking into that as well, but it seems to entail inlining every Integer function--imagine the code bloat.
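
To make the nails idea concrete, here is a minimal sketch in C with SSE2 intrinsics (rather than assembler) of adding two operands stored as 31-bit limbs in 32-bit words; the function name, the limb layout, and the scalar carry pass are my own assumptions, not the replacement library's actual code:

    #include <emmintrin.h>   /* SSE2 intrinsics */
    #include <stdint.h>
    #include <stddef.h>

    #define LIMB_MASK 0x7FFFFFFFu   /* 31 value bits per word; bit 31 is the spare carry bit */

    /* r = a + b, where a and b are little-endian arrays of n 31-bit limbs
     * (top bit of every input word assumed zero) and n is a multiple of 4.
     * Returns the carry out of the most significant limb. */
    static uint32_t add_nails_sse2(uint32_t *r, const uint32_t *a,
                                   const uint32_t *b, size_t n)
    {
        /* Pass 1: plain lane-wise adds.  Each limb is only 31 bits, so a
         * 32-bit lane cannot overflow and no addc is needed; the per-lane
         * carry simply lands in the spare bit 31. */
        for (size_t i = 0; i < n; i += 4) {
            __m128i va = _mm_loadu_si128((const __m128i *)(a + i));
            __m128i vb = _mm_loadu_si128((const __m128i *)(b + i));
            _mm_storeu_si128((__m128i *)(r + i), _mm_add_epi32(va, vb));
        }

        /* Pass 2: strip the carry bit from each limb and ripple it into
         * the next one.  Scalar here just to keep the sketch short. */
        uint32_t carry = 0;
        for (size_t i = 0; i < n; i++) {
            uint32_t s = r[i] + carry;   /* at most 0xFFFFFFFF, cannot wrap */
            carry = s >> 31;
            r[i]  = s & LIMB_MASK;
        }
        return carry;
    }

The point of the spare bit is that the vectorized pass never needs an add-with-carry at all; the carry handling all moves into the second pass.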

Cheers,
Pete
