Re: [Haskell-cafe] WideFinder

Sterling Clover Sat, 10 Nov 2007 21:27:10 -0800

http://www.tbray.org/tmp/o10k.ap is the basic data set. For heavierduty testing, folks seem to be appending it to itself 99 more timesto yield a "o1000k.ap" dataset. I'd be curious for comments on mycode or other suggestions to speed things up -- the strictnesssemantics of the mapUnionPar function seem pretty decent to me, butI'd like to find a way to give higher preference to evaluating lateriterations of until as opposed to earlier ones (so as to improvememory performance) but can't think of any way to do that withoutexplicit threads. Implementing memory mapped reads, as was suggestedhere recently in a different context, might be another bigperformance gain.

On my decidedly not powerful machine (Mac PowerPC G5, 1.8GHz) I can'tget much lower than 12.25s for the 1000k dataset (out of which,roughly 3s in GC), which is 192M, which is actually slower than hissample ruby implementation. :-(. I'm sure parallel processing willhelp quite a bit, however, as profiling indicates that most time isspent in the "count" function. Maps are a good choice for parallelismbecause they merge efficiently, but for the iterative aspect theirperformance leaves a lot to be desired. This seems evident in thateven on a single processor, lower sizes of chunks, at least to apoint, still improve overall performance, although this may possiblybe equally an issue with space efficiency.

I wonder if Haskell's lack of an efficient hashtable isn't hurting ithere again too, but on the other hand for a real efficiency gain,switching to a custom-built trie that combined pattern matching andinsertion into a single operation would probably be a significantwin, and it would let us force unboxing ints too, for whatever thatgains.


--S

On Nov 10, 2007, at 3:36 AM, Berlin Brown wrote:

Which data set did you test it on?


_______________________________________________
Haskell-Cafe mailing list
Haskell-Cafe@haskell.org
http://www.haskell.org/mailman/listinfo/haskell-cafe

Re: [Haskell-cafe] WideFinder

Reply via email to