OK, so just for fun, I decided to try implementing a parallel merge sort using the seq and par combinators. My plan was to generate some pseudo-random data and time how long it takes to sort it. To try to account for lazy evaluation, what the program actually does is this (a rough sketch follows the list):

1. Write the input data to disk without any sorting. (This ought to force it to be fully evaluated.)
2. Sort and save the data to disk 8 times. (So I can average the runtimes.)
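
Concretely, the harness looks something like this - a sketch, not my exact code. Assume System.Random for the data; msortPar is the parallel sort sketched after the next paragraph, and the file names are just placeholders:

    import Data.Time.Clock (diffUTCTime, getCurrentTime)
    import System.Random (mkStdGen, randoms)

    main :: IO ()
    main = do
        -- 1 million pseudo-random Ints from a fixed seed.
        let xs = take 1000000 (randoms (mkStdGen 42)) :: [Int]

        -- Step 1: write the unsorted data, forcing the whole list.
        writeFile "unsorted.txt" (unlines (map show xs))

        -- Step 2: sort and save 8 times, printing each run's wall time.
        mapM_ (\i -> do
                  start <- getCurrentTime
                  writeFile ("sorted" ++ show i ++ ".txt")
                            (unlines (map show (msortPar xs)))
                  end <- getCurrentTime
                  print (diffUTCTime end start))
              [1 .. 8 :: Int]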

This is done with two data sets - one with 1 million items and another with 2 million. Each data set is run through both the purely sequential algorithm and a simple parallel one. (Split the list in half, merge-sort each half in parallel, and then merge them - sketched below.)
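
For reference, the sort itself is roughly this (par and pseq come from Control.Parallel in the parallel package; the real code is a little longer):

    import Control.Parallel (par, pseq)

    -- Ordinary sequential merge sort.
    msort :: Ord a => [a] -> [a]
    msort []  = []
    msort [x] = [x]
    msort xs  = merge (msort as) (msort bs)
      where (as, bs) = splitAt (length xs `div` 2) xs

    merge :: Ord a => [a] -> [a] -> [a]
    merge [] ys = ys
    merge xs [] = xs
    merge (x:xs) (y:ys)
        | x <= y    = x : merge xs (y:ys)
        | otherwise = y : merge (x:xs) ys

    -- Naive parallel version: spark one half, sort the other
    -- in the current thread, then merge. (Caveat: par only
    -- sparks evaluation to WHNF - the first cons cell - so
    -- most of the work still happens lazily during the merge.)
    msortPar :: Ord a => [a] -> [a]
    msortPar xs = left `par` (right `pseq` merge left right)
      where
        (as, bs) = splitAt (length xs `div` 2) xs
        left     = msort as
        right    = msort bs

Only the top-level split is sparked; each half is sorted with the ordinary sequential msort.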

The results of this little benchmark utterly defy comprehension. Allow me to enumerate:

Weird thing #1: The first time you sort the data, it takes a few seconds. The other 7 times, it takes a split second - roughly 100x faster. Wuh?

Weird thing #2: The parallel version runs *faster* than the sequential one in all cases - even with SMP disabled! (We're only talking a few percent faster, but still.)

Weird thing #3: Adding the "-threaded" compiler option makes *everything* run a few percent faster. Even with only 1 OS thread.

Weird thing #4: Running with "+RTS -N2" makes *everything* slow down a few percent - and Task Manager shows only one CPU core in use.

Adding more than 2 OS threads makes everything slow down even further - but that's hardly surprising.

Can anybody explain any of this behaviour? I have no idea what I'm benchmarking, but it certainly doesn't appear to be the performance of a parallel merge sort!
