Re: Timsort vs some others

Xinok Mon, 17 Dec 2012 22:55:32 -0800

On Monday, 17 December 2012 at 15:28:36 UTC, bearophile wrote:

Regarding the recent Phobos improvements that introduce aTimsort:
http://forum.dlang.org/thread/50c8a4e67f79_3fdd19b7ae8146...@sh3.rs.github.com.mail
I have found a blog post that compares the performance ofTimsort, Smoothsort, and std::stable_sort:
http://www.altdevblogaday.com/2012/06/15/smoothsort-vs-timsort/

Bye,
bearophile

While Smoothsort may be mathematically sound, it simply doesn'ttranslate well to computer hardware. It's a variant of heap sortwhich requires a great deal of random access, whereas Timsort isa variant of merge sort which is largely sequential and benefitsfrom the CPU cache. Furthermore, the Leonardo heap is much morecomputationally expensive than a typical binary or ternary heap.

Both Timsort and Smoothsort are what you call "natural sorts,"meaning they typically require fewer comparisons on data with lowentropy. They're also rather complex which means added overhead.When sorting primitive types like int, comparisons areinexpensive, so the overhead makes these algorithms slower. Buthad he tested it on a data type like strings, then we'd likelysee Timsort take the lead.

On purely random data, quick sort and merge sort will win most ofthe time. But Timsort has an advantage over Smoothsort of beingan adaptive algorithm; the so called "galloping mode," which iscomputationally expensive, is only activated when minGallopreaches a certain threshold and therefore beneficial. Otherwise,a simple linear merge is used (i.e. merge sort).

On another note, I highly doubt that std::sort uses a "median ofmedians" algorithm, which would add much overhead and essentiallydouble the number of comparisons required with little to nobenefit. More likely, it simply chooses the pivot from a medianof three.

Re: Timsort vs some others

Reply via email to