And here's another interesting algorithm/structure: Randomized Slide to Front

Andrei Alexandrescu via Digitalmars-d Mon, 30 Nov 2015 13:36:02 -0800

Now that we got talking about searching in arrays, allow me to also share an idea I've had a short while ago.

(Again, we're in the "I'd prefer to use an array if at all possible" mindset. So let's see how we can help searching an array with as little work as possible.)

One well-known search strategy is "Bring to front" (described by Knuth in TAoCP). A BtF-organized linear data structure is searched with the classic linear algorithm. The difference is what happens after the search: whenever the search is successful, the found element is brought to the front of the structure. If we're looking most often for a handful of elements, in time these will be near the front of the searched structure.

For a linked list, bringing an element to the front is O(1) (just rewire the pointers). For an array, things are not so pleasant - rotating the found element to the front of the array is O(n).
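
To make the array cost concrete, here is a minimal Python sketch of BtF on an array-backed list. The function name search_move_to_front is made up for illustration; it is not from TAoCP or any library.

```python
def search_move_to_front(items, target):
    """Linear search; on a hit, rotate the found element to index 0.

    Returns the index at which the target was found (before the move),
    or -1 if the target is absent.
    """
    for k, value in enumerate(items):
        if value == target:
            # On an array this is O(k): every element in items[0:k]
            # shifts one slot to the right to make room at the front.
            items.insert(0, items.pop(k))
            return k
    return -1


data = [10, 20, 30, 40]
found_at = search_move_to_front(data, 30)
# found_at == 2, data == [30, 10, 20, 40]
```

The rotation preserves the relative order of all the other elements, which is exactly what makes it expensive on contiguous storage.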


So let's see how we can implement a successful BtF for arrays.

The first idea is to just swap the found element with the first element of the array. That's O(1) but has many disadvantages - if you search e.g. for two elements, they'll compete for the front of the array and they'll go back and forth without making progress.
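
A small Python sketch of that failure mode, assuming a hypothetical helper search_swap_to_front and a toy five-element array:

```python
def search_swap_to_front(items, target):
    """Linear search; on a hit, swap the found element with items[0].

    Returns the index at which the target was found, or -1 if absent.
    """
    for k, value in enumerate(items):
        if value == target:
            items[0], items[k] = items[k], items[0]
            return k
    return -1


data = ["a", "b", "c", "x", "y"]
positions = []
for wanted in ["x", "y", "x", "y"]:
    positions.append(search_swap_to_front(data, wanted))
# positions == [3, 4, 4, 4]
```

After the first search, each of the two hot elements keeps evicting the other to the very end of the array, so every later search scans the whole thing.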

Another idea is to just swap the found element with the one just before it. The logic is, each successful find will shift the element closer to the front, in a bubble sort manner. In time, the frequently searched elements will slowly creep toward the front. The resulting performance is not appealing - you need O(n) searches to bring a given element to the front, for a total of O(n * n) steps spent in the n searches. Meh.
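
For illustration, a Python sketch of this swap-with-predecessor variant (the name search_transpose is my own), counting how many searches an element at the back needs to reach the front:

```python
def search_transpose(items, target):
    """Linear search; on a hit, swap the found element with its predecessor.

    Returns the index at which the target was found, or -1 if absent.
    """
    for k, value in enumerate(items):
        if value == target:
            if k > 0:
                items[k - 1], items[k] = items[k], items[k - 1]
            return k
    return -1


data = list(range(8))
searches = 0
while data[0] != 7:          # 7 starts at the last index
    search_transpose(data, 7)
    searches += 1
# searches == 7, i.e. n - 1 searches for an array of n elements
```

Each of those searches is itself a linear scan, which is where the quadratic total comes from.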

So let's improve on that: whenever an element is found in position k, pick a random number i in the range 0, 1, 2, ..., k inclusive. Then swap the array elements at indexes i and k. This is the Randomized Slide to Front strategy.
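
As a minimal sketch in Python (search_rstf and its rng parameter are illustrative names, not an existing API), the search step could look like this:

```python
import random


def search_rstf(items, target, rng=random):
    """Randomized Slide to Front: linear search, then one random swap.

    On a hit at index k, picks i uniformly from 0..k (inclusive) and swaps
    items[i] with items[k]. Returns the new index i of the found element,
    or -1 if the target is absent.
    """
    for k, value in enumerate(items):
        if value == target:
            i = rng.randint(0, k)   # randint includes both endpoints
            items[i], items[k] = items[k], items[i]
            return i
    return -1


data = list(range(100))
new_index = search_rstf(data, 57, random.Random(0))
# 0 <= new_index <= 57 and data[new_index] == 57
```

Passing a seeded random.Random instance is only there to make runs reproducible; the default uses the module-level generator.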

With RStF, worst-case search time remains O(n), as does the time of an unsuccessful search. However, frequently searched elements migrate quickly to the front - it only takes O(log n) searches to bring a given value to the front of the array.
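
One way to sanity-check the O(log n) figure is a quick simulation. The Python sketch below assumes the simplest setting, where only one value is ever searched for, so no other search moves it back. Under that assumption the expected number of searches T(k) for an element starting at index k satisfies T(k) = 1 + (T(0) + ... + T(k)) / (k + 1) with T(0) = 0, which works out to 1 + H_k, where H_k is the k-th harmonic number. All names are made up for the experiment.

```python
import random


def rstf_search(items, target, rng):
    """One RStF search; returns the new index of the found element."""
    k = items.index(target)      # linear scan; raises ValueError if absent
    i = rng.randint(0, k)        # uniform over 0..k inclusive
    items[i], items[k] = items[k], items[i]
    return i


def searches_until_front(n, rng):
    """Count searches needed to bring the last element to index 0."""
    items = list(range(n))
    target = n - 1
    count = 0
    while items[0] != target:
        rstf_search(items, target, rng)
        count += 1
    return count


rng = random.Random(2015)        # fixed seed, only for reproducibility
n, trials = 256, 1000
average = sum(searches_until_front(n, rng) for _ in range(trials)) / trials
harmonic = sum(1.0 / j for j in range(1, n))   # H_(n-1)
print(average, 1 + harmonic)     # the two numbers should be close
```

With interleaved searches for other values the element can occasionally be swapped backward, so this measures the idealized case only.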

Insertion and removal are both a sweet O(1), owing to the light structuring: to insert just append the element (and perhaps swap it in a random position of the array to prime searching for it). Removal by position simply swaps the last element into the position to be removed and then reduces the size of the array.
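
A short Python sketch of both operations, with hypothetical names rstf_insert and rstf_remove_at; the random swap on insert is the optional priming step mentioned above:

```python
import random


def rstf_insert(items, value, rng=random):
    """Append value, then swap it into a random position (optional priming)."""
    items.append(value)                 # amortized O(1) on a dynamic array
    last = len(items) - 1
    i = rng.randint(0, last)            # may pick last, i.e. no move
    items[i], items[last] = items[last], items[i]


def rstf_remove_at(items, pos):
    """Remove items[pos] in O(1) by overwriting it with the last element.

    Assumes 0 <= pos < len(items); element order is not preserved.
    """
    items[pos] = items[-1]
    items.pop()


data = [1, 2, 3, 4, 5]
rstf_remove_at(data, 1)
# data == [1, 5, 3, 4]
rstf_insert(data, 9, random.Random(0))
# data now holds 1, 3, 4, 5, 9 in some order
```

Neither operation preserves order, which is fine here because the structure makes no ordering promise beyond "hot elements drift forward".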

So the RStF is suitable in all cases where BtF would be recommended, but allows an array layout without considerable penalty.

Related work: Theodoulos Garefalakis' Master's thesis "A Family of Randomized Algorithms for List Accessing" describes Markov Move to Front, which brings the searched element to front according to a Markov chain schedule; and also Randomized Move to Front, which decides whether a found element is brought to front depending on a random choice. These approaches are similar in that they both use randomization, but different because neither has good complexity on array storage.



Andrei
