'functional' performance VS 'imperative' complexity

Jim - FooBar(); Sat, 25 Aug 2012 13:01:28 -0700

Hello everyone,

in this post I'm not asking for something specific, but rather I'd liketo spark a discussion regarding the issue of performance within thefunctional paradigm...most of the things i will mention will probablynot be news for most of you...Hopefully, however the issues I plan toraise will eventualy help someone else faced with the same dilemmas as me...

First of all, let me clarify upfront that I strongly believe that'functional' and 'immutable' should be the default (as Rich often says).This thread is certainly not about praising the 'imperative' style. Itis about having all the facts before you start coding (probably beforeeven designing)...Most of you presumably already do...

Ok so, it is evident from my other posts that I'm building a library forwriting board games. In a nutshell, when it's finished, I'd like someoneelse to be able to write up his own board game, show it up on screen andgenetically train a neural-net for an opponent, in literally less than5-6 hours (~ 100 LOC).Now, for those of you that have done any board games you can immediatelyidentify the hot-spots of such a program. These are 2: actuallyexploring the game-tree and training the neural-net. Notice how boththese tasks can be run in parallel...(exploring the game tree is notimmediately apparent how to do in parallel but we have reducers)...

Generally there are 3 major ways going about writing a program -functionally all the way, imperatively all the way, a mixture. I sort ofimplemented all 3 categories for my chess game and I've got someinteresting results:


1. Firstly and more importantly (I mean that), the purely functional
   and immutable road is simply  such a pleasure to travel...There are
   no words to describe the beauty, clarity and elegance of the
   functional version. Mutation is non-existent or only through
   reference types and operations like 'update-position' and 'move'
   return brand new piece and board respectively. Also, it is the only
   one that is extremely stable and always brings back the correct
   answer. It is *gorgeous*... On the flip-side, it performs horrible!
   It is very very slow for realistic depths like 4 or 6 even
   regardless of utilising reducers to the maximum and countless
   optimisations. The best time I can report is 9 min for level 4.
2. after watching Daniel Solano Gomez's presentation on infoq ("11 tips
   to boost performance"), I realised that If I wanted raw speed (as he
   puts it), I 'd have to resort to arrays. Well, I made my heart a
   stone and went to implement an array-based version that favours
   mutation. Having such modular code in the first place, that did not
   take too long...I just wrote up different version of 'move' and
   'collides?' (amove, acollides?) that know how to deal with arrays
   and created a ChessPiece2 record which holds a java.awt.Point object
   which is mutated by update-position (instead of returning a brand
   new piece). Basically,  (I thought) i was done in 30 min... However
   it turned out I was being sooo ignorant!!! Making such a u-turn in
   programming paradigms while working on the same project is never
   that simple. The functional style protected me from so many bad
   things...of course, I already knew that but I was surprised to see
   how many these are! For instance, making a move in the functional
   version caused absolutely no damage...there is an 'execute!' fn that
   does damage if we want it to (via atom only) but this is to be used
   only when we decide what move we want. Now, trying out a move messes
   up everything!!! Now, I need means of undoing and not only that...My
   entire searching algorithm can no longer function properly...Off the
   top of my head, I need some sort of serial loop/recur that tries
   moves when recursion rolls in and takes them back (by undoing) when
   recursion rolls out . In other words I need to keep track of the
   changes carefully! On the filp-side, even though this version has
   bugs and does not return the correct answer, it seems it can reach
   level 4 in roughly 2 min. This is 4x faster! Again, I'm being
   cautious cos there are bugs so I can't be sure of the time but there
   seems to be a dramatic performance increase...The code however is a
   lot buggier and uglier...
3. Finally I tried doing a functional version that uses mutable arrays
   but doesn't mutate them...Instead critical fns like 'move' build new
   arrays (btw 'aclone' does not deep copy) to return and updating a
   piece's position does return a new piece... This version, is not
   very stable but does return the correct answer in just over 7 min
   for level 4...again the importance of immutability shines! the
   timing shows that i got 22%  better performance. I have not profiled
   this but I expect most of the time being spent copying arrays.

In essence what am I telling you here? 2 things basically... Firstly,think about performance before you actually design your solution...itmay be that some tasks are not well suited for immutable data-structuresand it can be a show stopper. Don't think for a second that you canconvert your gorgeous, purely immutable approach to a purely mutable(better performing one) without any cost. These are 2 fundamentallydifferent choices that lead to completely different algorithmic designs...

so where does this leave me? well, I am going to stick with the elegant,all-clojure solution and perhaps find a 8/12-core machine to do mytraining on... however, in a different setting I might chooseotherwise...at the moment I don't have time to spend hundreds of hoursto fix all the bugs of the mutable version so i can get it to preformbetter... I hate the process as well...if someone was paying me though,my ego would be far less!

Of course, there are going to be people that will say: "You're using thewrong algorithm!", "You can prune the tree!" etc etc...that is not thepoint though...even of I do pruning does anybody think I can get tolevel 4 in less than 5 min? That would require pruning half the tree!anyway, what I'm trying to say is that in algorithms like theseperformance matters a lot... most machine learning algorithms involvematrix multiplication...why do you think all the machine-learners usematlab? you think they enjoy writing matlab code? No - it just goessuper fast ! If performance matters to you , then perhaps your time isbest spent thinking a mutable approach from day 1... I cannot believe Isaid that but there you go - I said it!


No hard feelings eh? I still love Clojure... :-)

Jim





--
You received this message because you are subscribed to the Google
Groups "Clojure" group.
To post to this group, send email to clojure@googlegroups.com
Note that posts from new members are moderated - please be patient with your 
first post.
To unsubscribe from this group, send email to
clojure+unsubscr...@googlegroups.com
For more options, visit this group at
http://groups.google.com/group/clojure?hl=en

'functional' performance VS 'imperative' complexity

Reply via email to