Re: pool-map - a more disciplined version of pmap on top of executors

Jim foo.bar Tue, 15 Jan 2013 04:36:12 -0800

On 15/01/13 09:25, Marko Topolnik wrote:

The order in which you are polling is not very relevant given the fact that /doall/ won't return until *all* futures are realized. It's just an internal detail.

I finally fully grasped what you were saying... So yes, you're right: as long as I'm forcing realisation at the end there is nothing to be gained... However, what if I submit jobs eagerly and poll for results lazily? Then there must be some gain from using the completion service, which will bring back the results in the order they finished... some basic testing:


(defn pool-map
  "A saner, more disciplined version of pmap. Submits jobs eagerly but
  polls for results lazily.

  Don't use if original ordering of 'coll' matters."
  [f coll]
  (let [cpu-no  (.. Runtime getRuntime availableProcessors)
        exec    (java.util.concurrent.Executors/newFixedThreadPool cpu-no)
        pool    (java.util.concurrent.ExecutorCompletionService. exec)
        futures (doall (for [x coll] (.submit pool #(f x))))] ;; submit everything up front
    (try
      ;; lazy: each element blocks on .take until the next job finishes
      (for [_ futures] (.. pool take get))
      ;; .shutdown is orderly: jobs already submitted still run to completion
      (finally (.shutdown exec)))))

;;your version is 'pool-map1'

;;weirdly enough 'pool-map1' doesn't behave lazily (even though it has a call to 'map'!)!!!



user=> (def dummy-times [3000 10 9 8 7 6 5 4 3 2 1])
#'user/dummy-times
user=> (time  (pmap #(do (Thread/sleep %) %) dummy-times))
"Elapsed time: 16.213366 msecs"

(3000 10 9 8 7 6 5 4 3 2 1) ;;here you waited 3 s before sleeping for 0.01 s

user=> (time  (pool-map #(do (Thread/sleep %) %) dummy-times))
"Elapsed time: 21.004979 msecs"

(10 9 8 7 6 5 4 3 2 1 3000) ;;here you've not waited at all - sleeping for 3 s finished last and is last

user=> (time  (pool-map1 #(do (Thread/sleep %) %) dummy-times))
"Elapsed time: 3008.174631 msecs"  ;;non-lazy?

(3000 10 9 8 7 6 5 4 3 2 1) ;;again your version will wait for the first item to finish before proceeding
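One caveat about reading the timings above: 'time' wrapped around a lazy seq only measures how long it takes to build the seq; the REPL realises it afterwards, while printing. A minimal sketch of the difference, assuming two made-up helpers ('slow-identity' and 'elapsed-ms' are names invented for this illustration, not part of either version of pool-map):

```clojure
;; Hypothetical helper: sleep for ms milliseconds, then return ms.
(defn slow-identity [ms]
  (Thread/sleep (long ms))
  ms)

;; Hypothetical helper: run a zero-argument fn and return the elapsed
;; wall-clock time in milliseconds.
(defn elapsed-ms [thunk]
  (let [start (System/nanoTime)]
    (thunk)
    (/ (- (System/nanoTime) start) 1e6)))

;; Only builds the lazy seq; none of the sleeps are waited for here.
(def lazy-ms   (elapsed-ms #(pmap slow-identity [300 10 10])))

;; doall forces every element, so the 300 ms sleep is included.
(def forced-ms (elapsed-ms #(doall (pmap slow-identity [300 10 10]))))
```

So to compare total run time the call probably wants a 'doall' inside 'time', whereas to compare time-to-first-result something like (time (first ...)) seems closer to the point.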

I think what you are trying to get across is that the overall timings (if we do realise the result) will not differ much, as all jobs have to finish eventually. In other words, sleeping for 3 s first and for 1 s later is the same thing as sleeping for 1 s and then for 3 s... and of course this is generally true! However, there is no real benefit in waiting for the 1st task to finish when we don't mind about ordering. You'll get the first item whenever it finishes, in whatever position... This MUST be good, but perhaps it needs to be paired with laziness to witness any effect?

Taking into account all that was said, /pool-map/ can't offer much more than /pmap/. You can't know which tasks will take less time until they are already done. It is theoretically impossible to pre-order them according to execution time, thereby harvesting the results of the fastest ones earlier, eventually promoting total concurrency.

hmmm... so the completion service is useless? It can't be... You say that 'You can't know which tasks will take less time until they are already done', but the way I see it you don't need to... all you need to know at any given time is whether one or more futures have completed. If one has indeed completed, you invoke .get for the result. If it hasn't finished and you do .get, it will block until it finishes, just like deref-ing in Clojure... I honestly don't see why harvesting the results of the fastest ones earlier requires knowing the execution times up front! As you go along you can ask the futures whether they have finished or not, can't you?
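To make that concrete, here is a rough sketch of harvesting results in completion order without knowing any execution times up front. The name 'completion-order' and the pool size of 4 are arbitrary choices for illustration only; this is not the code of pool-map or pool-map1.

```clojure
(import '(java.util.concurrent Executors ExecutorCompletionService Callable))

(defn completion-order
  "Hypothetical helper: runs (f x) for every x on a fixed pool and returns
  a vector of the results in the order the tasks finished."
  [f coll]
  (let [exec (Executors/newFixedThreadPool 4) ; arbitrary size, for illustration
        pool (ExecutorCompletionService. exec)
        ;; mapv is eager, so every job is submitted before we start polling
        n    (count (mapv (fn [x] (.submit pool ^Callable (fn [] (f x)))) coll))]
    (try
      ;; .take blocks until *some* job has finished, whichever it is;
      ;; .get on that future then returns without further waiting
      (vec (repeatedly n #(.get (.take pool))))
      (finally (.shutdown exec)))))

(def finished
  (completion-order (fn [ms] (Thread/sleep (long ms)) ms) [300 100 10]))
```

With more threads than jobs, as here, the shortest sleep should come back first and the 300 ms one last, even though it was submitted first.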

I am in no way trying to contradict you, I'm just trying to set things straight so we are all on the same page... again, thanks for your time and comments! :)



Jim


