Hi again Iavor,

A couple of performance ideas, if you want to test them:

unsafeInterleaveIO is cheap until you need to evaluate its result. So how about this: it cuts the number of "structural" unsafeInterleaveIOs to a third. If the old version spent "2" units of time per node on unsafeInterleaveIO-ing (one structural, one for the value), this version should spend about "1.33" (one structural per three nodes, plus one per value), at the cost of a bit more time/memory constructing Nodes that might never be used:

gen r = unsafeInterleaveIO $ do
          v1 <- unsafeInterleaveIO (genSym r)
          v2 <- unsafeInterleaveIO (genSym r)
          v3 <- unsafeInterleaveIO (genSym r)
          n1 <- gen r; n2 <- gen r; n3 <- gen r; n4 <- gen r
          return (Node v1 (Node v2 n1 n2) (Node v3 n3 n4))

I also feel tempted to apply the static-argument transformation manually, something like
   gen r = gen'
     where
       gen' = unsafeInterleaveIO $ do
          v <- unsafeInterleaveIO (genSym r)
          n1 <- gen'; n2 <- gen' --etc.
          return (Node ...)

or similar

which I guess is safe because this is only unsafeInterleaveIO, not unsafePerformIO? I don't know whether it would be speed-beneficial, though.

version 0.4:
genericNewSupply :: b -> (IORef b -> IO a) -> IO (Supply a)
genericNewSupply start genSym = gen =<< newIORef start
  where gen r = unsafeInterleaveIO
              $ do ls <- gen r
                   rs <- gen r
                   return (Node (unsafePerformIO (genSym r)) ls rs)

Why unsafePerformIO; was it faster? (I'd guess slower, actually, since unsafePerformIO is NOINLINE.) It's considerably less safe than unsafeInterleaveIO! For example, apply the static-argument transformation above, then float the unsafePerformIO out of gen' because it's the same expression on every pass, and suddenly all the "unique" values are the same!
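
A rough sketch of what that float-out could look like (a hypothetical rewrite the simplifier is entitled to make, not code from the package):

gen r = gen'
  where
    -- "unsafePerformIO (genSym r)" doesn't mention anything bound inside
    -- gen', so it can be floated out and shared:
    v    = unsafePerformIO (genSym r)   -- computed once, shared by every node
    gen' = unsafeInterleaveIO $ do
              ls <- gen'
              rs <- gen'
              return (Node v ls rs)     -- every "unique" value is now the same

With "v <- unsafeInterleaveIO (genSym r)" inside the do block instead, the bind runs once per forced node, so there is no single pure expression for the compiler to share like that.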

Ultimately, we can make this value-supply very good :-)

Also, I might call "unsafeNewIntSupply" something more specific, like "unthreadsafeNew...", or use the more obscure but conventional word "dupable". Did specializing it to Int help, i.e. why not "unsafeGenericNewSupply"? I can imagine a simple data type that isn't an Int where you'd still want to avoid the thread-safety overhead.

Also, your implementation of it could be more efficient: it doesn't need to do any locking, so I suggest modifyIORef rather than atomicModifyIORef. (Actually, because modifyIORef has a different type than atomicModifyIORef, you'll have to write it as readIORef followed by writeIORef and a return instead.)

Possible refactor: all the functions have the shape "***GenSym r = atomicModifyIORef r (some expression that doesn't mention r)"; the "[atomic]modifyIORef r" could be the caller's responsibility instead, leaving e.g. listGenSym (a:as) = (as, a).
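
A minimal sketch of both points (the names intGenSymUnsafe/listGenSym are just illustrative, and I'm assuming an Int-counter supply like the one above):

import Data.IORef

-- Non-atomic Int generator for the single-threaded ("dupable") case:
-- no locking, just a plain read and write.
intGenSymUnsafe :: IORef Int -> IO Int
intGenSymUnsafe r = do
  n <- readIORef r
  writeIORef r (n + 1)
  return n

-- The refactor: make the step pure and leave the [atomic]modifyIORef
-- to the caller, e.g.  atomicModifyIORef r listGenSym
listGenSym :: [a] -> ([a], a)
listGenSym (a:as) = (as, a)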

In fact, for lists (you get an incomplete-pattern-match warning there, but you know the list is always infinite because you made it with "iterate"), you could instead use an infinite-list type: Data.Stream from the "Stream" package [*]. Since Stream is not a sum type (it has only one constructor, Cons), it might even be a bit more efficient!

[*] http://hackage.haskell.org/packages/archive/Stream/0.2.6/doc/html/Data-Stream.html
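
Something along these lines (a sketch; streamGenSym is a made-up name, and I'm assuming Data.Stream's Cons constructor is exported — otherwise S.head/S.tail would do):

import qualified Data.Stream as S

-- No incomplete-pattern warning: a Stream is always a Cons.
streamGenSym :: S.Stream a -> (S.Stream a, a)
streamGenSym (S.Cons a as) = (as, a)

-- and the initial state would come from e.g.  S.iterate succ 0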

Thanks for your effort, and especially for measuring the performance timings!
-Isaac