Re: Cloning in D

Michel Fortin Mon, 06 Sep 2010 19:20:37 -0700

On 2010-09-06 20:55:16 -0400, dsimcha <dsim...@yahoo.com> said:

> == Quote from Michel Fortin (michel.for...@michelf.com)'s article
>> I'm under the impression that a too permissive generic implementation
>> of cloning is going to break things in various scenarios.
> In general you raise some very good issues, but IMHO the right way to do cloning
> is to have permissive generic cloning that works in the 90% of cases and can be
> easily overridden in the 10% of cases, not to require writing tons of boilerplate
> in the 90% of cases just to make sure it doesn't do the wrong thing by default in
> the 10% of cases.

To me, automatic cloning of everything (physical cloning in your parlance) looks more like a 50/50 work/doesn't-work ratio. I can only guess, but I'm probably used to different use cases than you are.

> A second point is that the thing that brought this whole cloning issue to my mind
> was making std.concurrency's message passing model less obtuse.  Right now it's
> hard to use for non-trivial things because there's no safe way to pass complex
> state between threads. If we start allowing all kinds of exceptions to the "clone
> the **entire** object graph" rule, cloning will rapidly become useless for safely
> passing complex object graphs between threads.

This I agree with. I'm not arguing against automatic cloning per se, I'm just trying to show cases where it doesn't work well.

Personally, I'm rather skeptical that we can make it safe and efficient at the same time without better support from the language, something akin to the mythical "unique" type modifier representing a reference with no aliasing.

>> What if your
>> object or structure is part of a huge hierarchy where things contain
>> pointers to their parent (and indirectly to the whole hierarchy), will
>> the whole hierarchy be cloned?
>
> Isn't that kind of the point?

Well, that depends. If you send each leaf of a tree as a message to various threads, presumably to perform something concurrently with the data in that leaf, then you may want only the leaf to be copied. You may not want every parent down to the root and then up to every other leaf to be copied along with each message just because the leaf you send has a pointer to the parent.

In fact, it depends on the situation. If what you want to do with the leaf in the other thread requires the leaf to know its parent and everything else, then sure you need to copy the whole hierarchy. But otherwise it's a horrible waste of memory and CPU to clone the whole object graph for each message, even though it won't affect the program's correctness.
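
As a rough illustration of the leaf case, here is a minimal sketch that counts how many nodes each approach would have to copy. The names `Node`, `reachable` and `cloneLeafOnly` are hypothetical, made up for this example; they are not part of Phobos or of any proposal in this thread.

```d
import std.stdio;

// Hypothetical tree node, for illustration only.
class Node
{
    int value;
    Node parent;      // back-pointer, not owned by this node
    Node[] children;  // owned by this node

    this(int value) { this.value = value; }

    void add(Node child)
    {
        child.parent = this;
        children ~= child;
    }
}

// Counts every node reachable from n through parent and child links,
// which is what a clone of the entire object graph would have to copy.
size_t reachable(Node n, ref bool[Node] seen)
{
    if (n is null || (n in seen) !is null)
        return 0;
    seen[n] = true;
    size_t count = 1 + reachable(n.parent, seen);
    foreach (c; n.children)
        count += reachable(c, seen);
    return count;
}

// Assumed policy: copy only the leaf's own data and drop the parent link.
Node cloneLeafOnly(Node leaf)
{
    return new Node(leaf.value);
}

void main()
{
    auto root = new Node(0);
    foreach (i; 1 .. 4)
        root.add(new Node(i));
    auto leaf = root.children[0];

    bool[Node] seenFull;
    writeln("full-graph clone copies: ", reachable(leaf, seenFull));

    bool[Node] seenLeaf;
    writeln("leaf-only clone copies:  ", reachable(cloneLeafOnly(leaf), seenLeaf));
}
```

With a root and three leaves, following the parent pointer drags in all four nodes for every message, while the leaf-only copy touches one. Which of the two is right still depends on what the receiving thread needs.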

And it's basically the same thing with observers. If your observer is a controller in charge of updating a window when something changes, you don't want to clone the observer, then clone the window and everything in it just because you're sending some piece of data to another thread. Perhaps the program architecture is just wrong, or perhaps that observer is a synchronized class capable of handling function calls from multiple threads so it doesn't really need to be copied.

>> What happens if your object or structure
>> maintains a reference to a singleton, will we get two instances of a
>> singleton?
>
> Very good point.  I guess the reasonable use case for holding a reference to a
> singleton (instead of just using the globally accessible one) would be if it's
> polymorphic with some other object type?  If you're using message passing
> concurrency, most of your mutable singletons are probably thread-local, and what
> you probably really want to do is use the thread-local singleton of the thread
> you're passing to.

What intrigues me is how such a mechanism would work... although in my mind it's probably not even worth supporting at all, singletons be damned!

>> My understanding is that a data structure containing a pointer cannot
>> be cloned safely unless it contains some specific code to perform the
>> cloning. That's because the type system can't tell you which pointers
>> point to things owned by the struct/class and which ones need to be
>> discarded when cloning (such as a list of observers, or the parents of
>> a hierarchy).
> This discussion is making me think we really need two kinds of cloning: Physical
> cloning would clone the entire object graph no matter what, such that the cloned
> object could be safely passed to another thread via std.concurrency and be given a
> unique type. Logical cloning would be more like what you describe. In general,
> this discussion has been incredibly useful because I had previously only
> considered physical cloning.

This is an interesting and valid observation. But I think you need to leave a door open to customization of the "physical cloning" case too. The ability to avoid cloning unnecessary data is as necessary as the ability to easily copy an entire object graph.
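
To make the distinction concrete, here is a hedged sketch of the three behaviours discussed above. Everything in it is an assumption for illustration: `Window`, `Model` and the three method names are hypothetical, and no such API exists in Phobos or has been proposed in this thread.

```d
import std.stdio;

// Hypothetical types, for illustration only.
class Window
{
    string title;
    this(string title) { this.title = title; }
}

class Model
{
    int[] data;       // owned: any clone must duplicate it
    Window observer;  // not owned: whether to copy it is a policy decision

    this(int[] data, Window observer)
    {
        this.data = data;
        this.observer = observer;
    }

    // "Physical" clone: duplicate everything reachable, observer included.
    Model physicalClone()
    {
        Window w = observer is null ? null : new Window(observer.title);
        return new Model(data.dup, w);
    }

    // "Logical" clone: duplicate owned data, keep sharing the observer.
    Model logicalClone()
    {
        return new Model(data.dup, observer);
    }

    // Customized physical clone: no aliasing with the original, but the
    // observer is dropped instead of being copied.
    Model cloneWithoutObserver()
    {
        return new Model(data.dup, null);
    }
}

void main()
{
    auto m = new Model([1, 2, 3], new Window("main"));

    auto p = m.physicalClone();
    assert(p.data == m.data && p.data.ptr !is m.data.ptr);
    assert(p.observer !is m.observer);   // the window was copied too

    auto l = m.logicalClone();
    assert(l.observer is m.observer);    // the window is shared

    auto c = m.cloneWithoutObserver();
    assert(c.observer is null);          // nothing shared, nothing wasted

    writeln("all clone variants behave as described");
}
```

Only the first and third variants leave the clone free of aliasing with the original, which is the property needed before handing it to another thread; the third gets there without copying the window.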



--
Michel Fortin
michel.for...@michelf.com
http://michelf.com/
