On Tue, 18 May 2010 11:39:04 +0100, Daniel Ruoso <dan...@ruoso.com> wrote:
This is the point I was trying to address, actually. Having *only*
explicitly shared variables makes it very cumbersome to write threaded
code, especially because explicitly shared variables have a lot of
restrictions on what they can be (this is from my experience in Perl 5
and SDL, which was what brought me to the message-passing idea).
Well, do not base anything upon the restrictions and limitations of the
Perl 5 threads/shared modules. They are broken-by-design in so many ways
that they are not a good reference point. That particular
restriction--what a :shared var can and cannot hold--is in some cases just
an arbitrary restriction for no good reason that I can see.
For example, the restriction that file handles cannot be assigned to
:shared vars is totally arbitrary. This can be demonstrated in two ways:
1) If you pass the fileno of the filehandle to a thread and have it dup(2)
a copy, then it can use it concurrently with the originating thread
without problems--subject to the obvious locking requirements.
2) I've previously hacked the sources to bypass this restriction by adding
SVt_PVGV to the switch in the following function:
SV *
Perl_sharedsv_find(pTHX_ SV *sv)
{
    MAGIC *mg;
    if (SvTYPE(sv) >= SVt_PVMG) {
        switch(SvTYPE(sv)) {
        case SVt_PVAV:
        case SVt_PVHV:
        case SVt_PVGV: /* !!!!! the added case !!!!! */
            if ((mg = mg_find(sv, PERL_MAGIC_tied))
                && mg->mg_virtual == &sharedsv_array_vtbl) {
                return ((SV *)mg->mg_ptr);
            }
            break;
        default:
            /* This should work for elements as well as they
             * have scalar magic as well as their element magic
             */
            if ((mg = mg_find(sv, PERL_MAGIC_shared_scalar))
                && mg->mg_virtual == &sharedsv_scalar_vtbl) {
                return ((SV *)mg->mg_ptr);
            }
            break;
        }
    }
    /* Just for tidiness of API also handle tie objects */
    if (SvROK(sv) && sv_derived_from(sv, "threads::shared::tie")) {
        return (S_sharedsv_from_obj(aTHX_ sv));
    }
    return (NULL);
}
And with that one change, sharing file/directory handles in Perl 5 became
possible and worked.
The problem is, GVs can hold far more than just those handles. And many of
the glob modules utilise the other slots in a GV (array, hash, scalar, etc.)
for storing state and bless them as objects. At that point--when I tried
the change--there was a conflict between the blessing that Shared.XS uses to
make sharing work and any other type of blessing. The net result was
that whilst the change lifted the restriction upon simple globs, it still
didn't work with many of the most useful glob-based modules--IO::Socket::*;
HTTP::Daemon; etc. I guess that now the sharing of blessed objects has
been made possible, I should try the hack again and see if it would allow
those blessed globs to work.
Anyway, the point is that the limitations and restrictions of the Perl 5
implementation of the iThreads model should not be considered as
fundamental problems with the iThreads model itself. They aren't.
However, interpreters already have to detect closed-over variables in
order to 'lift' them and extend their lifetimes beyond their natural
scope.
Actually, the interpreter might choose to implement closed-over
variables by keeping the entire associated scope alive while it is still
referenced by another value, i.e.:
{ my $a;
  { my $b = 1;
    { $a = sub { $b++ } } } }
This would happen by having every lexical scope hold a reference
to its outer scope, so when a scope in the middle exits, but some
coderef was returned keeping it as its lexical outer, the entire scope
would be kept.
This means two things:
1) the interpreter doesn't need to detect the closed over variables, so
even string eval'ed access to such variables would work (which is, imho,
a good thing)
You'd have to explain further for me to understand why it is necessary to
keep whole scopes around:
- in order to make closures accessible from string-eval;
- and why that is desirable?
2) all the values in that lexical scope are also preserved with the
closure, even if they won't be used (which is a bad thing).
Please no! :)
This is essentially the biggest problem with the Perl 5 iThreads
implementation. It is the *need* (though I have serious doubts that it is
actually a need even for Perl 5) to CLONE entire scope stacks every time
you spawn a thread that makes them costly to use: both because of the time
it takes to perform the clone at spawn time, and because of the memory used
to keep copies of all that stuff that simply isn't wanted, and in many cases
isn't even accessible. AFAIK, going by what I can find about the history of
iThreads development, this was only done in Perl 5 in order to provide the
Windows fork emulation.
But as a predominantly Windows user I can vouch that that emulation is
almost completely useless. It doesn't allow for portability of forking
code, because almost every forking program also makes use of other POSIX
concepts--like signals, exec, etc.--that Windows does not support, and for
which the Perl 5 emulations are entirely inadequate to allow portability.
Far better to simply accept that fork, exec, signals, etc. simply do not
work on Windows and move on.
Removing the emulation code from the core would simplify everyone's life.
And if kernel threads are available within the core, without all the
emulation wrappers and sharing restrictions, external modules can provide
POSIX-like capabilities without hamstringing the native use of kernel
threading.
It doesn't seem it would be any harder to lift them to shared
variable status, moving them out of the thread-local lexical pads and into
the same data-space as process globals and explicitly shared data.
It is still possible to do the detection at the moment of the runtime
lookup, though...
I realise that CPS gives the ability to keep/maintain entire scope frames
alive after their natural end, but:
- at what cost in terms of memory? If Perl 5's thread-cloning is anything
to go by: expensive!
- is there any guarantee that *the* Perl interpreter (if there ever is
such a thing) will be CPS-based?
Looking at the history of Parrot it seems to impose huge development costs.
Whereas, detecting--at runtime--that a sub references one or more
variables from earlier scopes seems almost trivial. And lifting those into
the "global scope" seems both relatively trivial and natural.
Better surely that one or two closed-over vars persist (inaccessibly)
slightly beyond their strictly required lifetimes, than that whole rafts of
unneeded, inaccessible scopes (and all the variables, stack frames,
lexpads, etc. they contain) persist for the life of entire threads, just
because one or two of the vars they contain were closed over?
My currently favoured mechanism for handling shared data, is via
message-passing, but passing references to the shared data, rather than
the data itself. This seems to give the reason-ability, compose-ability
and controlled access of message passing whilst retaining the efficiency
of direct, shared-state mutability.
That was part of my idea too, I wasn't trying to address remote
processes or anything like that, I was considering doing the queues in
shared memory for its efficiency.
There are only two ways I am aware of, of implementing inter-thread(kernel
or user-space) queues(whether wrapped over as message passing or some
other abstraction or not):
- shared memory.
- serialised streams: pipes or sockets.
And the latter are just shared memory in disguise. The difference is
that the shared memory in these cases is *kernel* shared memory. Which, in
addition to the overhead of serialising & deserialising transmissions,
means that every access has to involve a ring 3 -> ring 0 -> ring 3
transition cycle. To see what that does for efficiency, take a look at
benchmarks of SysV shared memory APIs. They're horrible!
Process shared memory queues are almost trivial to implement and extremely
efficient. If (as I suggest above) they are constrained to holding
references--which are all the same size--then a simple C-style array of
memory (the size of the queue) plus two pointers to the head and tail are
all that is required. Organised as a simple ring buffer, no blocking is
required unless the head meets the tail. With judicious use of CAS, they
can even be made lock-free.
Only the code that declares the shared
data, plus any other thread it chooses to send a handle to, has any
knowledge of, and therefore access to, the shared state.
If we can overcome the limitations we have in Perl 5 shared values, I'm
entirely in agreement with the above statement (assuming closed-over
values become shared transparently).
We can! :) And I see no reason that lifting closed-overs to a global-scope
pad shouldn't be done? And if the global-scope pad is done using that
wait-free hash table implementation I linked to earlier, then another
source of locking and syncing bites the dust.
Effectively, allocating a shared entity returns a handle to the underlying
state, and only the holder of that handle can access it. Such handles
would be indirect references and only usable from the thread that creates
them. When a handle is passed as a message to another thread, it is
transformed into a handle usable by the recipient thread during the
transfer and the old handle becomes invalid. Attempts to use an old handle
after it has been sent result in a runtime exception.
This is exactly what I meant by RemoteValue, RemoteInvocation and
InvocationQueue in my original idea.
I apologise for misconstruing the meaning of "Remote" in those method
names.
daniel