Florian Klämpfl wrote:

>> Right, that's how it *should* be designed. But try to find out why a
>> dependency on the code generation is added when variables like
>> current_filepos or current_tokenpos are moved into TModule
>> (current_module) :-(

> Why should current_filepos and current_tokenpos go into TModule? They
> can perfectly well be threadvars.

Good point. So would it be sufficient to redeclare all such variables as threadvar in order to make parallel processing (threads) possible?

I have no real idea how the initialization of the threadvars would have to be implemented in such a model. That's why I try to assign all state-related variables to a definite object, whose reference can easily be copied (or moved?) into any created thread.
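
For illustration, here is a minimal runnable sketch of the threadvar idea and of exactly that initialization problem. Only the threadvar keyword and the FPC threading calls (BeginThread, WaitForThreadTerminate) are real; the record layout and all other names are invented for this example, not taken from the compiler sources:

program threadvardemo;
{$mode objfpc}
uses
  {$ifdef unix}cthreads,{$endif}
  sysutils;

type
  tfileposinfo = record
    line, column: longint;
  end;
  pfileposinfo = ^tfileposinfo;

threadvar
  current_filepos: tfileposinfo;  { every thread gets its own copy }

function worker(param: pointer): ptrint;
begin
  { a new thread starts with a zero-initialized threadvar; the parent
    state has to be handed over explicitly, e.g. via the parameter }
  current_filepos := pfileposinfo(param)^;
  writeln('worker sees line ', current_filepos.line);
  result := 0;
end;

var
  tid: TThreadID;
  startpos: tfileposinfo;
begin
  current_filepos.line := 42;
  current_filepos.column := 1;
  startpos := current_filepos;  { snapshot of the main thread's state }
  tid := BeginThread(@worker, @startpos);
  WaitForThreadTerminate(tid, 5000);
end.

So a new thread starts with zero-initialized threadvars, and the spawning code has to copy the parent thread's state explicitly, as the parameter hand-over above shows.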

We also have to draw a border between what can run in parallel and what still has to be processed sequentially. Sequential processing must be used for all output, be it binary or log files. Errors must also be reported from the originating thread to all related (main and other) threads whenever they require compilation to stop and shut down in an orderly manner.
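
As a sketch of how such a sequential bottleneck could be isolated, assuming only the standard syncobjs unit; all names here (locked_writeln, stop_compilation, ...) are invented for illustration:

unit seqoutput;
{$mode objfpc}

interface

var
  stop_compilation: boolean;  { set on fatal errors, polled by workers }

procedure locked_writeln(const msg: string);
procedure report_fatal_error(const msg: string);

implementation

uses
  syncobjs;

var
  output_lock: TCriticalSection;

procedure locked_writeln(const msg: string);
begin
  output_lock.Enter;
  try
    writeln(msg);  { only one thread writes at a time }
  finally
    output_lock.Leave;
  end;
end;

procedure report_fatal_error(const msg: string);
begin
  locked_writeln('Fatal: ' + msg);
  { a plain flag suffices for a sketch; a real implementation would
    use proper synchronization to notify all threads }
  stop_compilation := true;
end;

initialization
  output_lock := TCriticalSection.Create;

finalization
  output_lock.Free;

end.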

> Further, I don't see them fitting into TModule: they do not describe
> the state of a module but are part of the compilation state. Even
> more, consider several threads compiling different procedures of one
> module: putting current_filepos and current_tokenpos into TModule
> won't work in this case.

Right, but I see no chance for such parallelism before all related variables have been found. See my questions about just these variables, and the corresponding tfileposinfo values in several objects.

> Parallel code generation requires that the cg be separated from
> parsing, so that the next procedure can be parsed while the
> previously parsed procedures are compiled.
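
Such a pipeline could look roughly like the following runnable toy, using TThreadList from the classes unit as the hand-over queue; tprocstub merely stands in for whatever the parser would pass to the cg, and none of this is actual compiler code:

program cgpipeline;
{$mode objfpc}
uses
  {$ifdef unix}cthreads,{$endif}
  classes, sysutils;

type
  pprocstub = ^tprocstub;
  tprocstub = record
    name: string;  { stand-in for a parsed procedure }
  end;

var
  cg_queue: TThreadList;          { parsed procedures awaiting codegen }
  parsing_done: boolean = false;

function cg_worker(param: pointer): ptrint;
var
  l: TList;
  p: pprocstub;
begin
  repeat
    l := cg_queue.LockList;
    if l.Count > 0 then
    begin
      p := pprocstub(l[0]);
      l.Delete(0);
      cg_queue.UnlockList;
      writeln('generating code for ', p^.name);  { the cg step }
      dispose(p);
    end
    else
    begin
      cg_queue.UnlockList;
      if parsing_done then
        break;
      sleep(1);  { wait for the parser to produce more work }
    end;
  until false;
  result := 0;
end;

var
  tid: TThreadID;
  i: integer;
  p: pprocstub;
begin
  cg_queue := TThreadList.Create;
  tid := BeginThread(@cg_worker, nil);
  for i := 1 to 3 do  { the "parser" producing procedures }
  begin
    new(p);
    p^.name := 'proc' + IntToStr(i);
    cg_queue.Add(p);
  end;
  parsing_done := true;
  WaitForThreadTerminate(tid, 5000);
  cg_queue.Free;
end.

In a real implementation the polling loop would be replaced by an event or semaphore, and parsing_done would need proper synchronization; the toy only shows the hand-over point between parsing and cg.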

>> The last change was to remove ppudump from the Makefile, and this
>> proved that so far only ppudump is sensitive to changes in the
>> compiler internals.

> Guess why I'm sceptical that it's useful to use the compiler parser
> for other purposes like code tools or documentation: probably once
> per week a simple compiler change breaks such external usage (we had
> that experience ten years ago ;) ).

I've postponed that initial motivation until after all other refactoring. Apart from parallelism, I see more chances for introducing really new features elsewhere, such as multiple front-ends. Such projects require separating the mere parser from the rest of the infrastructure, i.e. the handling of all symbols, the creation of nodes, etc. have to be moved into new, commonly usable interfaces. After that step it would also be easy, without breaking anything in the compiler, to add a no-cpu target to the target-specific back-ends.
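
As a compilable sketch of what such commonly usable interfaces might look like (all identifiers are invented here, nothing is taken from the actual sources):

unit parserintf;
{$mode objfpc}
{$interfaces corba}

interface

type
  { what a front-end needs from the symbol infrastructure }
  isymbolhandler = interface
    function create_symbol(const aname: string): TObject;
    function lookup_symbol(const aname: string): TObject;
  end;

  { what a front-end needs from node creation }
  inodefactory = interface
    function create_node(akind: integer): TObject;
  end;

implementation

end.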

> How can we continue? I'll see if I can find time within the next week
> (I was on holiday for one week) to review the noglobals changes and
> see how we can split them into usable parts.

IMO the most important decision is about the general direction of the refactoring: do we want more OO (encapsulation), more codegen separation, or something else? IMO encapsulation is the most useful first step towards any other goal. The current compiler "structure" is dictated by purely *formal* aspects (unit dependencies) and does not reflect the *logical* dependencies between objects, variables, procedures etc. This lack of logical structure, together with the lack of up-to-date documentation, is the most annoying problem in *every* attempt to enhance the compiler.

DoDi
