[fpc-devel] External assemblers (also modular discussion about free pascal compiler)

Skybuck Flying Wed, 06 Apr 2011 13:46:37 -0700

Hello,

Perhaps some last silly questions about "external assemblers".

First I'd like to say that I saw some tutorial where the compiler itselfproduced some form of assembly output in text... probably real assembly...

So that's kinda interesting to output text this makes it flexible for othertools to then compile that...

It reduces complexity of the compiler because it doesn't actually need tofully assembly the assembly... this can be left to other tools.

So far I understand free pascal has "internal assemblers" (probably mostlyfor i386/x86 target, does it have others as well ?)

But there is also this "external assembler" thingy.... perhaps it's beingused to implement internal assemblers as well...



But my question is the following:

What exactly is being "fed" towards the "external assembler" via theclasses/api ?



(I probably should take a closer look... )

But from what I can remember I saw some kind of node structure... (maybe fplist/node and stuff like that ?)

To me it seemed like some kind of "free pascal classes/data structures"which might not be commonly supported by other languages likeC/C++/Delphi/Etc...



So just in case I am wrong I am asking the following question:

Is there perhaps some kind of "standardized intermediate form (datastructure like)" which all compilers could choose to output ? (Which I amnot aware off ?)

(Or is the external assembler thingy indeed something conceived by thepascal developers themselfes ;))

Also one last question: Is there actually any other external tool which isactually invoked/call via this mechanism ?

(If the mechanism was ment to function via text then I can imagine thatofcourse... but then the question would be: what kind of text would be fed ?(some early form of assembly which needs to be fixed up ? )

Sending assembly towards an assembler seems kind weird... because first thecompiler seems to use these nodes/data structures which are then apperentlysend towards some kind of cpu which seems to use assembly like datastructures as well (but I could be wrong) which would then be turned backinto text ????

(Why then not compile directly to binary ??? Perhaps because the binariesinvolve operating system specific structures and the CPU can actually beused for different operating systems ? (So I think that's probably theanswer here...) The compiler compiles towards a certain CPU and makes surethe assembly is suited for the CPU... but it doesn't necessarily want toknow about the operating system and thus sends it off to an operating systemspecific assembler ?)

(But then again this conflicts a bit with the RTL which does seem a bitoperating system aware... but maybe that is not related to this and the RTLis ment for input towards the compiler and perhaps output as welll but in amore file like manner...)

A guess what I described above is the roll of the linker.... The linkerlinks the assemblies and adds some operating system specific binaryheaders/structures for towards/for the final binary.



Does free pascal actually have an internal linker as well ?

Perhaps all these different aspects of a compiler are confusing things alittle bit for me/noobies.... but perhaps also for the developers itself...

Therefore I wonder if free pascal could benefit from a more modular approachwhere everything is nicely split up into modules... which somehowcommunicate with each other...

This could be executables (though would get a bit difficult via files) or(dll's but that's windows specific) or (sockets... network communicationsseem odd for this...) and finally simply compiled units which would beplaced in seperated module folders or so. (with the necessary headers...could also simply include full source as well).

Currently I wonder how "modular" free pascal is... since I am not acquintedwith the code (yet)... from the looks of it it seems to use classes here andthere... so at least it's "class-modular".... but classes can also starts to"mingle" with each other a bit... and might not form a "clean cut". Probablytempting for developers to mingle stuff up a bit... None-the-less that'sprobably easily avoidable or fixeable...

The problem remains that all source code is in pretty much one huge foldercalled: "compiler" with some "sub folders" for platforms.

At least the platforms are seperated from it... but this seems also a bitout of necessity for the search paths to not produce naming-space-conflictsand such.


The rest of the compiler code seems to be in one huge folder...

Some of the unit names are pretty short and not very descriptive...fortunately most of the unit files have a little comment in the headerexplaining what each file is.

Perhaps it would be interesting to try and split up the compiler folder intosubfolders where each subfolder would represent a "module".

For example tokenizer/lexer, parser, assemblers and what else you can thinkof which would/should be modular.

The platforms/cpu's can then be moved to their own modular sub folder whichcould be called:

"platforms" or perhaps "cpu's" or perhaps "os" or perhaps "targets" whichyou think is best.

With such a more modular approach via subfolders it could then be easier tospot where unwanted depedency might exist within the module itself...

So suppose that a certain module is to be "self standing" and "selfcompileable" and this would immediatly show up if this was not to workbecause of

search paths pointing towards unwanted folders.

At first it might seem the compiler is to much linked towards all modulesbecause ofcourse it goes from phase to phase... but that's where hookinginto modules could come into play.

It could work like this: each module gets it's own input interface (inputfrom the higher phase/modules) and it's own output interface (towards thelower modules)

Perhaps this could also be in both ways... However each module simply copiesthese interfaces so that they can be self standing...

Only the final compiler main program/executable needs to link the modules uptowards each other... perhaps test programs for the modules too. (Thesetest programs should then be in seperate folders as well could be higher/subfolders to the module itself)

One final module which seems to be in play in general theory is the "symboltable" this could also have it's own interface which would be used by allmodules as well.


So all modules would need to be hook-up to this symbol table as well.

(Think of "hooking-up" as delphi's event properties/method pointers/functionpointers stuff like that...I am pretty sure you know that already ! ;) =D)




Bye,

Skybuck.

_______________________________________________
fpc-devel maillist  -  fpc-devel@lists.freepascal.org
http://lists.freepascal.org/mailman/listinfo/fpc-devel

[fpc-devel] External assemblers (also modular discussion about free pascal compiler)

Reply via email to