On Mar 21, 2009, at 5:28 PM, Michael Ekstrand wrote:
Joel Reymont <joe...@gmail.com> writes:
On Mar 21, 2009, at 1:38 PM, Jon Harrop wrote:
[...] You will succumb to ocamlopt's current run-time representation, which is objectively inefficient (e.g. boxing floats, tuples, records) and was only chosen because the compiler lacks capabilities that LLVM already provides for you (primarily JIT compilation).
This is probably a stupid suggestion, but why not have OCaml directly generate machine code, without the use of an assembler and linker?
This won't help with anything -- why would it? How is this suggestion relevant to the current discussion?
Because that would duplicate the code and logic provided by the system's assembler and linker (especially the linker), for every platform (and there are many possible combinations!).
The only problem is that the usual notion of a "linker" is somewhat broken, even if what we're after is an embedded platform where all of the linking is done before the code hits the target (no run-time linking!).
I will show a trivial example where it fails badly. The example is in C.
Suppose you have two platform-specific registers used to set the DMA address. The platform has 12-bit addresses.

#define DMAL (*((volatile unsigned char*)0xFFA))
#define DMAH (*((volatile unsigned char*)0xFF0))

DMAL takes the whole least significant byte of the address. DMAH takes the most significant nibble (bits 11:8) of the address, and the nibble must be left-aligned (i.e. occupy bits 7:4 of DMAH).
Now, in your code, you want to point the DMA at a static buffer. Thusly:

void foo(void)
{
    static char buffer[128];
    DMAL = (unsigned char)&buffer;                 /* low byte of the address */
    DMAH = (((unsigned int)&buffer) >> 4) & 0xF0;  /* bits 11:8, left-aligned */
    ...
}
Now, while all of the addresses are known constants, there's usually no way, in the object file, to tell the linker the expression for the value of DMAH! Thus, instead of what amounts to two "load immediate" instructions, you get one immediate load, followed by a lot of brouhaha to shift and mask what amounts to constants known at compile/link time. That's what's usually called premature pessimization.
That's one issue with contemporary compile/assemble/link systems. Never mind that even if the assemblers supported such "elaborate" expressions over link-time constants, the compilers don't generate them anyway!
So, writing the code in assembly won't help you! It's only at link time that you know where buffer[] will end up... You can of course hack around it and put the buffer at a fixed address -- some C implementations even have special ways of doing that (say, via gcc's __attribute__ mechanism). That will backfire as soon as you have to interface more pieces of code: you'll spend your time moving stuff around just to keep the memory regions from overlapping -- and that's the linker's job, really.
Heck -- many, many assemblers will silently generate utterly wrong code for the load into DMAH, *if* you code this in assembly rather than C!! I've got at least a dozen production, shipping assemblers that silently trip and fall on code like the above. Of course, they only fail if you code it in assembly, as the C compiler won't even attempt such, um, "trickery". Silly stuff, really, requiring no advanced optimization theory -- just doing one's darn job well...
You have a choice: either put some ASTs into the object file whenever expressions involving link-time constants appear, or get rid of the whole compile-assemble-link separation and bring everything into one place. The latter, incidentally, is what I ended up doing in my godawful LISP-on-its-way-to-ML platform for Z8 Encore! and SX48.
This would be, "of course", taken care of by a JIT: it would figure out that a whole lot of nothing is done on constant memory addresses, and would replace all the operations with a single final load. But on a platform where the code is statically linked on the host, there's no need for any of that, nor for a JIT. This applies to a whole lot of hard-realtime systems, where a lot of reasoning can be made trivial by using only preallocated memory and not doing any runtime memory allocation (or at least limiting it well).
If you use the existing linker, then you can depend on the expertise of the authors for each system getting all the logic right for loading libraries (which may be arbitrary libraries, when you're using C extensions) and producing a binary in the correct format for that system.
The "logic" present in many linkers is either pretty trivial, or an ugly hack compensating for the lack of expressiveness in object file records. Then you have link-time optimizations, which are really trivial to do in a whole-project compiler but require a lot of extra effort in a linker, etc.
Heck, many linkers use ad-hoc, horrible, quadratic-or-worse-time algorithms that backfire severely once the project gets sufficiently big. Just follow the evolution of GNU ld in the face of C++. A farce in multiple acts, at least.
Cheers, Kuba
_______________________________________________
Caml-list mailing list. Subscription management:
http://yquem.inria.fr/cgi-bin/mailman/listinfo/caml-list
Archives: http://caml.inria.fr
Beginner's list: http://groups.yahoo.com/group/ocaml_beginners
Bug reports: http://caml.inria.fr/bin/caml-bugs