Re: wishlist: support for shorter pointers

David Brown via Gcc Thu, 06 Jul 2023 05:54:43 -0700

On 06/07/2023 09:00, Rafał Pietrak via Gcc wrote:

Hi,
W dniu 5.07.2023 o 19:39, David Brown pisze:
[------------------]
I'm not sure what this means? At compile time, you only haveliterals, so what's missing?
The compiler knows a lot more than just literal values at compile time- lots of things are "compile-time constants" without being literalsthat can be used in string literals. That includes the value ofstatic "const" variables, and the results of calculations or "pure"function
const --> created by a literal.

Technically in C, the only "literals" are "string literals". Somethinglike 1234 is an integer constant, not a literal. But I don't want toget too deep into such standardese - especially not for C++ !

Even in C, there are lots of things that are known at compile timewithout being literals (or explicit constants). In many situations youcan use "constant expressions", which includes basic arithmetic onconstants, enumeration constants, etc. The restrictions on what can beused in different circumstances is not always obvious (if you have"static const N = 10;", then "static const M = N + 1;" is valid but "intxs[N];" is not).

C++ has a very much wider concept of constant expressions at compiletime - many more ways to make constant expressions, and many more waysto use them. But even there, the compiler will know things at compiletime that are not syntactically constant in the language. (If you havecode in a function "if (x < 0) return; bool b = (x >= 0);" then thecompiler can optimise in the knowledge that "b" is a compile-timeconstant of "true".)

calls using compile-time constant data. You can do a great deal more of
"compile time constant data" -> literal
this in C++ than in C ("static const int N = 10; int arr[N];" is validin C++, but not in C). Calculated section names might be useful forsections that later need to be sorted.
To be fair, you can construct string literals by the preprocessor thatwould cover many cases.
OK. We are talking of convenience syntax that allows for using any"name" in c-sources as "const-literal" if only its rooted in literalsonly. That's useful.
+2. :)
I can also add that generating linker symbols from compile-timeconstructed names could be useful, to use (abuse?) the linker to findissues across different source files. Imagine you have a
+1
microcontroller with multiple timers, and several sources that allneed to use timers. A module that uses timer 1 could define a
[----------------------]
     __attribute__((section("jit_buffer,\"ax\"\n@")))
I assume, that adding an attribute should split a particular sectioninto "an old one" and "the new one with new attribute", right?
You can't have the same section name and multiple flags. But yousometimes want to have unusual flag combinations, such as executableram sections for "run from ram" functions.
section flags reflect "semantic" of the section (ro v.s. rw is differentsemantics at that level). So, how do you "merge" RAM (a section called".data"), one with "!x" flag, and the other with "x" flag?
conflicting flags of sections with the same name have to be taken intoconsideration.

It doesn't make sense to merge linker input sections with conflictingflags - this is (and should be) an error at link time. So I am notasking for a way to make a piece of ".data" section with different flagsfrom the standard ".data" section - I am asking about nicer ways to makedifferent sections with different selections of flags. (Input sectionswith different flags can be merged into one output section, as thesemantic information is lost there.)

One would need to have linker logic (and linker script definitions)altered, to follow that (other features so far wouldn't require anychanges to linkers, I think).
to add the flags manually, then a newline, then a line commentcharacter (@ for ARM, but this varies according to target.)
6. Convenient support for non-initialised non-zeroed data sectionsin a standardised way, without having to specify sections manuallyin the source and linker setup.
What gain and under which circumstances you get with this? I mean,why enforce keeping uninitialized memory fragment, while that is justa one shot action at load time?
Very often you have buffers in your programs, which you want to havestatically allocated in ram (so they have a fixed address, perhapsspecially aligned, and so you have a full overview of your memoryusage in your map files), but you don't care about the contents atstartup. Clearing these to 0 is just a waste of processor time.
At startup? Really? Personally I wouldn't care if I waste those cycles.

Usually it is not an issue, but it can be for some systems. I've seensystems where a hardware watchdog has timed out while the startup codeis clearing large buffers unnecessarily. There are also some low-powersystems that are halted until some external event triggers their reset -you want to get to the code that checks the reset source (reset pin orpower on) as fast as possible, and you want much of your data to remainpreserved over soft resets.

And maybe your buffers are allocated in external dynamic ram which isnot accessible until you have configured the ram controller - andthereafter it is accessible as normal ram. For one project I have atthe moment, the chip's on-chip ram blocks can be allocated individuallyto data tightly coupled memory, instruction tightly coupled memory, orgeneral-purpose ram - all at different addresses in the memory map. Youdo not want anything cleared until the blocks have been re-mapped fromtheir default settings to their final settings.

And having that explicitly "vocalized" in sources, I think it'll justmake them harder to read by a maintainer.

It is even harder to read if it is not explicit in the C sources, butonly in the linker files!

Otherwise, from my personal experience, it may or may not be desirable.
7. Convenient support for sections (or variables) placed at specificaddresses, in a standardised way.
Hmm... Frankly, I'm quite comfortable with current features of linkerscript, and I do it like this:
SECTIONS
{
     sfr_devices 0x40000000 (NOLOAD): {
         . = ALIGN(1K);    PROVIDE(TIM2 =    .);
         . = 0x00400;    PROVIDE(TIM3 =    .);
         . = 0x00800;    PROVIDE(TIM4 =    .);
     }
}
The only problem is that so far I'm not aware of command line optionsto "supplement" default linker script with such fragment. Option "-T"replaces it, which is a nuisance.
These are ugly and hard to maintain in practice - the most common wayto give fixed addresses is to use macros that cast the fixed addressto pointers to volatile objects and structs.
Yes, I know that macros are traditionally used here, but personally Ithink using them is just hideous. I'm using the above sectiondefinitions for years and they keep my c-sources nice and clean. And (inparticular with stm32) if I change the target device, I just change thelinker script and don't usually have to change the sources. That'sreally nice. It's like efortless porting.
Having said that. I'm opened to suggestion how to get this better - likehaving a compiler "talk to linker" about those locations.

There are always more than one way to do these things. But I believemost programmers prefer to stick to the C (and/or C++) source files, andavoid anything involving linker files or assembly files. We are lookingfor ideas that could suit a wide range of people, not just you or Ipersonally :-)

But sometimes it is nice to have sections at specific addresses, andit would be a significant gain for most people if these could bedefined entirely in C (or C++), without editing linker files. Manyembedded toolchains support such features - "int reg @ 0x1234;", orsimilar syntax. gcc has an "address" attribute for the AVR, but notas a common attribute. (It is always annoying when one target has anattribute that would be useful on other ports, but only exists on theone target.)
Yes, I know that. Then again (personally) I do prefer to be able to tellthe compiler "-mcpu=atmega128" ... and so have it select appropriatelinker script, while NOT changing my sources, then do it the other wayaround.
[----------------]
Extrapolating your words: Do you think of sections that you wouldhave full control on it's content at compilation, and it isn'tsufficient to do it like this:
char private[] __attribute__((section("something"))) = {
  0xFF, 0x01, 0x02, ....
};
You also need control of the allocation (or lack thereof). This canbe done using sections with flags and/or linker file setup, but againit would be good to have a standardised GCC extension for it. It isfar easier for people to use a GCC attribute than to learn about themessy details of section flags and linker files.
OK. But IMHO, should you move the functionality from linker to GCC, thenall the "mess" just get transferred upstairs. And to know the linker isa must if you do a bare-metal programming anyway.


I like having my messes in one place, rather than scattered around :-)

Still, standardization is good, good, good. But how to you standardizesomething "private" by definition?

You have to pick the right level of standardisation. I don't believeany of this should be at the level of the C standards, for example. ButI think it should be possible to get a generalisation within GCC, sothat it is "standard" across all targets rather than havingtarget-specific attributes or extensions like named address spaces.It's fine for GCC to say that this feature is only guaranteed to workfor binutils gas and ld, or compatible assemblers and linkers, with elfoutputs. That gives you a "standard" for most use-cases.

[------------]
11. Convenient support for building up tables where the contents arescattered across different source files, without having to manuallyedit the linker files.
do you have an example where that is useful?
You might like to have a code organisation where source files coulddefine structures for, say, threads. Each of these would need anentry in a thread table holding priorities, run function pointer,etc. If this table were built up as a single section where eachthread declaration contributed their part of it, then the globalthread table would be built at link time rather than traditional runtime setup. The advantages include a clear static measure of thenumber of the number of threads (see point 9), clear memory usage, andsmaller initialisation code. (Obviously we are talking aboutstatically defined threads here, not dynamically defined threads.)
I still don' get it. (pt.9 - sizes/locations of sections available tocompiler? relevant to this?)
Then again. I wouldn't aspire to understand everything. If that'suseful, let it be.
But I'd object to call this constructs "a table". A programmer shouldhave control of how compiler interprets his/her words. "table" has avery well defined semantics and to have it the way you propose ... it'dbe better to have a different name/syntax for those other objects.

I don't think "table" /does/ have well defined semantics. But I dothink this would be a table!

When you use C++, you already get a table like this for globalconstructors and other initialisation code. Sometimes theinitialisation for a variable - especially class objects where there isa non-trivial constructor - requires some code to be run. Whencompiling a C++ file, every time the compiler needs to run someinitialisation code, it generates a little function, and then makes a".ctors.xxx" section containing a pointer to that function. In thelinker, there is a section like this:


            . = ALIGN(4);
            KEEP (*crtbegin.o(.ctors))
            KEEP (*(EXCLUDE_FILE (*crtend.o) .ctors))
            KEEP (*(SORT(.ctors.*)))
            KEEP (*crtend.o(.ctors))

The ".ctors" section in crtbegin.o defines a "start of constructorstable" symbol, and the matching section in ctrend.o has the end symbol.Linking collects all these constructor pointers into a table, and theC++ start up code can run through the table calling all the functions inorder.

I want to be able to do something similar, with a convenient syntax, butwith my own choice of tables and contents.

Re: wishlist: support for shorter pointers

Reply via email to