On 05/07/2023 18:13, Rafał Pietrak via Gcc wrote:
Hi,
W dniu 5.07.2023 o 16:45, David Brown pisze:
On 05/07/2023 15:29, Rafał Pietrak wrote:
[---------------]
OK. I don't see a problem here, but I admit that mixing semantics
often lead to problems.
I think it also allows better generalisation and flexibility if they
are separate. You might want careful control over where something is
allocated, but the access would be using normal instructions.
Conversely, you might not be bothered about where the data is
allocated, but want control of access (maybe you want interrupts
disabled around accesses to make it atomic).
that would require compiler to know the "semantics" of such section. I
don't think you've listed it below, worth adding. If I understand you
correctly, that means the code generated varies depending on target
section selected. This is linker "talking" to compiler if I'm not mistaken.
No, it's about the access - not the allocation (or section). Access
boils down to a "read" function and a "write" function (or possibly
several, optimised for different sizes - C11 _Generic can make this
neater, though C++ handles it better).
[----------]
Let me try to list some things I think might be useful (there may be
some overlap). I am not giving any particular order here.
1. Adding a prefix to section names rather than replacing them.
OK. +1
2. Adding a suffix to section names.
+1
3. Constructing section names at compile time, rather that just using
a string literal. (String literals can be constructed using the
pre-processor, but that has its limitations.)
I'm not sure what this means? At compile time, you only have literals,
so what's missing?
The compiler knows a lot more than just literal values at compile time -
lots of things are "compile-time constants" without being literals that
can be used in string literals. That includes the value of static
"const" variables, and the results of calculations or "pure" function
calls using compile-time constant data. You can do a great deal more of
this in C++ than in C ("static const int N = 10; int arr[N];" is valid
in C++, but not in C). Calculated section names might be useful for
sections that later need to be sorted.
To be fair, you can construct string literals by the preprocessor that
would cover many cases.
I can also add that generating linker symbols from compile-time
constructed names could be useful, to use (abuse?) the linker to find
issues across different source files. Imagine you have a
microcontroller with multiple timers, and several sources that all need
to use timers. A module that uses timer 1 could define a
"using_timer_1" symbol for link time (but with no allocation to real
memory). Another module might use timer 2 and define "using_timer_2".
If a third module uses timer 1 again, then you'd get a link-time error
with two conflicting definitions of "use_timer_1" and you'd know you
have to change one of the modules.
4. Pragmas to apply section names (or prefixes or suffixes) to a block
of definitions, changing the defaults.
+1
5. Control of section flags (such as read-only, executable, etc.). At
the moment, flags are added automatically depending on what you put
into the section (code, data, read-only data). So if you want to
override these, such as to make a data section in ram that is
executable (for your JIT compiler :-) ), you need something like :
__attribute__((section("jit_buffer,\"ax\"\n@")))
I assume, that adding an attribute should split a particular section
into "an old one" and "the new one with new attribute", right?
You can't have the same section name and multiple flags. But you
sometimes want to have unusual flag combinations, such as executable ram
sections for "run from ram" functions.
One would need to have linker logic (and linker script definitions)
altered, to follow that (other features so far wouldn't require any
changes to linkers, I think).
to add the flags manually, then a newline, then a line comment
character (@ for ARM, but this varies according to target.)
6. Convenient support for non-initialised non-zeroed data sections in
a standardised way, without having to specify sections manually in the
source and linker setup.
What gain and under which circumstances you get with this? I mean, why
enforce keeping uninitialized memory fragment, while that is just a one
shot action at load time?
Very often you have buffers in your programs, which you want to have
statically allocated in ram (so they have a fixed address, perhaps
specially aligned, and so you have a full overview of your memory usage
in your map files), but you don't care about the contents at startup.
Clearing these to 0 is just a waste of processor time.
7. Convenient support for sections (or variables) placed at specific
addresses, in a standardised way.
Hmm... Frankly, I'm quite comfortable with current features of linker
script, and I do it like this:
SECTIONS
{
sfr_devices 0x40000000 (NOLOAD): {
. = ALIGN(1K); PROVIDE(TIM2 = .);
. = 0x00400; PROVIDE(TIM3 = .);
. = 0x00800; PROVIDE(TIM4 = .);
}
}
The only problem is that so far I'm not aware of command line options to
"supplement" default linker script with such fragment. Option "-T"
replaces it, which is a nuisance.
These are ugly and hard to maintain in practice - the most common way to
give fixed addresses is to use macros that cast the fixed address to
pointers to volatile objects and structs.
But sometimes it is nice to have sections at specific addresses, and it
would be a significant gain for most people if these could be defined
entirely in C (or C++), without editing linker files. Many embedded
toolchains support such features - "int reg @ 0x1234;", or similar
syntax. gcc has an "address" attribute for the AVR, but not as a common
attribute. (It is always annoying when one target has an attribute that
would be useful on other ports, but only exists on the one target.)
8. Convenient support for sections that are not allocated space by the
linker in the target memory, but where the contents are still included
in the elf file and map files, where they can be read by other tools.
(This could be used for external analysis tools.)
Isn't it so, that current debugger sections are just that?
They are, yes. But it would be useful to have non-debug user sections
that act in a similar manner - it would not confuse debuggers, and be
easier for user-written parsers to pull out of the elf or map files.
Extrapolating your words: Do you think of sections that you would have
full control on it's content at compilation, and it isn't sufficient to
do it like this:
char private[] __attribute__((section("something"))) = {
0xFF, 0x01, 0x02, ....
};
You also need control of the allocation (or lack thereof). This can be
done using sections with flags and/or linker file setup, but again it
would be good to have a standardised GCC extension for it. It is far
easier for people to use a GCC attribute than to learn about the messy
details of section flags and linker files.
9. Support for getting data from the linker to the code, such as
section sizes and start addresses, without having to manually add the
symbols to the linker file and declare extern symbols in the C or C++
code.
+1
10. Support for structs (or C++ classes) where different parts of the
struct are in different sections. This would mean the struct could
only be statically allocated (no stack allocation - just global or
static), and it would have no accessible address or size (you could
have pointers to fields, but not to the struct objects themselves).
This would let you tie together objects made of multiple parts such as
constant data in flash and writeable data in ram.
+1
11. Convenient support for building up tables where the contents are
scattered across different source files, without having to manually
edit the linker files.
do you have an example where that is useful?
You might like to have a code organisation where source files could
define structures for, say, threads. Each of these would need an entry
in a thread table holding priorities, run function pointer, etc. If
this table were built up as a single section where each thread
declaration contributed their part of it, then the global thread table
would be built at link time rather than traditional run time setup. The
advantages include a clear static measure of the number of the number of
threads (see point 9), clear memory usage, and smaller initialisation
code. (Obviously we are talking about statically defined threads here,
not dynamically defined threads.)
12. I'd supplement the list with a better control on code section names
towards something like code namespaces.
13. and data sections "growing downwords" like stack does. This is for
more flexible planning of memory layouts of micros (limited RAM). One
class of sections you put into RAM sequencialy bottom to top, the other
category from top to bottom.
Fair enough.