Re: [PATCH 3/3] kallsyms: delta-compress lineinfo tables for ~2.7x size reduction

Sasha Levin Wed, 11 Mar 2026 07:53:07 -0700

Thanks for the review!

On Wed, Mar 11, 2026 at 11:34:24AM +0800, Vivian Wang wrote:

Hi Sasha,


I've been trying this out and AFAICT this does work perfectly. Thank you
for this.

There are a few oddities I found:

Firstly I've been building with something like O=_riscv out of
convenience, and the file names have an extra ../ in the front. (This is
just me exiting out of init=/bin/sh.)

[    2.317268] Kernel panic - not syncing: Attempted to kill init! 
exitcode=0x00000000
[    2.320283] CPU: 0 UID: 0 PID: 1 Comm: sh Not tainted 
7.0.0-rc3-00004-g8ad18f1a1a2f #31 PREEMPTLAZY
[    2.322048] Hardware name: riscv-virtio,qemu (DT)
[    2.323220] Call Trace:
[    2.324465] [<ffffffff800172a8>] dump_backtrace+0x1c/0x24 
(../arch/riscv/kernel/stacktrace.c:150)
[    2.329061] [<ffffffff8000241e>] show_stack+0x2a/0x34 
(../arch/riscv/kernel/stacktrace.c:156)
[    2.330334] [<ffffffff8000fe32>] dump_stack_lvl+0x4a/0x68 
(../lib/dump_stack.c:122)
[    2.331462] [<ffffffff8000fe64>] dump_stack+0x14/0x1c 
(../lib/dump_stack.c:130)
[    2.332571] [<ffffffff80002a88>] vpanic+0x108/0x2bc (../kernel/panic.c:651)
[    2.333674] [<ffffffff80002c6e>] panic+0x32/0x34 (../kernel/panic.c:787)
[    2.334427] [<ffffffff8002e97a>] do_exit+0x7ee/0x7f4 (../kernel/exit.c:930)
[    2.335194] [<ffffffff8002eade>] do_group_exit+0x1a/0x88 
(../kernel/exit.c:1099)
[    2.335945] [<ffffffff8002eb62>] __riscv_sys_exit_group+0x16/0x18 
(../kernel/exit.c:1129)
[    2.336763] [<ffffffff80b3e868>] do_trap_ecall_u+0x260/0x45c 
(../arch/riscv/include/asm/syscall.h:112)
[    2.337765] [<ffffffff80b4c034>] handle_exception+0x168/0x174 
(../arch/riscv/kernel/entry.S:233)

This is fine by me, but I've seen mentions of O= builds but I'm not sure
if it's expected.


Could you try v2 and see if it makes it prettier? I tried to tackle this :)

Also, toggling CONFIG_KALLSYMS_LINEINFO seems to rebuild every single
file. I haven't debugged why, but is this expected?


I think that this is because we increase KSYM_SYMBOL_LEN when lineinfo is
enabled. I suppose we can just increase the size irregardless of whether
lineinfo is enabled and ignore the waste?

Or, folks really won't be toggling this option too often for the rebuilds to
matter too much, so we can just enjoy the savings?

I have a few ideas about the code as well. Since this patch 3 touches
most of the files involved, I'll just dump my thoughts on the whole
series here. I want to note that I haven't read the RFC thread too
carefully, but I don't think there were many comments on the implementation.

On 3/4/26 02:21, Sasha Levin wrote:

Replace the flat uncompressed parallel arrays (lineinfo_addrs[],
lineinfo_file_ids[], lineinfo_lines[]) with a block-indexed,
delta-encoded, ULEB128 varint compressed format.

The sorted address array has small deltas between consecutive entries
(typically 1-50 bytes), file IDs have high locality (delta often 0,
same file), and line numbers change slowly.  Delta-encoding followed
by ULEB128 varint compression shrinks most values from 4 bytes to 1.

Entries are grouped into blocks of 64.  A small uncompressed block
index (first addr + byte offset per block) enables O(log(N/64)) binary
search, followed by sequential decode of at most 64 varints within the
matching block.  All decode state lives on the stack -- zero
allocations, still safe for NMI/panic context.

Measured on a defconfig+debug x86_64 build (3,017,154 entries, 4,822
source files, 47,144 blocks):

  Before (flat arrays):
    lineinfo_addrs[]    12,068,616 bytes (u32 x 3.0M)
    lineinfo_file_ids[]  6,034,308 bytes (u16 x 3.0M)
    lineinfo_lines[]    12,068,616 bytes (u32 x 3.0M)
    Total:              30,171,540 bytes (28.8 MiB, 10.0 bytes/entry)

  After (block-indexed delta + ULEB128):
    lineinfo_block_addrs[]    188,576 bytes (184 KiB)
    lineinfo_block_offsets[]  188,576 bytes (184 KiB)
    lineinfo_data[]        10,926,128 bytes (10.4 MiB)
    Total:                 11,303,280 bytes (10.8 MiB, 3.7 bytes/entry)

  Savings: 18.0 MiB (2.7x reduction)

Booted in QEMU and verified with SysRq-l that annotations still work:

  default_idle+0x9/0x10 (arch/x86/kernel/process.c:767)
  default_idle_call+0x6c/0xb0 (kernel/sched/idle.c:122)
  do_idle+0x335/0x490 (kernel/sched/idle.c:191)
  cpu_startup_entry+0x4e/0x60 (kernel/sched/idle.c:429)
  rest_init+0x1aa/0x1b0 (init/main.c:760)

Suggested-by: Juergen Gross <[email protected]>
Assisted-by: Claude:claude-opus-4-6
Signed-off-by: Sasha Levin <[email protected]>
---
 .../admin-guide/kallsyms-lineinfo.rst         |   7 +-
 include/linux/mod_lineinfo.h                  | 103 ++++++++--
 init/Kconfig                                  |   8 +-
 kernel/kallsyms.c                             |  91 +++++++--
 kernel/kallsyms_internal.h                    |   7 +-
 kernel/module/kallsyms.c                      | 107 +++++++---
 scripts/gen_lineinfo.c                        | 192 ++++++++++++++----
 scripts/kallsyms.c                            |   7 +-
 scripts/link-vmlinux.sh                       |  16 +-
 9 files changed, 423 insertions(+), 115 deletions(-)

diff --git a/Documentation/admin-guide/kallsyms-lineinfo.rst 
b/Documentation/admin-guide/kallsyms-lineinfo.rst
index 21450569d5324..fe92c5dde16b3 100644
--- a/Documentation/admin-guide/kallsyms-lineinfo.rst
+++ b/Documentation/admin-guide/kallsyms-lineinfo.rst
@@ -76,10 +76,11 @@ Memory Overhead
 ===============

 The vmlinux lineinfo tables are stored in ``.rodata`` and typically add
-approximately 44 MiB to the kernel image for a standard configuration
-(~4.6 million DWARF line entries, ~10 bytes per entry after deduplication).
+approximately 10-15 MiB to the kernel image for a standard configuration
+(~4.6 million DWARF line entries, ~2-3 bytes per entry after delta
+compression).

-Per-module lineinfo adds approximately 10 bytes per DWARF line entry to each
+Per-module lineinfo adds approximately 2-3 bytes per DWARF line entry to each
 ``.ko`` file.


Maybe this could be given in terms of percentages? It wasn't obvious to
me what 10-15 MiB amounts to.

On riscv64, I'm seeing a 24.2 MiB to 30.2 MiB increase in
arch/riscv/boot/Image size on an approximately defconfig+mod2noconfig
build, which is about a 25% increase. I haven't checked yet, but if 25%
is similar to what other archs get, that's a more useful figure than
10-15 MiB, given that the size increase is correlated to the total
amount of code linked into the kernel/module.


I ended up giving an example instead of percentages because it seemed to vary
wildly between configs and archs. For example, defconfig on x86 yields a 15%
increase compared to the 25% you see with your config on riscv, compared to a
39% increase with defconfig on riscv.

 Known Limitations
diff --git a/include/linux/mod_lineinfo.h b/include/linux/mod_lineinfo.h
index d62e9608f0f82..ab758acfadceb 100644
--- a/include/linux/mod_lineinfo.h
+++ b/include/linux/mod_lineinfo.h
@@ -8,13 +8,19 @@
  *
  * Section layout (all values in target-native endianness):
  *
- *   struct mod_lineinfo_header     (16 bytes)
- *   u32 addrs[num_entries]         -- offsets from .text base, sorted
- *   u16 file_ids[num_entries]      -- parallel to addrs
- *   <2-byte pad if num_entries is odd>
- *   u32 lines[num_entries]         -- parallel to addrs
+ *   struct mod_lineinfo_header     (24 bytes)
+ *   u32 block_addrs[num_blocks]    -- first addr per block, for binary search
+ *   u32 block_offsets[num_blocks]  -- byte offset into compressed data stream
+ *   u8  data[data_size]            -- ULEB128 delta-compressed entries
  *   u32 file_offsets[num_files]    -- byte offset into filenames[]
  *   char filenames[filenames_size] -- concatenated NUL-terminated strings
+ *
+ * Compressed stream format (per block of LINEINFO_BLOCK_ENTRIES entries):
+ *   Entry 0: file_id (ULEB128), line (ULEB128)
+ *            addr is in block_addrs[]
+ *   Entry 1..N: addr_delta (ULEB128),
+ *               file_id_delta (zigzag-encoded ULEB128),
+ *               line_delta (zigzag-encoded ULEB128)
  */
 #ifndef _LINUX_MOD_LINEINFO_H
 #define _LINUX_MOD_LINEINFO_H
@@ -25,44 +31,107 @@
 #include <stdint.h>
 typedef uint32_t u32;
 typedef uint16_t u16;
+typedef uint8_t  u8;
 #endif

+#define LINEINFO_BLOCK_ENTRIES 64
+
 struct mod_lineinfo_header {
        u32 num_entries;
        u32 num_files;
        u32 filenames_size;     /* total bytes of concatenated filenames */
+       u32 num_blocks;
+       u32 data_size;          /* total bytes of compressed data stream */
        u32 reserved;           /* padding, must be 0 */
 };

 /* Offset helpers: compute byte offset from start of section to each array */

-static inline u32 mod_lineinfo_addrs_off(void)
+static inline u32 mod_lineinfo_block_addrs_off(void)
 {
        return sizeof(struct mod_lineinfo_header);
 }

-static inline u32 mod_lineinfo_file_ids_off(u32 num_entries)
+static inline u32 mod_lineinfo_block_offsets_off(u32 num_blocks)
 {
-       return mod_lineinfo_addrs_off() + num_entries * sizeof(u32);
+       return mod_lineinfo_block_addrs_off() + num_blocks * sizeof(u32);
 }

-static inline u32 mod_lineinfo_lines_off(u32 num_entries)
+static inline u32 mod_lineinfo_data_off(u32 num_blocks)
 {
-       /* u16 file_ids[] may need 2-byte padding to align lines[] to 4 bytes */
-       u32 off = mod_lineinfo_file_ids_off(num_entries) +
-                 num_entries * sizeof(u16);
-       return (off + 3) & ~3u;
+       return mod_lineinfo_block_offsets_off(num_blocks) +
+              num_blocks * sizeof(u32);
 }

-static inline u32 mod_lineinfo_file_offsets_off(u32 num_entries)
+static inline u32 mod_lineinfo_file_offsets_off(u32 num_blocks, u32 data_size)
 {
-       return mod_lineinfo_lines_off(num_entries) + num_entries * sizeof(u32);
+       return mod_lineinfo_data_off(num_blocks) + data_size;
 }

-static inline u32 mod_lineinfo_filenames_off(u32 num_entries, u32 num_files)
+static inline u32 mod_lineinfo_filenames_off(u32 num_blocks, u32 data_size,
+                                            u32 num_files)
 {
-       return mod_lineinfo_file_offsets_off(num_entries) +
+       return mod_lineinfo_file_offsets_off(num_blocks, data_size) +
               num_files * sizeof(u32);
 }


I wonder if these headers could use a slightly simpler representation,
with each part represented with its offset from header start and total
size in bytes, a bit like flattened devicetrees. So like, blocks_offset,
blocks_size, files_offset, files_size...

This would make the assembly generation below more readable, and IMO
make understanding offset and array bound calculations way simpler, at
the cost of a few extra words in the header.


Makes sense

(Re: array bounds, I know there are easier ways to break the kernel
intentionally if you're writing kernel code. but these things that run
in the "ouch something bad happened" cases really should be a bit more
defensive against possibly bad data, esp. in dealing with loadable
modules. I haven't looked closely to the in-kernel lookup code, but I
don't see much sanity checks against lineinfo data? I *think* for
badly-sorted binary search just spits out a nonsensical offset, but I
really don't want to find out what happens to the whole
binary-then-linear search code with negative sizes or out-of-bounds
offsets or something like that.)

+/* Zigzag encoding: map signed to unsigned so small magnitudes are small */
+static inline u32 zigzag_encode(int32_t v)
+{
+       return ((u32)v << 1) ^ (u32)(v >> 31);
+}
+
+static inline int32_t zigzag_decode(u32 v)
+{
+       return (int32_t)((v >> 1) ^ -(v & 1));
+}
+
+/*
+ * Read a ULEB128 varint from a byte stream.
+ * Returns the decoded value and advances *pos past the encoded bytes.
+ * If *pos would exceed 'end', returns 0 and sets *pos = end (safe for
+ * NMI/panic context -- no crash, just a missed annotation).


What does that last bit mean...?


This goes back to your previous point about correctness and checks in the
lineinfo code :)

It just means that this function never faults or allocates. On bad input it
returns 0, so the worst case is a missing annotation, not a crash.

+ */
+static inline u32 lineinfo_read_uleb128(const u8 *data, u32 *pos, u32 end)
+{
+       u32 result = 0;
+       unsigned int shift = 0;
+
+       while (*pos < end) {
+               u8 byte = data[*pos];
+               (*pos)++;
+               result |= (u32)(byte & 0x7f) << shift;
+               if (!(byte & 0x80))
+                       return result;
+               shift += 7;
+               if (shift >= 32) {
+                       /* Malformed -- skip remaining continuation bytes */
+                       while (*pos < end && (data[*pos] & 0x80))
+                               (*pos)++;
+                       if (*pos < end)
+                               (*pos)++;
+                       return result;
+               }
+       }
+       return result;
+}
+
+/* Write a ULEB128 varint -- build tool only */
+#ifndef __KERNEL__
+static inline unsigned int lineinfo_write_uleb128(u8 *buf, u32 value)
+{
+       unsigned int len = 0;
+
+       do {
+               u8 byte = value & 0x7f;
+
+               value >>= 7;
+               if (value)
+                       byte |= 0x80;
+               buf[len++] = byte;
+       } while (value);
+       return len;
+}
+#endif /* !__KERNEL__ */
+
 #endif /* _LINUX_MOD_LINEINFO_H */
diff --git a/init/Kconfig b/init/Kconfig
index bf53275bc405a..6e3795b3dbd62 100644
--- a/init/Kconfig
+++ b/init/Kconfig
@@ -2065,8 +2065,9 @@ config KALLSYMS_LINEINFO
            anon_vma_clone+0x2ed/0xcf0 (mm/rmap.c:412)

          This requires elfutils (libdw-dev/elfutils-devel) on the build host.
-         Adds approximately 44MB to a typical kernel image (10 bytes per
-         DWARF line-table entry, ~4.6M entries for a typical config).
+         Adds approximately 10-15MB to a typical kernel image (~2-3 bytes
+         per entry after delta compression, ~4.6M entries for a typical
+         config).

          If unsure, say N.

@@ -2079,7 +2080,8 @@ config KALLSYMS_LINEINFO_MODULES
          so stack traces from module code include (file.c:123) annotations.

          Requires elfutils (libdw-dev/elfutils-devel) on the build host.
-         Increases .ko sizes by approximately 10 bytes per DWARF line entry.
+         Increases .ko sizes by approximately 2-3 bytes per DWARF line
+         entry after delta compression.

          If unsure, say N.


(Same as above, maybe use percentages when talking about sizes?)

diff --git a/kernel/kallsyms.c b/kernel/kallsyms.c
index cea74992e5427..de4aa8fcfd69d 100644
--- a/kernel/kallsyms.c
+++ b/kernel/kallsyms.c
@@ -468,14 +468,20 @@ static int append_buildid(char *buffer,   const char 
*modname,
 #endif /* CONFIG_STACKTRACE_BUILD_ID */

 #ifdef CONFIG_KALLSYMS_LINEINFO
+#include <linux/mod_lineinfo.h>
+
 bool kallsyms_lookup_lineinfo(unsigned long addr, unsigned long sym_start,
                              const char **file, unsigned int *line)
 {


[...]

diff --git a/kernel/kallsyms_internal.h b/kernel/kallsyms_internal.h
index 868a1d5035212..691be44440395 100644
--- a/kernel/kallsyms_internal.h
+++ b/kernel/kallsyms_internal.h
@@ -17,10 +17,11 @@ extern const u8 kallsyms_seqs_of_names[];

 #ifdef CONFIG_KALLSYMS_LINEINFO
 extern const u32 lineinfo_num_entries;
-extern const u32 lineinfo_addrs[];
-extern const u16 lineinfo_file_ids[];
-extern const u32 lineinfo_lines[];
 extern const u32 lineinfo_num_files;
+extern const u32 lineinfo_num_blocks;
+extern const u32 lineinfo_block_addrs[];
+extern const u32 lineinfo_block_offsets[];
+extern const u8  lineinfo_data[];
 extern const u32 lineinfo_file_offsets[];
 extern const char lineinfo_filenames[];
 #endif
diff --git a/kernel/module/kallsyms.c b/kernel/module/kallsyms.c
index 7af414bd65e79..0ead1bb69de4e 100644
--- a/kernel/module/kallsyms.c
+++ b/kernel/module/kallsyms.c
@@ -512,15 +512,19 @@ bool module_lookup_lineinfo(struct module *mod, unsigned 
long addr,
 {


This and kallsyms_lookup_lineinfo() above look like almost exactly the
same code twice. Some refactoring would be nice, just so that others
don't have to read the same code twice, or worse, change the same code
twice later on.


Makes sense

[...]

diff --git a/scripts/gen_lineinfo.c b/scripts/gen_lineinfo.c
index 609de59f47ffd..9507ed9bcbe55 100644
--- a/scripts/gen_lineinfo.c
+++ b/scripts/gen_lineinfo.c
@@ -8,6 +8,9 @@
  * file containing sorted lookup tables that the kernel uses to annotate
  * stack traces with source file:line information.
  *
+ * The output uses a block-indexed, delta-encoded, ULEB128-compressed format
+ * for ~3-4x size reduction compared to flat arrays.
+ *
  * Requires libdw from elfutils.
  */

@@ -53,6 +56,15 @@ static struct file_entry *files;
 static unsigned int num_files;
 static unsigned int files_capacity;

+/* Compressed output */
+static unsigned char *compressed_data;
+static unsigned int compressed_size;
+static unsigned int compressed_capacity;
+
+static unsigned int *block_addrs;
+static unsigned int *block_offsets;
+static unsigned int num_blocks;
+
 #define FILE_HASH_BITS 13
 #define FILE_HASH_SIZE (1 << FILE_HASH_BITS)

@@ -352,6 +364,93 @@ static void deduplicate(void)
        num_entries = j + 1;
 }

+static void compressed_ensure(unsigned int need)
+{
+       if (compressed_size + need <= compressed_capacity)
+               return;
+       compressed_capacity = compressed_capacity ? compressed_capacity * 2 : 
1024 * 1024;
+       while (compressed_capacity < compressed_size + need)
+               compressed_capacity *= 2;
+       compressed_data = realloc(compressed_data, compressed_capacity);
+       if (!compressed_data) {
+               fprintf(stderr, "out of memory\n");
+               exit(1);
+       }
+}
+
+static void compress_entries(void)
+{
+       unsigned int i, block;
+
+       if (num_entries == 0) {
+               num_blocks = 0;
+               return;
+       }
+
+       num_blocks = (num_entries + LINEINFO_BLOCK_ENTRIES - 1) / 
LINEINFO_BLOCK_ENTRIES;
+       block_addrs = calloc(num_blocks, sizeof(*block_addrs));
+       block_offsets = calloc(num_blocks, sizeof(*block_offsets));
+       if (!block_addrs || !block_offsets) {
+               fprintf(stderr, "out of memory\n");
+               exit(1);
+       }
+
+       for (block = 0; block < num_blocks; block++) {
+               unsigned int base = block * LINEINFO_BLOCK_ENTRIES;
+               unsigned int count = num_entries - base;
+               unsigned int prev_addr, prev_file_id, prev_line;
+               unsigned char buf[10]; /* max 5 bytes per ULEB128 */
+
+               if (count > LINEINFO_BLOCK_ENTRIES)
+                       count = LINEINFO_BLOCK_ENTRIES;
+
+               block_addrs[block] = entries[base].offset;
+               block_offsets[block] = compressed_size;
+
+               /* Entry 0: file_id (ULEB128), line (ULEB128) */
+               compressed_ensure(20);
+               compressed_size += lineinfo_write_uleb128(
+                       compressed_data + compressed_size,
+                       entries[base].file_id);
+               compressed_size += lineinfo_write_uleb128(
+                       compressed_data + compressed_size,
+                       entries[base].line);
+
+               prev_addr = entries[base].offset;
+               prev_file_id = entries[base].file_id;
+               prev_line = entries[base].line;
+
+               /* Entries 1..N: deltas */
+               for (i = 1; i < count; i++) {
+                       unsigned int idx = base + i;
+                       unsigned int addr_delta;
+                       int32_t file_delta, line_delta;
+                       unsigned int n;
+
+                       addr_delta = entries[idx].offset - prev_addr;
+                       file_delta = (int32_t)entries[idx].file_id - 
(int32_t)prev_file_id;
+                       line_delta = (int32_t)entries[idx].line - 
(int32_t)prev_line;
+
+                       compressed_ensure(15);
+                       n = lineinfo_write_uleb128(buf, addr_delta);
+                       memcpy(compressed_data + compressed_size, buf, n);
+                       compressed_size += n;
+
+                       n = lineinfo_write_uleb128(buf, 
zigzag_encode(file_delta));
+                       memcpy(compressed_data + compressed_size, buf, n);
+                       compressed_size += n;
+
+                       n = lineinfo_write_uleb128(buf, 
zigzag_encode(line_delta));
+                       memcpy(compressed_data + compressed_size, buf, n);
+                       compressed_size += n;
+
+                       prev_addr = entries[idx].offset;
+                       prev_file_id = entries[idx].file_id;
+                       prev_line = entries[idx].line;
+               }
+       }
+}
+
 static void compute_file_offsets(void)
 {
        unsigned int offset = 0;
@@ -395,28 +494,40 @@ static void output_assembly(void)
        printf("lineinfo_num_files:\n");
        printf("\t.long %u\n\n", num_files);

-       /* Sorted address offsets from _text */
-       printf("\t.globl lineinfo_addrs\n");
+       /* Number of blocks */
+       printf("\t.globl lineinfo_num_blocks\n");
        printf("\t.balign 4\n");
-       printf("lineinfo_addrs:\n");
-       for (unsigned int i = 0; i < num_entries; i++)
-               printf("\t.long 0x%x\n", entries[i].offset);
-       printf("\n");
+       printf("lineinfo_num_blocks:\n");
+       printf("\t.long %u\n\n", num_blocks);

-       /* File IDs, parallel to addrs (u16 -- supports up to 65535 files) */
-       printf("\t.globl lineinfo_file_ids\n");
-       printf("\t.balign 2\n");
-       printf("lineinfo_file_ids:\n");
-       for (unsigned int i = 0; i < num_entries; i++)
-               printf("\t.short %u\n", entries[i].file_id);
+       /* Block first-addresses for binary search */
+       printf("\t.globl lineinfo_block_addrs\n");
+       printf("\t.balign 4\n");
+       printf("lineinfo_block_addrs:\n");
+       for (unsigned int i = 0; i < num_blocks; i++)
+               printf("\t.long 0x%x\n", block_addrs[i]);
        printf("\n");

-       /* Line numbers, parallel to addrs */
-       printf("\t.globl lineinfo_lines\n");
+       /* Block byte offsets into compressed stream */
+       printf("\t.globl lineinfo_block_offsets\n");
        printf("\t.balign 4\n");
-       printf("lineinfo_lines:\n");
-       for (unsigned int i = 0; i < num_entries; i++)
-               printf("\t.long %u\n", entries[i].line);
+       printf("lineinfo_block_offsets:\n");
+       for (unsigned int i = 0; i < num_blocks; i++)
+               printf("\t.long %u\n", block_offsets[i]);
+       printf("\n");
+
+       /* Compressed data stream */
+       printf("\t.globl lineinfo_data\n");
+       printf("lineinfo_data:\n");
+       for (unsigned int i = 0; i < compressed_size; i++) {
+               if ((i % 16) == 0)
+                       printf("\t.byte ");
+               else
+                       printf(",");
+               printf("0x%02x", compressed_data[i]);
+               if ((i % 16) == 15 || i == compressed_size - 1)
+                       printf("\n");
+       }
        printf("\n");


Note how compute_file_offsets() gives symbol names to the data it
generates. Meanwhile...

        /* File string offset table */
@@ -450,33 +561,38 @@ static void output_module_assembly(void)

        printf("\t.section .mod_lineinfo, \"a\"\n\n");

-       /* Header: num_entries, num_files, filenames_size, reserved */
+       /* Header: num_entries, num_files, filenames_size, num_blocks, 
data_size, reserved */
        printf("\t.balign 4\n");
        printf("\t.long %u\n", num_entries);
        printf("\t.long %u\n", num_files);
        printf("\t.long %u\n", filenames_size);
+       printf("\t.long %u\n", num_blocks);
+       printf("\t.long %u\n", compressed_size);
        printf("\t.long 0\n\n");

-       /* addrs[] */
-       for (unsigned int i = 0; i < num_entries; i++)
-               printf("\t.long 0x%x\n", entries[i].offset);
-       if (num_entries)
+       /* block_addrs[] */
+       for (unsigned int i = 0; i < num_blocks; i++)
+               printf("\t.long 0x%x\n", block_addrs[i]);
+       if (num_blocks)
                printf("\n");

For the modules, it's comments in the C code that doesn't end up in the
assembly.

I'm thinking we could have something like:

        printf(".Lmod_lineinfo_block_addrs:\n")
        for (unsigned int i = 0; i < num_entries; i++)
                printf("\t.long 0x%x\n", ...);
        printf("\n")

(And similarly for the other blocks of data.)

This would make the assembly a tiny bit more readable, get rid of the
kinda ugly printf("\n") checks and prints, and would be useful for the
offset + size header format I mentioned earlier.


Makes sense

-       /* file_ids[] */
-       for (unsigned int i = 0; i < num_entries; i++)
-               printf("\t.short %u\n", entries[i].file_id);
-
-       /* Padding to align lines[] to 4 bytes */
-       if (num_entries & 1)
-               printf("\t.short 0\n");
-       if (num_entries)
+       /* block_offsets[] */
+       for (unsigned int i = 0; i < num_blocks; i++)
+               printf("\t.long %u\n", block_offsets[i]);
+       if (num_blocks)
                printf("\n");

-       /* lines[] */
-       for (unsigned int i = 0; i < num_entries; i++)
-               printf("\t.long %u\n", entries[i].line);
-       if (num_entries)
+       /* compressed data[] */
+       for (unsigned int i = 0; i < compressed_size; i++) {
+               if ((i % 16) == 0)
+                       printf("\t.byte ");
+               else
+                       printf(",");
+               printf("0x%02x", compressed_data[i]);
+               if ((i % 16) == 15 || i == compressed_size - 1)
+                       printf("\n");
+       }
+       if (compressed_size)
                printf("\n");


Also, maybe we can use .uleb128/.sleb128 here, and generate something like:

        printf("\t.sleb128 %#x - %#x\n", cur_addr, prev_addr);

And have the assembler do the subtraction and encoding for us? If that
works it should significantly simplify the compression code above.


Makes sense

Speaking of... Why do we use uleb128(zigzag(num)) and not just sleb128(num)?


Because I originally wrote uleb128 to tackle the unsigned fields, and zigzag
was a quick way to handle signed deltas.

But as you point out, if we use .ubleb128/.sleb128 I think we can just drop
zigzag.

[...]

@@ -558,10 +674,11 @@ int main(int argc, char *argv[])
                        skipped_overflow);

        deduplicate();
+       compress_entries();
        compute_file_offsets();

-       fprintf(stderr, "lineinfo: %u entries, %u files\n",
-               num_entries, num_files);
+       fprintf(stderr, "lineinfo: %u entries, %u files, %u blocks, %u compressed 
bytes\n",
+               num_entries, num_files, num_blocks, compressed_size);

        if (module_mode)
                output_module_assembly();
@@ -577,6 +694,9 @@ int main(int argc, char *argv[])
        for (unsigned int i = 0; i < num_files; i++)
                free(files[i].name);
        free(files);
+       free(compressed_data);
+       free(block_addrs);
+       free(block_offsets);

        return 0;
 }
diff --git a/scripts/kallsyms.c b/scripts/kallsyms.c
index 42662c4fbc6c9..94fbdad3df7c6 100644
--- a/scripts/kallsyms.c
+++ b/scripts/kallsyms.c
@@ -80,11 +80,12 @@ static bool is_ignored_symbol(const char *name, char type)
 {
        /* Ignore lineinfo symbols for kallsyms pass stability */
        static const char * const lineinfo_syms[] = {
-               "lineinfo_addrs",
-               "lineinfo_file_ids",
+               "lineinfo_block_addrs",
+               "lineinfo_block_offsets",
+               "lineinfo_data",
                "lineinfo_file_offsets",
                "lineinfo_filenames",
-               "lineinfo_lines",
+               "lineinfo_num_blocks",
                "lineinfo_num_entries",
                "lineinfo_num_files",
        };
diff --git a/scripts/link-vmlinux.sh b/scripts/link-vmlinux.sh
index 640209f2e9eb9..3c122cf9b95c5 100755
--- a/scripts/link-vmlinux.sh
+++ b/scripts/link-vmlinux.sh
@@ -235,12 +235,16 @@ lineinfo_num_entries:
        .balign 4
 lineinfo_num_files:
        .long 0
-       .globl lineinfo_addrs
-lineinfo_addrs:
-       .globl lineinfo_file_ids
-lineinfo_file_ids:
-       .globl lineinfo_lines
-lineinfo_lines:
+       .globl lineinfo_num_blocks
+       .balign 4
+lineinfo_num_blocks:
+       .long 0
+       .globl lineinfo_block_addrs
+lineinfo_block_addrs:
+       .globl lineinfo_block_offsets
+lineinfo_block_offsets:
+       .globl lineinfo_data
+lineinfo_data:
        .globl lineinfo_file_offsets
 lineinfo_file_offsets:
        .globl lineinfo_filenames


The contents of this .tmp_lineinfo.S is fixed, so it feels like it could
just be a proper file, instead of something "generated" dynamically.


Makes sense

--
Thanks,
Sasha

Re: [PATCH 3/3] kallsyms: delta-compress lineinfo tables for ~2.7x size reduction

Reply via email to