Detecting separate debuginfo

Florian Weimer Fri, 28 Mar 2014 06:12:18 -0700

I maintain a database which extracts symbol information from ELF objects(among other things). I would like to enrich that with DWARF producerdata, and perhaps additional DWARF information in the future.

I'd really like to avoid importing the ELF symbol information twice,once from the real object file, and once from the separate debuginfo.

The database performs content-based deduplication, this means I do nothave path name information during extraction. This mean I cannot usefile system paths to disambiguate the real thing and its debugginginformation. Both files are loaded separately and not necessarily atthe same time. I don't want to change that if possible because thiswould result in a scalability issue eventually. I don't want to assumethat *all* debuginfo data has been separated, either.

Based on the previous discussion around program interpreter reporting inreadelf, there is no easy way to detect separate debuginfo to triggerspecial processing for it (e.g., do not extract symbols, onlyDW_at_producer data).

One thing that would help me as well if there is a way to get the exactsame set of exported symbols from the real file and its separatedebuginfo. The I could deduplicate based on that, and processing bothfiles would not matter anymore. eu-readelf shows quite different outputfor the two files, so I'm not sure how to achieve that.

I don't actually use eu-readelf output (but my extraction code isderived from it), and I'm open to suggestions to look at particularsections/headers to get matching output. I'm mainly interested inpublic symbols and undefined symbols. Internal symbols from debugginginformation could be ignored for the time being.


--
Florian Weimer / Red Hat Product Security Team

Detecting separate debuginfo

Reply via email to