Hi,

On 2/4/26 12:10 AM, Dmitry E. Oboukhov wrote:

>> One simple approach would be to package vendored dependencies as
>> separate .orig archives, ideally (if they come from git submodules)
>> with the git-archive commit ID annotation inside the archive. The
>> security tracker could import these annotations from all known
>> archives, map them to their origin projects, and then check if the
>> packaged commit ID is a descendant of a commit that introduces a
>> particular fix.
>
> How would this approach work for, say, Python packages listed in
> requirements.txt? Would we download them and package them as
> separate .orig archives?

Yes. Whatever we do, we need all dependencies to be available after unpacking the source package and installing all listed dependencies, so either we merge all the archives into one big blob, or we keep them separate and attach them all to the same dsc.
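
For what it's worth, source format "3.0 (quilt)" already supports the latter: additional upstream tarballs carry an .orig-<component> suffix and are unpacked into a subdirectory of that name. A hypothetical file list (all names made up) could look like:

    foo_1.2.orig.tar.xz              # the main upstream source
    foo_1.2.orig-requests.tar.xz     # unpacked into ./requests/
    foo_1.2.orig-urllib3.tar.xz      # unpacked into ./urllib3/
    foo_1.2-1.debian.tar.xz          # the Debian packaging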

In the specific case of Python, where there is a strong culture of building somewhat stable APIs, I expect that vendoring dependencies will be the exception rather than the norm anyway.

The most problematic ecosystems, in my opinion, are cargo, npm and golang, but we mostly lack insight here -- we have no statistics on how much duplication exists inside the archive, how many of these duplicates could be avoided by folding them together, and how many binary packages we could save in the other direction by vendoring libraries that only have a single user and are statically linked anyway. That's why my proposal goes towards making these dependencies transparent and trackable first.
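
To illustrate what gathering such statistics could look like, here is a rough sketch in Python that counts how often the same crate version recurs in Cargo.lock files across a tree of unpacked sources (the directory layout is an assumption on my part; npm and golang equivalents would parse package-lock.json and go.sum instead):

    # Rough sketch: count duplicated vendored crates across a tree of
    # unpacked source packages by reading their Cargo.lock files.
    # Requires Python 3.11+ for tomllib.
    import collections
    import pathlib
    import sys
    import tomllib

    counts = collections.Counter()
    for lock in pathlib.Path(sys.argv[1]).rglob("Cargo.lock"):
        with open(lock, "rb") as f:
            data = tomllib.load(f)
        # Cargo.lock records one [[package]] entry per resolved crate.
        for pkg in data.get("package", []):
            counts[(pkg["name"], pkg["version"])] += 1

    for (name, version), n in counts.most_common():
        if n > 1:
            print(f"{name} {version}: vendored {n} times")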

The rule against vendoring dependencies was informed by the effort it took to clean up the hundreds of embedded copies of zlib and make all packages use the common implementation. If we are to relax the rules again, we need to make sure that we can at least find any embedded copies and quickly determine their version and security status.
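
As a trivial illustration of what "find and quickly identify" could mean: zlib.h defines ZLIB_VERSION as a string literal, so even a naive scanner can report embedded copies together with their version (a sketch, not a real tool):

    # Sketch: locate embedded copies of zlib in an unpacked source
    # tree and report the version string from each zlib.h found.
    import pathlib
    import re
    import sys

    version_re = re.compile(r'#\s*define\s+ZLIB_VERSION\s+"([^"]+)"')
    for header in pathlib.Path(sys.argv[1]).rglob("zlib.h"):
        match = version_re.search(header.read_text(errors="replace"))
        if match:
            print(f"{header}: embedded zlib {match.group(1)}")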

This will likely require an external archive scanner, but we already have a few of those, such as the one that scans for undeclared file conflicts.

Thinking in lintian unpack levels: the less a package needs to be processed by this scanner, the better -- so an "XS-Ecosystem" tag would find its way into the Sources file, and a Python-specific scanner could disregard any packages without a requirements.txt file before even downloading them.
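
(The XS- prefix makes dpkg copy the field into the dsc, where it is stripped down to just "Ecosystem" and then propagated into the Sources index, same as XS-Go-Import-Path is today. A sketch of the pre-filtering step using python-debian, with the field name being my proposal rather than anything that exists:)

    # Sketch: pre-filter a Sources index by a hypothetical "Ecosystem"
    # field (what an XS-Ecosystem tag in debian/control would become),
    # so an ecosystem-specific scanner only downloads real candidates.
    import sys

    from debian import deb822  # python3-debian

    with open(sys.argv[1]) as f:
        for src in deb822.Sources.iter_paragraphs(f):
            if src.get("Ecosystem") == "python":
                print(src["Package"], src["Version"])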

Likewise, the tags I proposed for scanning git ancestry (which are ecosystem-agnostic) would work either from the Sources file, if we forward these tags there, or we could generate a generic "Has-Git-Info" tag and make the scanner download the dsc files, which is still somewhat lightweight.
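
The annotation itself comes for free when upstream uses git archive together with an export-subst attribute (a tracked file containing $Format:%H$ gets the commit ID substituted on archive creation). Given that recorded commit and a commit known to contain a fix, the tracker-side check boils down to a single git invocation, roughly like this (repo path and commit IDs are placeholders):

    # Sketch: decide whether a vendored dependency already contains a
    # fix, by checking whether the fixing commit is an ancestor of the
    # commit ID recorded in the archive. Assumes a local clone of the
    # origin project.
    import subprocess

    def contains_fix(repo_dir, packaged_commit, fix_commit):
        # "git merge-base --is-ancestor A B" exits 0 iff A is an
        # ancestor of B.
        result = subprocess.run(
            ["git", "-C", repo_dir, "merge-base", "--is-ancestor",
             fix_commit, packaged_commit])
        return result.returncode == 0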

IMO, we should also include API/ABI stability as a factor in deciding whether we actually want to release a package as part of a stable distribution.

We have several upstreams who are bluntly telling us that they are unwilling to support users running Debian stable, so at this point the decision to release such a package as part of a stable release is a commitment by the Debian maintainer to provide this user support.

For a package with a lot of version-pinned dependencies, that commitment can be massive, and should not be taken on lightly; on the other hand, for a lot of these packages vendoring is the only approach that makes this feasible in the first place.

   Simon
