FYI, egg-info directories can store arbitrary data (see e.g. the
EggTranslations package, which uses it for localization resources), so
you may want to impose some restrictions on *which* metadata files to
include.

You mean, so that it doesn't include malware, porn, or other spam?
That would be useful, I guess.

Second, if a user uploads a source distribution built with setuptools,
it will include an .egg-info directory automatically, so you can simply
extract it from there. Conversely, if the source distribution is *not*
built with setuptools, then building it with setuptools would not
produce much information in the egg-info anyway.

I see. I'll try to start with that as an assumption, but will still try
to validate it wrt. real data.

PKG-INFO
SOURCES.txt # a manifest of the sdist contents
top_level.txt # list of top-level package/module names
scripts/ # source code of scripts in the package)
zip-safe or not-zip-safe # flag file

I don't think that most of these are useful for PyPI searches, though I
suppose a listing of the name of the scripts the package includes could
be useful.

I really try PyPI not to interpret any of these data. So I rather err on
the inclusive side.

I'm not sure how useful it is to just have URLs for accessing the files,
though, vs. having actual searches on structured data provided in the
files. For example, an index of projects by package or module names, or
of projects that provide a particular entry point.

In principle, it should be possible to let other services provide such
indices. I'd rather provide the files as-is for the moment, and see
what kind of facilities people would desire on top of it.

Regards,
Martin
_______________________________________________
Catalog-SIG mailing list
Catalog-SIG@python.org
http://mail.python.org/mailman/listinfo/catalog-sig

Reply via email to