So, as most folks are aware PyPI has long had a cumulative download count 
available in it’s API. This has been on and off again broken for a *long* time 
and arguably the numbers in there have been “wrong” even when it was working 
because we had no way to reproduce them from scratch (and thus whenever a bug 
occurred we’d flat out lose data or add incorrect data with no way to correct 
it).

In the meantime, we’ve gotten a much better source of querying for download 
information available inside of Google’s BigQuery database [1][2]. Not only is 
this able to be recreated “from scratch” so we can, if needed, fix massive data 
bugs but it provides MUCH more information than the previous downloads and a 
very powerful query language to go along with it.

Unless there is some sort of massive outcry, I plan to deprecate and ultimately 
remove the download counts available in the PyPI API, instead preferring people 
to start using the BigQuery data instead. This more or less reflects the 
current state of things, since it has been on and off broken (typically broken) 
for something like a year now.

[1] https://mail.python.org/pipermail/distutils-sig/2016-May/028986.html 
<https://mail.python.org/pipermail/distutils-sig/2016-May/028986.html>
[2] https://langui.sh/2016/12/09/data-driven-decisions/ 
<https://langui.sh/2016/12/09/data-driven-decisions/>

—
Donald Stufft



_______________________________________________
Distutils-SIG maillist  -  [email protected]
https://mail.python.org/mailman/listinfo/distutils-sig

Reply via email to