2 files changed: +27 -2 lines changed
+ BigQuery Datasets
+ =================
+
+ We use BigQuery to serve our public datasets. PyPI offers two tables whose
+ data is sourced from projects on PyPI. The tables and their data are licensed
+ under the `Creative Commons Attribution 4.0 International License <https://creativecommons.org/licenses/by/4.0/>`_.
+
+ Download Statistics Table
+ -------------------------
+
+ The download statistics table lets you learn more about the download patterns of
+ packages hosted on PyPI. This table is populated through the `Linehaul
+ project <https://github.com/pypa/linehaul>`_ by streaming download logs from PyPI
+ to BigQuery. For more information on analyzing PyPI package downloads, see the `Python
+ Packaging User Guide <https://packaging.python.org/guides/analyzing-pypi-package-downloads/>`_.
+
+ Project Metadata Table
+ ----------------------
+
+ We also have a table that provides access to distribution metadata
+ as outlined by the `core metadata specifications <https://packaging.python.org/specifications/core-metadata/>`_.
+ The table is meant to be a data dump of metadata from every release on PyPI, which
+ means that the rows in this BigQuery table are immutable and are not removed even if a release or project is deleted.
+ This data is accessible under the ``the-psf.pypi.distribution_metadata`` public dataset on BigQuery.
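
As a rough illustration of how the metadata table described above might be queried, here is a minimal Python sketch. It is not part of the change itself. The table name comes from the documentation text; the ``name`` and ``version`` columns are assumptions based on the core metadata field names and should be checked against the actual schema. The helper names ``build_release_query`` and ``fetch_releases`` are hypothetical. Running the query assumes the ``google-cloud-bigquery`` package is installed and Google Cloud credentials are configured.

```python
# Table name taken from the documentation text above.
METADATA_TABLE = "the-psf.pypi.distribution_metadata"


def build_release_query(limit: int = 10) -> str:
    """Return a Standard SQL query listing the releases of one project.

    The `name` and `version` columns are assumptions based on the core
    metadata field names; verify them against the table schema.
    The project name is supplied separately as the @project parameter.
    """
    if limit < 1:
        raise ValueError("limit must be a positive integer")
    return (
        "SELECT DISTINCT name, version\n"
        f"FROM `{METADATA_TABLE}`\n"
        "WHERE name = @project\n"
        f"LIMIT {int(limit)}"
    )


def fetch_releases(project: str, limit: int = 10):
    """Run the query on BigQuery and return (name, version) tuples.

    Requires the google-cloud-bigquery package and configured credentials.
    """
    # Imported lazily so the query builder works without the dependency.
    from google.cloud import bigquery

    client = bigquery.Client()
    job_config = bigquery.QueryJobConfig(
        query_parameters=[
            bigquery.ScalarQueryParameter("project", "STRING", project)
        ]
    )
    rows = client.query(build_release_query(limit), job_config=job_config).result()
    return [(row["name"], row["version"]) for row in rows]
```

Because rows are never removed from this table, a query like this may also return releases that have since been deleted from PyPI. Passing the project name as a query parameter, rather than formatting it into the SQL string, avoids quoting problems.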
@@ -56,8 +56,8 @@ use our RSS feeds.
  No new integrations should use the XML-RPC APIs as they are planned for
  deprecation. Existing consumers should migrate to JSON/RSS/Legacy APIs.

- Available APIs
- --------------
+ Available APIs & Datasets
+ -------------------------

  .. toctree::
      :maxdepth: 2
@@ -68,3 +68,4 @@ Available APIs
      stats
      xml-rpc
      integration-guide
+     bigquery-dataset