-
Notifications
You must be signed in to change notification settings - Fork 0
Feat/release stats #284
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: main
Are you sure you want to change the base?
Feat/release stats #284
Conversation
|
We could use big query to get the older stats. Zarr Python 3 was first released in January but released from 3.0 - 3.0.7 were yanked. SELECT COUNT(*) as downloads
FROM `bigquery-public-data.pypi.file_downloads`
WHERE file.project = 'zarr'
AND DATE(timestamp) BETWEEN '2024-12-01' AND '2024-12-31'1076076 SELECT COUNT(*) as downloads
FROM `bigquery-public-data.pypi.file_downloads`
WHERE file.project = 'zarr'
AND DATE(timestamp) BETWEEN '2025-09-01' AND '2025-09-30'2340374 |
|
@maxrjones You said zarr-python 3.0 was first released in January but the last 2.x release, 2.18.7, was released April 9th, so I'm not sure we can associate downloads prior to April 9th with release 2.18.x or 3.0.x. (NB: There were also all of the 3.0.0-alpha releases in 2024) I think there are 2 opttions:
Or we could use the following last minor releases:
Also thank you for sharing that big query option! Much easier than the pypi stats API. |
Zarr Python is still supporting 2.x for a period of time after 3.0 was released to facilitate migration, meaning bug-fix releases are coming out. Since the PyPI stats aggregate V2 and V3 downloads, I think the most helpful comparison would be x downloads per day in the month preceding v3.0.0 release (December 9, 2024-January 8, 2025) vs x downloads per day in the last complete month. I would prefer this solution because I think you'd already be catching some of the V3 upswing (if there is any) if using April as the baseline. Perhaps it would be even easier to explain a comparison between October 2025 to October 2024, which would also include any signal from the V3 release. |
|
For zarr:
Virtualizarr:
Virtualizarr's 2.0 release has resulted in over 9x more downloads (should we say this? This can also be attributed to general knowledge of the project) over the same month a year prior OR a saw a 74% increase in downloads month over month from the release. |
Maybe we could say "virtualizarr 2.0 development" to account for both the release and outreach about the release These are some pretty sweet stats |
Pypi stats only go back 180 days so I don't think we can generate a comparable report for zarr 3.0 unfortunately since it was released in April.
I looked into the github API for repo downloads for zarr-python but can't see there is actually a downloads metric to be had.
Thoughts @maxrjones ?