chore: add download stats #3283


@ianna (Collaborator) commented Oct 11, 2024

I thought it would be nice to show the download stats. See https://www.pepy.tech/projects/awkward
@jpivarski (Member) commented

Yeah, I don't know if we should show those numbers, because they're significantly inflated by distributed jobs that run pip install as their first line. You can see this by separating macOS and Windows downloads from Linux downloads, which is the only handle that the PyPI dataset provides for this. (It's much more common for distributed jobs to run on Linux.)

[image: plot of downloads by operating system]

The Linux downloads are more than 10× the macOS and Windows downloads; they need to be down-scaled with a separate (right) axis to show both on the same plot. The Linux downloads are also more volatile, since they depend on when very large jobs run, rather than following smoother patterns like when people decide to try out the software and how frequently they update their versions. The downloads tracked by the site you pointed to are very spiky, and I also don't know how to reconcile its much larger numbers with what I found in Google BigQuery.
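To make the comparison concrete, here is a minimal sketch of the kind of split described above. It is not the analysis behind the plot. The rows are made-up numbers for illustration only, and the function names are hypothetical. It assumes daily counts have already been exported from the public PyPI dataset (believed to be `bigquery-public-data.pypi.file_downloads`, grouped on `details.system.name`, where macOS is reported as "Darwin"); the export step itself is not shown.

```python
from collections import defaultdict
from statistics import mean, pstdev

# Made-up daily download counts, NOT real Awkward Array statistics.
# Each row is (date, system name, downloads); system names are assumed
# to follow the PyPI dataset's convention ("Linux", "Darwin", "Windows").
ROWS = [
    ("2024-10-01", "Linux", 10_000),
    ("2024-10-01", "Darwin", 1_000),
    ("2024-10-01", "Windows", 500),
    ("2024-10-02", "Linux", 90_000),
    ("2024-10-02", "Darwin", 1_100),
    ("2024-10-02", "Windows", 400),
    ("2024-10-03", "Linux", 20_000),
    ("2024-10-03", "Darwin", 900),
    ("2024-10-03", "Windows", 600),
]


def split_by_system(rows):
    """Sum downloads per day into a 'linux' series and an 'other' series.

    'other' combines macOS ("Darwin") and Windows; rows with any other
    system name (or none) are ignored.
    """
    series = {"linux": defaultdict(int), "other": defaultdict(int)}
    for day, system, count in rows:
        if system == "Linux":
            series["linux"][day] += count
        elif system in ("Darwin", "Windows"):
            series["other"][day] += count
    return series


def spikiness(daily):
    """Coefficient of variation (population std / mean) of daily counts."""
    values = list(daily.values())
    return pstdev(values) / mean(values)


series = split_by_system(ROWS)
linux_total = sum(series["linux"].values())
other_total = sum(series["other"].values())

# How many times larger the Linux total is than macOS + Windows combined.
linux_to_other_ratio = linux_total / other_total

print(f"Linux / (macOS + Windows): {linux_to_other_ratio:.1f}x")
print(f"Linux spikiness: {spikiness(series['linux']):.2f}")
print(f"Other spikiness: {spikiness(series['other']):.2f}")
```

The coefficient of variation is only one possible measure of "spiky"; it is used here because it is scale-free, so the much larger Linux series can be compared directly with the smaller macOS and Windows series.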

I think download statistics are only interesting when they're measuring human choices, that is, when there's a one-to-one relationship between a pip install and a happy customer, rather than "I think I'll run with 100k jobs today, rather than 10k" (which is also a human choice, but not what I mean). So I'd rather not do this. We might find ourselves in a position of having to defend these numbers, and I don't think I can do that.

@jpivarski closed this Oct 11, 2024