-
Notifications
You must be signed in to change notification settings - Fork 15
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Downloads per month for datasets and rollups for publishers #117
Comments
Hi Morten, |
I agree that we should provide better analytics API services for downloads, there one thing we should consider for this specific request: it is not very common but the publisher of a dataset can change from time to time, we do not store the publisher of a dataset at the time a download was created if we use the current publisher of a dataset we might be providing inaccurate results, I know this would be very rare, but it can happen, for example, we store the dataset title at the time the download was created to avoid inconsistencies |
Hi @fmendezh , I have a SQL statement for a 'complete' csv table, that covers most of the statistics requested by publishers on GBIF data usage. It is broken down to dataset level by month. These are the headers: For the year 2020, this amounts to ~400K rows. Is this something you could use to support the analytics API, or put into https://analytics-files.gbif-uat.org/download/csv/ ? |
Naturalis has also been requesting API based stats on user downloads: I have come quite far in analysing the 811k downloads from the 41 occurrence datasets that Naturalis shares with GBIF. We are very interested in information on the country or continent where the users of our data come from. The country report of the Netherlands does include metrics on downloads of Dutch users, so this information should be stored somewhere with the downloads. Is there a way to access this information through the API? This could be based on a table made of user-downloads count by country, and by year. |
In a similar vein, but at dataset granularity= For these 10 datasets, could we please get a count of download events and occurrences within those download events for 2020 broken down by dataset and month? Something like this:
|
Continuing Naturalis: We are (additionally) interested in the regions where the download requests come from, i.e. where are the users of our data located. Maybe something for the next API release? |
…gKey to the list of parameters #117
The download stats api has been improved with some new services and parameters:
|
@jeroencreuwels take a look at the new download stats api functions. |
Thanks for all the work! Seems perfect for our goals! |
To support
gbif/portal-feedback#1912
gbif/portal16#761
gbif/portal16#138
we need an API to get
We might want to expand this to downloads last 356 days. year to date etc., but above should be a good start.
So something along:
or we could do something simple for now (if at all simpler)
aside from counting download events, we could perhaps count records from the dataset as well?
so that
/occurrence/download/dataset/7f2edc10-f762-11e1-a439-00145eb45e9a
would also return the total record count?The text was updated successfully, but these errors were encountered: