-
Notifications
You must be signed in to change notification settings - Fork 47
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Clarify definition of dct:accrualPeriodicity in the context of DCAT #728
Comments
This issue was triggered by a comment submitted by Daniel Pop [3] [3] https://lists.w3.org/Archives/Public/public-dxwg-comments/2019Jan/0013.html |
Concerning specifically the relevant Daniel Pop's comment:
|
The concern applies primarily to datasets that are time-series, which might be issued on a different schedule to the rate of addition of data to the dataset. For example, a dataset showing rainfall at 5-minute intervals might be formally published and registered just once a day. The DCAT definition appears to specify the latter, while the DCMI definition encompasses both. I think we need to either
Else
@andrea-perego , @kcoyle , @davebrowning do you have backward compatibility evidence here? How is |
Collection in Dublin Core relates to the Collection Application Profile. It clearly comes out of the archival community. I will ask if anyone has examples of use. The wording is specifically about the addition of entries to a collection or a catalog defining a collection, not about publication. Presumably the periodicity of publication could be the same as the periodicity that items are added to the collection, but obviously they could also differ. |
@dr-shorthair asked:
Yep. This is definitely used. According to the report on usage statics of DCAT-AP in the European Data Portal (EDP) (Oct, 2017), So, there's indeed a backward compatibility issue to be taken into account. |
The report says: "Use of dct:accrualPeriodicity This does seem different from the semantics of the dct definition. I've asked in the DC community if anyone there has examples of use. Although the definitions differ, I wonder if the end result is not the same, which is "frequency of addition of resources to a catalog." |
Thanks @kcoyle - I thought my concern came down to thinking about whether the 'collection' under consideration is And even "at what frequency a dataset is updated by its owner" still does not nail down "at what frequency items (values) are added to (spaced in) a time-series" since a daily dataset update might still include hourly data. From a user's perspective, these differences really matter: The dataset-update rate is about currency, while the data-rate will probably determine whether the data is fit for purpose. This is not an esoteric concern. It was prompted by an investigation in a recent metadata workshop, where we were trying to find datasets about related phenomena to do some modeling and analysis. Time-series with daily, seasonal and annual spacing can't be used together for most tasks, but that information is mostly only available by reading the abstract, or inspecting the downloaded data. If it was an explicit dataset statistic then a query could be constructed. That's why I referred over to #84 . But I do think there is some clarification required for |
@dr-shorthair I agree that there is a significant difference between adding items to a set and updating items in a set. The dct term, being from the archival world, it definitely about the former. So I could consider accrualPeriodicity to be about adding to the set, which in DCAT would be the catalog. Updating the dataset itself is closer in my mind to versioning, as the dataset undergoes a change. dqm's "expected update interval" seems closer to the latter. If both are needed then they obviously should be distinguished. |
Isn't @dr-shorthair 's question about the temporal resolution of the data-- if the data set is reporting air temperature, are measurements recorded every minute, every hour, every day ... The accessible representation for the dataset might be updated once a day or once a year, but that is orthogonal to the temporal resolution of the data. There is a similar consideration for spatial resolution. These both appear to be different properties than what is intended by accrualPeriodicity. |
@smrgeoinfo Re-reading his post, I think you are right, so in that case the dct property is definitely of a different nature (although still accruing and still periodic, but different subjects being acted on). The DCAT definition "frequency at which a dataset is published" is something else yet again. We may need a well-done chart to sort out the meanings and their combinations. |
Like I said - I think we probably just need to improve the wording around |
@kcoyle said:
Not necessarily. If the dataset is updated by adding new data items (e.g., adding new rows in a table), without modifying the existing ones, then we are in the same case of the catalogue - where each dataset can be considered as a data item. We are talking here about different ways (not mutually exclusive) on how a dataset can be updated, and how they are relevant for users. I think that, in the most general case, users are interested in knowing how frequently a dataset is updated, and they are not very much interested in how this is done. If we identify the need to also clarify whether the update was done by modifying the existing data, adding new data items, or both, fine. But we don't have to forget the most general use case. About |
A new property e.g.
|
The DCMI definition of
dct:accrualPeriodicity
is [1]However, the DCAT definition is [2]
which has a rather different meaning. In particular, the rate of addition of data to a time series might be quite different to the frequency of its release or publication.
This needs to be clarified.
[1] http://dublincore.org/documents/dcmi-terms/#terms-accrualPeriodicity
[2] https://w3c.github.io/dxwg/dcat/#Property:dataset_frequency
The text was updated successfully, but these errors were encountered: