-
Notifications
You must be signed in to change notification settings - Fork 47
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Digest for DCAT distributions #1287
Comments
@cristianolongo , I wonder whether the use case you are proposing is one of those addressed in DCAT-AP and its extensions by using |
thanks @andrea-perego , yes spdx:checksum do the case. However I can't find the specification of DCAT-AP |
|
great, thanks |
Hi @cristianolongo @andrea-perego - this seems to me a quite generic use case that might be good to consider in DCAT too. |
of course, it is relevant for all downloadable datasets, and may be also for datasets provided via a SPARQL endpoint. Should I reopen the issue? |
@agbeltran said:
Agreed. For our records: DCAT-AP 2.0.1 describes the purpose of
An example of its use: @prefix spdx: <http://spdx.org/rdf/terms#> .
@prefix xsd: <http://www.w3.org/2001/XMLSchema#> .
<https://europeandataportal.eu/set/distribution/2f3d36a4-79de-4cfb-85d7-706519be7b25> spdx:checksum _:b621486445 .
_:b621486445 a spdx:Checksum ;
spdx:algorithm spdx:checksumAlgorithm_sha1 ;
spdx:checksumValue "71bf58e542a47d7092ed1924f34db91bb24fe2c2"^^xsd:hexBinary . |
@cristianolongo: we discussed the adoption of spdx:checksum in the last DCAT meeting (see meeting minutes). |
Yes, I dump my dataset via a construct query like
With appropriate ordering clauses, the output should be predictable (depending on the knowledge base content of course). Other cases may be dataset exposed via a REST API which returns json-ld. However, I'm not fully convinced that these examples are in the scope of DCAT. |
For what it's worth, I have used checksums for datasets in a web app, though not with SPARQL. The use case was differentiating datasets of electronic potentials for use in quantum mechanical calculations. |
Agree to the need for a "Checksum" (algorithm + value) for dataset integrity (like spdx). |
I've created a draft PR to integrate the relevant SPDX class and properties: #1323 A preview of the newly added sections:
I've included a couple of EDNOTEs about additional issues to be discussed. Please review. |
I would update the definition to be "The Checksum includes the algorithm and value that allows the integrity of a file to be verified to ensure no errors were detected in transmission or storage." |
@riannella said:
I've added it as a usage note to |
The relevant updates have been merged into the ED via PR #1323 Unless there are any objections, I propose we close this issue. |
We are closing this issue as proposed above and as a result of tonight's DCAT subgroup meeting |
A digest of the file may be useful for downloadable dataset distributions, in order to verify the authenticity of the downloaded file and to verify that the dataset has not been updated after that the digest has been created (the digest should be updated just on the last update time)
The text was updated successfully, but these errors were encountered: