Skip to content

Commit

Permalink
[ENH] Add warning about deidentification when sharing sourcedata (#1769)
Browse files Browse the repository at this point in the history
* reset

* Apply suggestions from code review

Co-authored-by: Oscar Esteban <code@oscaresteban.es>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* semantic line break

* Update src/common-principles.md

---------

Co-authored-by: Oscar Esteban <code@oscaresteban.es>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Chris Markiewicz <effigies@gmail.com>
  • Loading branch information
4 people authored Apr 22, 2024
1 parent 3e6d1a7 commit 01025da
Showing 1 changed file with 22 additions and 0 deletions.
22 changes: 22 additions & 0 deletions src/common-principles.md
Original file line number Diff line number Diff line change
Expand Up @@ -309,6 +309,28 @@ field in `dataset_description.json` of each subdirectory of `derivatives` to:
}
```

!!! danger "Caution"

Sharing source data may help amend errors and missing data discovered
only with the reuse of the raw dataset in practice.
Therefore, from an Open Science perspective, it is RECOMMENDED to share
the source data whenever it is possible.

However, more stringent sharing limitations may apply to the source data
than those applicable to the raw data.
For example, human data almost always requires deidentification
before they can be redistributed,
or the subjects' consent form did not explicitly state that the source files
would be shared after deidentification.
Further examples in which sharing source data may not be possible
include original data formats that are not redistributable
as per the acquisition device's license.

As for raw data, all regulatory, ethical, and legal aspects SHOULD
be carefully considered before sharing data
through the `sourcedata/` directory mechanism.
In the case of source data, these aspects are likely more stringent.

### Storage of derived datasets

Derivatives can be stored/distributed in two ways:
Expand Down

0 comments on commit 01025da

Please sign in to comment.