Dataset de-identification levels
Jump to navigation
Jump to search
These categories indicates the identifiers present in the data (whether or not pseudonymising keys are present) and thus gives an indication of the level of de-identification. It is included to clarify the degree of further processing that might be required. The possible values are:
id | name | description | source |
---|---|---|---|
1 | None | A dataset with no direct or indirect identifiers. Would be rare as scientific utility is likely to be severely affected, but could be a subset of data used for a particular purpose. | ECRIN |
2 | De-identified | A dataset with no direct identifiers, and with indirect identifiers modified by established de-identification steps (e.g. amalgamation of categories, rebasing of dates, removal of text comments) so that it is no longer possible to identify any individuals within the data set. | ECRIN |
3 | Has Indirect Identifiers | Dataset contains no direct identifiers, but does contain data fields that when considered in combination might be used to identify some of the individuals. In some cases, access would also be required to other systems. | ECRIN |
4 | Has Direct Identifiers | The dataset contains at least one direct identifier, i.e. a name, code, system id or other data that allow the individual to the identified unambiguously – in some cases requiring access to an additional system. This would be very rare in the context of shared data. | ECRIN |
9 | Comment on identifiers present | Indicators or comment on identifiers present but not classifiable as one of types 1-4. Details field should be used for the comment. | ECRIN |
0 | Not yet known | Dummy value supplied by default on entity creation. | ECRIN |