Digital tools for risk assessment and protective measures in tabular data containing personal information

Woman looking at screen with the dmp checklist

Anyone working with quantitative tabular data must have an understanding of concepts like re-identification and statistical disclosure control. For that reason, SND has developed a workshop to introduce basic concepts and tools for statistical disclosure control for tabular data.

The workshop highlights the possibilities and limitations of existing tools and methods for pseudonymization, and introduces the R package sdcMicro to illustrate how statistical methods can be applied to assess the risk of disclosure and re-identification of individuals in tabular datasets. Participants get a basic overview of the principles and functions of sdcMicro, and can then apply their knowledge to a synthetic dataset. They learn to interpret and assess the results of pseudonymization and recoding of data, and how these methods affect the usefulness of the data.

No previous knowledge of R is needed. 

The workshop material is freely accessible on Zenodo.

The material contains:

  • the workshop program
  • instructions to the workshop leader
  • a workshop presentation
  • advice on how to install the necessary software
  • a synthetic dataset to practice on.

Note that the workshop is in Swedish.