Data Masking

Concept

Data involved in any data-masking or obfuscation must remain meaningful at several levels:

  • The data must remain meaningful for the application logic.
  • The data must undergo enough changes so that it is not obvious that the masked data is from a source of production data.

Keyword

  • PII: Personally Identifiable Information.
  • EI: explicit identifiers.
  • QI: Quasi-identifiers.
  • SD: Sensitive data.
  • NSD: Nonsensitive data.

Algorithm

  • Randomization
  • Generalization
  • K-Anonimization
  • L-Diversity
  • T-Closeness

Static data masking (SDM)

data at rest.

Dynamic data masking (DDM)

data in transit.

Techniques

  • Substitution

  • Shuffling

  • Number and date variance

  • Encryption

  • Nulling out or deletion

  • Masking out

  • Additional complex rules

Other

  • Management
  • Rule
  • Audit