Learn R Programming

FakeDataR (version 0.2.2)

detect_sensitive_columns: Detect sensitive columns by name

Description

Uses a broad, configurable regex library to match likely PII columns. You can extend it with extra_patterns (they get ORed in) or replace everything with a single override_regex.

Usage

detect_sensitive_columns(x_names, extra_patterns = NULL, override_regex = NULL)

Value

Character vector of names from x_names that matched.

Arguments

x_names

Character vector of column names to check.

extra_patterns

Character vector of additional regexes to OR in. Examples: c("MRN", "NHS", "Aadhaar", "passport")

override_regex

Optional single regex string that fully replaces the defaults (case-insensitive). When supplied, extra_patterns is ignored.

Examples

Run this code
detect_sensitive_columns(c("id","email","home_phone","zip","notes"))
detect_sensitive_columns(names(mtcars), extra_patterns = c("^vin$", "passport"))

Run the code above in your browser using DataLab