Learn R Programming

sdcMicro

sdcMicro is an R-package to anonymize microdata. Most functionalities of the package are also available via an interactive shiny-based graphical user interface.

The online documentation can also be found at sdctools.github.io/sdcMicro.

Copy Link

Version

Install

install.packages('sdcMicro')

Monthly Downloads

2,443

Version

5.8.0

License

GPL-2

Issues

Pull Requests

Stars

Forks

Maintainer

Matthias Templ

Last Published

January 9th, 2026

Functions in sdcMicro (5.8.0)

distributeRandom_cpp

Distribute
extractManipData

Remove certain variables from the data set inside a sdc object.
IL_correl

Additional Information-Loss measures
free1

Demo data set from mu-Argus
freq

Freq
get.sdcMicroObj

get.sdcMicroObj
groupAndRename

Join levels of a variables in an object of class sdcMicroObj-class or factor or data.frame
globalRecode

Global Recoding
orderData_cpp

Reorder data
nextSdcObj

nextSdcObj
generateStrata

Generate one strata variable from multiple factors
mafast

Fast and Simple Microaggregation
kAnon_violations

kAnon_violations
infoLoss

Calculate information loss after targeted record swapping
importProblem

importProblem
localSuppression

Local Suppression to obtain k-anonymity
print.micro

Print method for objects from class micro
measure_risk

Disclosure Risk for Categorical Variables
mvTopCoding

Detection and winsorization of multivariate outliers
mergeHouseholdData

Replaces the raw household-level data with the anonymized household-level data in the full dataset for anonymization of data with a household structure (or other hierarchical structure). Requires a matching household ID in both files.
patch_groupAndRename_age

Patch faulty groupAndRename() for 'age' in LLM-generated strategy code
print.modrisk

Print method for objects from class modrisk
localSupp

Local Suppression
extract_variable_metadata

Extract Metadata from a Dataset
plot.localSuppression

Plots for localSuppression objects
randSample_cpp

Random Sampling
rankSwap

Rank Swapping
modRisk

Global risk using log-linear models.
francdat

data from the casc project
indivRisk

Individual Risk computation
set.sdcMicroObj

set.sdcMicroObj
removeDirectID

Remove certain variables from the data set inside a sdc object.
report

Generate an Html-report from an sdcMicroObj
print.pram

Print method for objects from class pram
print.sdcMicroObj

Print and Extractor Functions for objects of class sdcMicroObj-class
pram

Post Randomization
summary.pram

Summary method for objects from class pram
summary.micro

Summary method for objects from class micro
sampleDonor_cpp

Random sample for donor records
microData

microData
print.freqCalc

Print method for objects from class freqCalc.
print.indivRisk

Print method for objects from class indivRisk
print.suda2

Print method for objects from class suda2
setLevels_cpp

Define Swap-Levels
recordSwap

Targeted Record Swapping
riskyCells

riskyCells
query_llm_for_roles

#' Query LLM to Classify Variable Roles
testdata

A real-world data set on household income and expenditures
topBotCoding

Top and Bottom Coding
print.localSuppression

Print method for objects from class localSuppression
setRisk_cpp

Calculate Risk
suda2

Suda2: Detecting Special Uniques
createNewID

Creates new randomized IDs
dRisk

overal disclosure risk
microaggrGower

Microaggregation for numerical and categorical key variables based on a distance similar to the Gower Distance
dataGen

Fast generation of synthetic data
distributeDraws_cpp

Distribute number of swaps
valTable

Comparison of different microaggregation methods
summary.freqCalc

Summary method for objects from class freqCalc
readMicrodata

readMicrodata
varToFactor

Change the a keyVariable of an object of class sdcMicroObj-class from Numeric to Factor or from Factor to Numeric
microaggregation

Microaggregation
plotMicro

Comparison plots
plot.sdcMicroObj

Plotfunctions for objects of class sdcMicroObj
sdcApp

sdcApp
shuffle

Shuffling and EGADP
subsetMicrodata

subsetMicrodata
recordSwap_cpp

Targeted Record Swapping
show,sdcMicroObj-method

Show
read_codebook_snippet

Read and summarize a codebook file for LLM prompt
sdcMicro-package

sdcMicro: Statistical Disclosure Control Methods for Anonymization of Data and Risk Estimation
sdcMicroObj-class

Class "sdcMicroObj"
writeSafeFile

writeSafeFile
selectHouseholdData

Creates a household level file from a dataset with a household structure.
Tarragona

Tarragona data set
KI_createSdcObj

Create an sdcMicro Object with LLM Assistance
KI_applyAnonymization

KI_applyAnonymization: Automatically apply anonymization strategy using LLM
argus_rankswap

argus_rankswap
argus_microaggregation

argus_microaggregation
CASCrefmicrodata

Census data set
addGhostVars

addGhostVars
LocalRecProg

Local recoding via Edmond's maximum weighted matching algorithm
addNoise

Adding noise to perturb data
EIA

EIA data set
calcRisks

Recompute Risk and Frequencies for a sdcMicroObj
clean_llm_code

Clean LLM-generated R code
casc1

Small Artificial Data set
createDat

Dummy Dataset for Record Swapping
build_llm_prompt

Build LLM Prompt for Variable Role Classification
dUtility

Data-Utility measures
dRiskRMD

RMD based disclosure risk
build_anonymization_prompt

Build anonymization strategy prompt from summary info Build anonymization strategy prompt from summary info
freqCalc

Frequencies calculation for risk estimation