Learn R Programming

sdcMicro

sdcMicro is an R-package to anonymize microdata. Most functionalities of the package are also available via an interactive shiny-based graphical user interface.

The online documentation can also be found at sdctools.github.io/sdcMicro.

Copy Link

Version

Install

install.packages('sdcMicro')

Monthly Downloads

1,029

Version

5.8.1

License

GPL-2

Issues

Pull Requests

Stars

Forks

Maintainer

Matthias Templ

Last Published

March 10th, 2026

Functions in sdcMicro (5.8.1)

distributeDraws_cpp

Distribute number of swaps
argus_rankswap

argus_rankswap
build_agentic_prompt

Build prompt for agentic batch strategy generation
extract_variable_metadata

Extract Metadata from a Dataset
execute_tool_calls

Execute structured tool calls on an sdcMicroObj
groupAndRename

Join levels of a variables in an object of class sdcMicroObj-class or factor or data.frame
francdat

data from the casc project
globalRecode

Global Recoding
get.sdcMicroObj

get.sdcMicroObj
dataGen

Fast generation of synthetic data
get_tool_schemas

Get tool schemas for structured LLM tool calling
dUtility

Data-Utility measures
indivRisk

Individual Risk computation
kAnon_violations

kAnon_violations
freq

Freq
extractManipData

Remove certain variables from the data set inside a sdc object.
free1

Demo data set from mu-Argus
mafast

Fast and Simple Microaggregation
dRisk

overal disclosure risk
localSupp

Local Suppression
dRiskRMD

RMD based disclosure risk
localSuppression

Local Suppression to obtain k-anonymity
mergeHouseholdData

Replaces the raw household-level data with the anonymized household-level data in the full dataset for anonymization of data with a household structure (or other hierarchical structure). Requires a matching household ID in both files.
distributeRandom_cpp

Distribute
generateStrata

Generate one strata variable from multiple factors
microData

microData
freqCalc

Frequencies calculation for risk estimation
measure_risk

Disclosure Risk for Categorical Variables
infoLoss

Calculate information loss after targeted record swapping
pram

Post Randomization
plot.sdcMicroObj

Plotfunctions for objects of class sdcMicroObj
microaggregation

Microaggregation
casc1

Small Artificial Data set
clean_llm_code

Clean LLM-generated R code
print.indivRisk

Print method for objects from class indivRisk
plotMicro

Comparison plots
print.localSuppression

Print method for objects from class localSuppression
print.micro

Print method for objects from class micro
print.modrisk

Print method for objects from class modrisk
orderData_cpp

Reorder data
microaggrGower

Microaggregation for numerical and categorical key variables based on a distance similar to the Gower Distance
importProblem

importProblem
IL_correl

Additional Information-Loss measures
mvTopCoding

Detection and winsorization of multivariate outliers
plot.localSuppression

Plots for localSuppression objects
randSample_cpp

Random Sampling
print.freqCalc

Print method for objects from class freqCalc.
read_codebook_snippet

Read and summarize a codebook file for LLM prompt
query_llm_for_roles

Query LLM to Classify Variable Roles
removeDirectID

Remove certain variables from the data set inside a sdc object.
recordSwap

Targeted Record Swapping
readMicrodata

readMicrodata
rankSwap

Rank Swapping
recordSwap_cpp

Targeted Record Swapping
setRisk_cpp

Calculate Risk
set.sdcMicroObj

set.sdcMicroObj
selectHouseholdData

Creates a household level file from a dataset with a household structure.
setLevels_cpp

Define Swap-Levels
shuffle

Shuffling and EGADP
show,sdcMicroObj-method

Show
summary.freqCalc

Summary method for objects from class freqCalc
summary.micro

Summary method for objects from class micro
valTable

Comparison of different microaggregation methods
topBotCoding

Top and Bottom Coding
modRisk

Global risk using log-linear models.
subsetMicrodata

subsetMicrodata
print.pram

Print method for objects from class pram
print.sdcMicroObj

Print and Extractor Functions for objects of class sdcMicroObj-class
riskyCells

riskyCells
report

Generate an Html-report from an sdcMicroObj
sampleDonor_cpp

Random sample for donor records
nextSdcObj

nextSdcObj
sdcApp

sdcApp
query_llm

Query an LLM (provider-agnostic)
print.suda2

Print method for objects from class suda2
sdcMicro-package

sdcMicro: Statistical Disclosure Control Methods for Anonymization of Data and Risk Estimation
sdcMicroObj-class

Class "sdcMicroObj"
testdata

A real-world data set on household income and expenditures
summary.pram

Summary method for objects from class pram
suda2

Suda2: Detecting Special Uniques
writeSafeFile

writeSafeFile
varToFactor

Change the a keyVariable of an object of class sdcMicroObj-class from Numeric to Factor or from Factor to Numeric
AI_applyAnonymization

AI_applyAnonymization: Automatically apply anonymization strategy using LLM
AI_createSdcObj

Create an sdcMicro Object with LLM Assistance
LocalRecProg

Local recoding via Edmond's maximum weighted matching algorithm
argus_microaggregation

argus_microaggregation
createDat

Dummy Dataset for Record Swapping
createNewID

Creates new randomized IDs
addNoise

Adding noise to perturb data
build_anonymization_prompt

Build anonymization strategy prompt from summary info
CASCrefmicrodata

Census data set
EIA

EIA data set
addGhostVars

addGhostVars
ai_utility_score

Combined utility score for anonymization quality
Tarragona

Tarragona data set
build_llm_prompt

Build LLM Prompt for Variable Role Classification
calcRisks

Recompute Risk and Frequencies for a sdcMicroObj
build_refinement_prompt

Build prompt for refinement iteration