Learn R Programming

⚠️There's a newer version (5.7.8) of this package.Take me there.

sdcMicro

sdcMicro is an R-package to anonymize microdata. Most functionalities of the package are also available via an interactive shiny-based graphical user interface.

Online documentation

Copy Link

Version

Install

install.packages('sdcMicro')

Monthly Downloads

685

Version

5.5.1

License

GPL-2

Issues

Pull Requests

Stars

Forks

Maintainer

Matthias Templ

Last Published

February 11th, 2020

Functions in sdcMicro (5.5.1)

freq

Freq
kAnon_violations

kAnon_violations
LocalRecProg

Local recoding via Edmond's maximum weighted matching algorithm
free1

Demo data set from mu-Argus
CASCrefmicrodata

Census data set
dataGen

Fast generation of synthetic data
dRisk

overal disclosure risk
mergeHouseholdData

Replaces the raw household-level data with the anonymized household-level data in the full dataset for anonymization of data with a household structure (or other hierarchical structure). Requires a matching household ID in both files.
nextSdcObj

nextSdcObj
importProblem

importProblem
casc1

Small Artificial Data set
addGhostVars

addGhostVars
dUtility

data utility
createNewID

Creates new randomized IDs
microData

microData
groupAndRename

indivRisk

Individual Risk computation
mvTopCoding

Detection and winsorization of multivariate outliers
modRisk

Global risk using log-linear models.
extractManipData

Remove certain variables from the data set inside a sdc object.
dRiskRMD

RMD based disclosure risk
localSupp

Local Suppression
localSuppression

Local Suppression to obtain k-anonymity
EIA

EIA data set
plot.sdcMicroObj

selectHouseholdData

Creates a household level file from a dataset with a household structure.
freqCalc

Frequencies calculation for risk estimation
get.sdcMicroObj

get.sdcMicroObj
print.sdcMicroObj

print.pram

Print method for objects from class pram
plot.localSuppression

plot method for localSuppression objects
readMicrodata

readMicrodata
francdat

data from the casc project
print.freqCalc

Print method for objects from class freqCalc.
pram

Post Randomization
testdata

A real-world data set on household income and expenditures
measure_risk

Disclosure Risk for Categorical Variables
set.sdcMicroObj

set.sdcMicroObj
print.micro

Print method for objects from class micro
globalRecode

Global Recoding
sdcMicroObj-class

Class "sdcMicroObj"
rankSwap

Rank Swapping
show,sdcMicroObj-method

Show
removeDirectID

Remove certain variables from the data set inside a sdc object.
generateStrata

Generate one strata variable from multiple factors
topBotCoding

Top and Bottom Coding
print.suda2

Print method for objects from class suda2
mafast

Fast and Simple Microaggregation
microaggregation

Microaggregation
plotMicro

Comparison plots
suda2

Suda2: Detecting Special Uniques
microaggrGower

Microaggregation for numerical and categorical key variables based on a distance similar to the Gower Distance
print.modrisk

Print method for objects from class modrisk
report

Generate an Html-report from an sdcMicroObj
riskyCells

riskyCells
sdcApp

sdcApp
print.indivRisk

Print method for objects from class indivRisk
subsetMicrodata

subsetMicrodata
summary.freqCalc

Summary method for objects from class freqCalc
sdcMicro-package

Statistical Disclosure Control (SDC) for the generation of protected microdata for researchers and for public use.
shuffle

Shuffling and EGADP
writeSafeFile

writeSafeFile
summary.pram

Summary method for objects from class pram
valTable

Comparison of different microaggregation methods
summary.micro

Summary method for objects from class micro
print.localSuppression

Print method for objects from class localSuppression
varToFactor

Change the a keyVariable of an object of class sdcMicroObj-class from Numeric to Factor or from Factor to Numeric
Tarragona

Tarragona data set
argus_rankswap

argus_rankswap
addNoise

Adding noise to perturb data
calcRisks

Recompute Risk and Frequencies for a sdcMicroObj
argus_microaggregation

argus_microaggregation
LLmodGlobalRisk

Global risk using log-linear models.