data.frame
and produces misclassifed data.
Probabilities for the missclassification are given in the mc.matrix
.misclass(data.org, mc.matrix, k)
data.frame
containing the factor variabels. Must be factors
.list
of matrices giving the probabilities for the misclassification.
Names of the list
must correspond to the variable names in data.org
.
The colnames
must be named according to the factor levelsdata.frame
containing the misclassified variablesmcsimex
, check.mc.matrix
x1 <- factor(rbinom(100,1,0.5))
x2 <- factor(rbinom(100,2,0.5))
p1 <- matrix(c(1,0,0,1), nrow = 2)
p2 <- matrix(c(0.8,0.1,0.1,0.1,0.8,0.1,0.1,0.1,0.8), nrow = 3)
colnames(p1) <- levels(x1)
colnames(p2) <- levels(x2)
x <- data.frame(x1 = x1, x2 = x2)
mc.matrix <- list(x1 = p1, x2 = p2)
x.mc <- misclass(data.org = x, mc.matrix = mc.matrix,k = 1)
identical(x[,1],x.mc[,1]) # T
identical(x[,2],x.mc[,2]) # F
Run the code above in your browser using DataLab