eqmcc: Minimization with Enhanced Quine-McCluskey Algorithm

Description

Being the core of the QCA package, this function performs the minimization of Boolean or multivalent output functions. It is called "eqmcc" because it is an enhancement of the classical Quine-McCluskey minimization algorithm.

Usage

eqmcc(data, outcome = c(""), neg.out = FALSE, conditions = c(""), 
      relation = "suf", n.cut = 1, incl.cut1 = 1, incl.cut0 = 1, 
      explain = c("1"), include = c(""), row.dom = FALSE, 
      all.sol = FALSE, omit = c(), dir.exp = c(), details = FALSE, 
      show.cases = FALSE, use.tilde = FALSE, use.letters = FALSE, 
      inf.test = c(""), ...)

is.qca(x)

Arguments

data

A truth table object or a data set of bivalent crisp or fuzzy set variables or multivalent crisp set variables.

outcome

A vector of outcome sets.

neg.out

Logical, use negation of outcome set (ignored if data is a truth table object).

conditions

A vector of condition sets.

relation

The relation between the condition and the outcome sets.

n.cut

The minimum number of cases with set membership score above 0.5 for an output function value of "0", "1" or "C".

incl.cut1

The minimum sufficiency inclusion score for an output function value of "1".

incl.cut0

The maximum sufficiency inclusion score for an output function value of "0".

explain

A vector of output function values to be explained.

include

A vector of additional output function values to be included in the minimization.

row.dom

Logical, apply row dominance to eliminate dominated inessential prime implicants.

all.sol

Logical, derive all possible (non-overlapping) solutions, irrespective of complexity.

omit

A vector of configuration index values or matrix of configurations to be omitted from minimization.

dir.exp

A vector of directional expectations for deriving intermediate solutions.

details

Logical, present solution details.

show.cases

Logical, also print case names if details = TRUE.

use.tilde

Logical, use tilde for negation with bivalent variables.

use.letters

Logical, use simple letters instead of original variable names.

inf.test

A vector specifying the inference-statistical test to be performed (currently only "binom") and the critical significance level.

...

Other arguments (for backward compatibility).

An object of class "qca".

Value

An object of class "qca" for single outcomes, and "mqca" for multiple outcomes. Objects of class "qca" are invisible lists with the following components:
ttThe truth table object.
excludedThe line number(s) of the negative configuration(s).
initialsThe initial positive configuration(s).
PIsThe prime implicant(s).
PIchartA list containing the PI chart(s).
solutionA list of solution(s).
essentialA list of essential PI(s).
pimsA list of PI membership scores.
SAA list of simplifying assumptions.
i.solA list of components specific to intermediate solution(s), including the prime implicant chart, prime implicant membership scores, (non-simplifying) easy counterfactuals and difficult counterfactuals.

Details

The argument data can be a truth table object (an object of class "tt" returned by truthTable()) or a suitable data set. Suitable data sets have the following structure: values of 0 and 1 for bivalent crisp set variables, values between 0 and 1 for bivalent fuzzy set variables, and values beginning with 0 at increments of 1 for multivalent crisp set variables. The placeholder "-" indicates a "don't care" value in auxiliary condition variables that specify temporal order between other substantive condition variables in tQCA. These values lead to the exclusion of the auxiliary condition variable from the computation of parameters of fit. The value/s to the argument outcome can be specified in the notation for multivalent crisp set variables using curly brackets or traditional notation for bivalent crisp set variables. Possible outcomes can be single sets not simultaneously passed to conditions, or any subset of the sets specified in conditions if data is not a truth table object. In this case, truth tables will not be printed. The argument outcome specifies the outcome(s) to be explained. If multiple outcomes are specified, they must also be specified in the conditions. In this case, truth tables and solution details will not be printed by default (see the example on mimicing Coincidence Analysis below). Multivalent outcomes require curly-bracket notation (X{value}). At least one outcome has to be specified. The logical argument neg.out controls whether outcome is to be explained or its negation. If outcome is from a multivalent variable, neg.out = TRUE has the effect that the union of all remaining values becomes the new outcome to be explained. The argument conditions specifies the condition variables. If omitted, all variables in data are used except outcome for single outcomes. With multiple outcomes, all variables in data are used. The argument relation specifies the relation between conditions and outcome to be analyzed. It accepts either suf or sufnec. If relation = "suf" (default), sufficiency relations are identified as indicated by a right arrow (=>). If the solution also proves to be necessary, this will be indicated by a double arrow (<=>). If relation = "sufnec", solutions must be both sufficient and necessary to be identified. The argument incl.cut1 then acts as the cut-off for the sufficiency inclusion of a configuration as well as the necessity inclusion of the final solution. Configurations that contain fewer than n.cut cases with membership scores above 0.5 are coded as logical remainders (OUT = "?"). If the number of such cases is at least n.cut, configurations with an inclusion score of at least incl.cut1 are coded positive (OUT = "1"), configurations with an inclusion score below incl.cut1 but with at least incl.cut0 are coded as a contradiction (OUT = "C"), and configurations with an inclusion score below incl.cut0 are coded negative (OUT = "0"). If incl.cut0 is unspecified, it is set equal to incl.cut1 and no contradictions can occur. The argument explain specifies a vector of suitable values of the output function to be minimized. Vectors of such values are "1", "C", "0", c("1", "C") and c("0", "C"), but not c("1", "0") and c("1", "0", "C"). Note that for "0", "C" and c("0", "C"), configurations will be reduced but no solution details printed. The argument include specifies a vector of suitable values of the output function to be included in addition to the value(s) specified in explain. All combinations allowed separately for explain are also allowed for include in combination with explain. The logical argument row.dom controls the application of the principle of row dominance. An inessential prime implicant $P_{1}$ dominates another $P_{2}$ if all configurations covered by $P_{2}$ are also covered by $P_{1}$, but they are not interchangeable (cf. McCluskey 1956, p. 1425). Given a minimum number of prime implicants to complete a solution, the complexity of each prime implicant is disregarded. Users should be aware that if row dominance is applied, certain ambiguities in the data will be invisible. The argument is therefore primarily intended for demonstration purposes only. For more details on the application of row dominance in other QCA software, see Thiem and Dusa (2013). The argument all.sol solves the prime implicant chart by deriving all non-redundant solutions, irrespective of individual prime implicant or overall solution complexity. If all.sol = TRUE is used in conjunction with row.dom = FALSE, all ambiguities in the data will be revealed. Extreme ambiguities may indicate inappropriate data. The argument omit can be used to omit any configuration (positive, negative or remainder) from the minimization process. It accepts a vector of row numbers from the truth table or a matrix of configurations of the same order of conditions as passed to truthTable() (if eqmcc() is passed a truth table object) or as specified in the argument conditions. The dir.exp argument specifies directional expectations for separating easy from difficult counterfactuals in simplifying assumptions. For bivalent crisp and fuzzy sets, expectations should be given as a vector of the same length and in the same order of condition variables as provided in conditions. A value of "1" indicates that a condition is expected to contribute to a positive output function value, "0" that its negation is expected to contribute to a positive output function value, and "-" indicates no directional expectation. In the case of multivalent condition variables, directional expectations require a value specification for each condition. Multiple values for the same condition variable have to be enclosed by double quotes and separated by a semicolon (see mvQCA example using Hartmann and Kemmerzell (2010) below). In some situations, directional expectations in mvQCA generate easy counterfactuals that do not contribute to parsimony. These non-simplifying easy counterfactuals will not be part of the solution (see mvQCA example using Sager and Andereggen (2012) below). If details = TRUE, paramters of fit (inclusion, raw coverage, and unique coverage) will be printed for each solution and its respective prime implicants. Essential prime implicants are listed first in the solution output and in the top part of the parameters-of-fit table. Inessential prime implicants are listed in brackets in the solution output and in the middle part of the parameters-of-fit table, together with their unique coverage scores under each individual solution. Inclusion and coverage scores for each solution are provided in the bottom part of the parameters-of-fit table. The logical argument show.cases controls whether case names are displayed next to their corresponding truth table configurations and/or prime implicants (do not use with many cases and/or long case names). In the parameters-of-fit table, semicolons indicate that cases come from different truth table configurations, whereas commas indicate cases from the same configuration. If the conditions have already been named with single letters, the argument use.letters will have no effect. The argument inf.test provides functionality for basing output function value codings on inference-statistical tests of the observed configurations for (bivalent and multivalent) crisp set variables. It requires a vector of length two, comprising the test (currently only exact binomial: "binom") and a significance level. If the empirical inclusion score of a configuration is not significantly lower than incl.cut1, it will be coded positive (OUT = "1"). If it is significantly lower than incl.cut1 but significantly higher than incl.cut0, it will be coded as a contradiction (OUT = "C"). If it is not significantly higher than incl.cut0, it will be coded negative (OUT = "0").

References

Baumgartner, Michael. 2009. Inferring Causal Complexity. Sociological Methods & Research 38 (1):71-101. Emmenegger, Patrick. 2011. Job Security Regulations in Western Democracies: A Fuzzy Set Analysis. European Journal of Political Research 50 (3):336-64. Hartmann, Christof, and Joerg Kemmerzell. 2010. Understanding Variations in Party Bans in Africa. Democratization 17 (4):642-65. McCluskey, Edward J. 1956. Minimization of Boolean Functions. Bell Systems Technical Journal 35 (6):1417-44. Krook, Mona Lena. 2010. Women's Representation in Parliament: A Qualitative Comparative Analysis. Political Studies 58 (5):886-908. Ragin, Charles C., and Sarah Ilene Strand. 2008. Using Qualitative Comparative Analysis to Study Causal Order: Comment on Caren and Panofsky (2005). Sociological Methods & Research 36 (4):431-41. Thiem, Alrik, and Adrian Dusa. 2013. Boolean Minimization in Social Science Research: A Review of Current Software for Qualitative Comparative Analysis (QCA). Social Science Computer Review 31 (4):505-21.

Examples

Run this code

# csQCA using Krook (2010)
#-------------------------
data(d.Kro)
head(d.Kro)

# conservative solution
eqmcc(d.Kro, outcome = "WNP")

# negated outcome, conservative solution
eqmcc(d.Kro, outcome = "WNP", neg.out = TRUE)

# parsimonious solution with details and case names
Kro.sp <- eqmcc(d.Kro, outcome = "WNP", include = "?", 
  details = TRUE, show.cases = TRUE)
Kro.sp

# check PI chart
Kro.sp$PIchart

# simplifying assumptions (SAs)
Kro.sp$SA

# minimized expressions for SAs using fake outcome
for(i in 1:2){
  print(eqmcc(cbind(Kro.sp$SA[[i]], O = 1), outcome = "O"))
}  
  
# conservative solution with truth table object
Kro.tt <- truthTable(d.Kro, outcome = "WNP")
Kro.sc <- eqmcc(Kro.tt)
Kro.sc

# fsQCA using Emmenegger (2011)
#------------------------------
data(d.Emm)
head(d.Emm)

# parsimonious solution with details
eqmcc(d.Emm, outcome = "JSR", incl.cut1 = 0.9, include = "?", 
  details = TRUE)

# intermediate solution
Emm.si <- eqmcc(d.Emm, outcome = "JSR", incl.cut1 = 0.9, 
  include = "?", dir.exp = c(1,1,1,1,1,0), details = TRUE)
Emm.si

# are the prime implicants also sufficient for the negation of the outcome?
pof(Emm.si$i.sol$C1P1$pims, outcome = "JSR", d.Emm, neg.out = TRUE,
  relation = "suf")

# check PI chart for intermediate solution;
# C1P1: first conservative and first parsimonious solution
Emm.si$i.sol$C1P1$PIchart

# same intermediate solution, but not same SAs
identical(rownames(Emm.si$SA$S1), rownames(Emm.si$SA$S2))

# check easy counterfactuals; same
(EC1 <- Emm.si$i.sol$C1P1$EC)
(EC2 <- Emm.si$i.sol$C1P2$EC)
identical(rownames(EC1), rownames(EC2))

# minimized expressions for ECs using fake outcome
eqmcc(cbind(Emm.si$i.sol$C1P1$EC, O = 1), outcome = "O")

# plot all four prime implicants of the intermediate solution
PIsc <- Emm.si$i.sol$C1P1$pims
par(mfrow = c(2, 2))
for(i in 1:4){
 plot(PIsc[, i], d.Emm$JSR, pch = 19, ylab = "JSR",
  xlab = names(PIsc)[i], xlim = c(0, 1), ylim = c(0, 1),
  main = paste("Prime Implicant", print(i)))
 mtext(paste(
  "Inclusion = ", round(Emm.si$i.sol$C1P1$IC$incl.cov$incl[i], 3),
  "; Coverage = ", round(Emm.si$i.sol$C1P1$IC$incl.cov$cov.r[i], 3)), 
  cex = 0.7, line = 0.4)
 abline(h = 0.5, lty = 2, col = gray(0.5))
 abline(v = 0.5, lty = 2, col = gray(0.5))
 abline(0, 1)
}

# mvQCA using Hartmann and Kemmerzell (2010)
#-------------------------------------------
data(d.HK)
head(d.HK)

# create vector of condition variables
conds <- c("C", "F", "T", "V")

# parsimonious solution, with contradictions included
HK.sp <- eqmcc(d.HK, outcome = "PB", conditions = conds, 
  incl.cut0 = 0.4, include = c("?", "C"), details = TRUE)
HK.sp

# Venn diagram of solution S1;
# first extract PI membership scores
PIms <- HK.sp$pims

require(VennDiagram)
vennHK.suf <- venn.diagram(
 x = list(
  "PB{1}" = which(d.HK$PB == 1),
  "C{0,1}" = which(PIms[, 1] == 1 | PIms[, 2] == 1),
  "T{2}" = which(PIms[, 4] == 1),
  "T{1}*V{0}" = which(PIms[, 5] == 1)),
 filename = NULL,
 cex = 2.5, cat.cex = 2, cat.pos = c(180, 180, 0, 0),
 cat.dist = c(0.4, 0.4, 0.12, 0.12),
 fill = gray(c(0.3, 0.5, 0.7, 0.9))
)
grid.draw(vennHK.suf)

# which are the two countries in T{2} but not PB{1}?
rownames(d.HK[d.HK$T == 2 & d.HK$PB != 1, ])

# minimize contradictions (only one contradiction)
eqmcc(d.HK, outcome = "PB", conditions = conds, incl.cut0 = 0.4, 
  explain = "C")

# intermediate solution with directional expectations:
# C{1}, F{1,2}, T{2}, V contribute to OUT = 1
HK.si <- eqmcc(d.HK, outcome = "PB", conditions = conds, 
  include = "?", dir.exp = c(1, "1;2", 2, 1), details = TRUE)
HK.si

# mvQCA using Sager and Andereggen (2012)
#----------------------------------------
data(d.SA)
head(d.SA)

# directional expectation of FED{0} leads to non-simplifying
# easy counterfactual
SA.si <- eqmcc(d.SA, outcome = "ACC", conditions = names(d.SA)[1:5],
  include = "?", dir.exp = c(0,1,0,1,1), details = TRUE)
SA.si

SA.si$i.sol$C1P1$NSEC

# tQCA using Ragin and Strand (2008)
#-----------------------------------
data(d.RS)
head(d.RS)

# conservative solution with details and case names;
# auxiliary temporal order condition "EBA" automatically excluded 
# from parameters of fit
eqmcc(d.RS, outcome = "REC", details = TRUE, show.cases = TRUE)

# QCA path model ("causal chain" in CNA); data from Baumgartner (2009);
# note that CNA chains and QCA path models are not always equal
#--------------------------------------------------------------
d.Bau <- data.frame(U = c(1,1,1,1,0,0,0,0), D = c(1,1,0,0,1,1,0,0), 
  L = c(rep(1,6),0,0), G = rep(c(1,0), 4), E = c(rep(1,7),0), 
  row.names = letters[1:8])
head(d.Bau)

# with multiple outcomes, no solution details are printed 
Bau.cna <- eqmcc(d.Bau, outcome = names(d.Bau), relation = "sufnec", 
  include = "?", all.sol = TRUE)
Bau.cna

# get the truth table, solution details and case names for outcome "E"
print(Bau.cna$E, details = TRUE, show.cases = TRUE)

# QCA with multiple multivalent outcome variables
#------------------------------------------------
d.mmv <- data.frame(A = c(2,0,0,1,1,1,2,2), B = c(2,2,2,2,1,1,0,0), 
                    C = c(0,1,0,0,0,2,1,0), D = c(2,1,2,2,3,1,3,0), 
                    E = c(3,2,3,3,0,1,3,2), 
  row.names = letters[1:8])
head(d.mmv)

mmv.s <- eqmcc(d.mmv, outcome = c("D{2}", "E{3}"))
mmv.s

# use quotes with curly-brackets notation
print(mmv.s$"E{3}", details = TRUE, show.cases = TRUE)

# negation of outcome from multivalent variable is union of all other values;
# many ambiguities (10 solutions)
mmv.s <- eqmcc(d.mmv, outcome = "E{3}", neg.out = TRUE, include = "?")
mmv.s

Run the code above in your browser using DataLab