The generalized Mantel-Haenszel statistic (Somes, 1986) can be used to detect uniform differential item functioning among multiple groups,
without requiring an item response model approach (Penfield, 2001).
The Data is a matrix whose rows correspond to the subjects and columns to the items. In addition, Data can hold the vector of group membership.
If so, group indicates the column of Data which corresponds to the group membership, either by specifying its name or by giving the column number.
Otherwise, group must be a vector of same length as nrow(Data).
Missing values are allowed for item responses (not for group membership) but must be coded as NA values. They are discarded from sum-score computation.
The vector of group membership must hold at least three value, either as numeric or character. The focal groups are defined by the values of the argument
focal.names. If there is a unique focal group, then difGMH returns the output of difMH (without continuity correction).
The threshold (or cut-score) for classifying items as DIF is computed as the quantile of the chi-squared distribution with lower-tail
probability of one minus alpha and with as many degrees of freedom as the number of focal groups.
The matching criterion can be either the test score or any other continuous or discrete variable to be passed in the genMantelHaenszel function. This is specified by the match argument. By default, it takes the value "score" and the test score (i.e. raw score) is computed. The second option is to assign to match a vector of continuous or discrete numeric values, which acts as the matching criterion. Note that for consistency this vector should not belong to the Data matrix.
Item purification can be performed by setting purify to TRUE. Purification works as follows: if at least one item detected as functioning
differently at the first step of the process, then the data set of the next step consists in all items that are currently anchor (DIF free) items, plus the
tested item (if necessary). The process stops when either two successive applications of the method yield the same classifications of the items (Clauser and Mazor,
1998), or when nrIter iterations are run without obtaining two successive identical classifications. In the latter case a warning message is printed.
Adjustment for multiple comparisons is possible with the argument p.adjust.method. The latter must be an acronym of one of the available adjustment methods of the p.adjust function. According to Kim and Oshima (2013), Holm and Benjamini-Hochberg adjustments (set respectively by "Holm" and "BH") perform best for DIF purposes. See p.adjust function for further details. Note that item purification is performed on original statistics and p-values; in case of adjustment for multiple comparisons this is performed after item purification.
A pre-specified set of anchor items can be provided through the anchor argument. It must be a vector of either item names (which must match exactly the column names of Data argument) or integer values (specifying the column numbers for item identification). In case anchor items are provided, they are used to compute the test score (matching criterion), including also the tested item. None of the anchor items are tested for DIF: the output separates anchor items and tested items and DIF results are returned only for the latter. Note also that item purification is not activated when anchor items are provided (even if purify is set to TRUE). By default it is NULL so that no anchor item is specified.
The output of the difGMH, as displayed by the print.GMH function, can be stored in a text file provided that save.output is set to TRUE
(the default value FALSE does not execute the storage). In this case, the name of the text file must be given as a character string into the first component
of the output argument (default name is "out"), and the path for saving the text file can be given through the second component of output. The
default value is "default", meaning that the file will be saved in the current working directory. Any other path can be specified as a character string: see
the Examples section for an illustration.
The plot.GMH function displays the DIF statistics in a plot, with each item on the X axis. The type of point and the colour are fixed by the usual pch
and col arguments. Option number permits to display the item numbers instead. Also, the plot can be stored in a figure file, either in PDF or JPEG
format. Fixing save.plot to TRUE allows this process. The figure is defined through the components of save.options. The first two components
perform similarly as those of the output argument. The third component is the figure format, with allowed values "pdf" (default) for PDF file and
"jpeg" for JPEG file.