Consensus relations “synthesize” the information in the
elements of a relation ensemble into a single relation, often by
minimizing a criterion function measuring how dissimilar consensus
candidates are from the (elements of) the ensemble (the so-called
“optimization approach”), typically of the form
\(L(R) = \sum w_b d(R_b, R) ^ p\), where \(d\) is a suitable
dissimilarity measure (see relation_dissimilarity),
\(w_b\) is the case weight given to element \(R_b\) of the
ensemble, and \(p \ge 1\). Such consensus relations are called
“central relations” in R<U+00E9>gnier (1965). For \(p = 1\), we
obtain (generalized) medians; \(p = 2\) gives (generalized) means
(least squares consensus relations).
Available built-in methods are as follows. Apart from Condorcet's and
the unrestricted Manhattan and Euclidean consensus methods, these are
applicable to ensembles of endorelations only.
"Borda"the consensus method proposed by Borda (1781).
For each relation \(R_b\) and object \(x\), one determines the
Borda/Kendall scores, i.e., the number of objects \(y\) such
that \(y R_b x\). These are then aggregated across relations
by weighted averaging. Finally, objects are ordered according to
their aggregated scores. Note that this may result in a weak order
(i.e., with objects being tied).
One can enforce a linear order by setting the control parameter
L to TRUE, and obtain a relation ensemble with up to
n or all such solutions by additionally setting the control
parameter n to some positive integer or "all",
respectively.
"Copeland"the consensus method proposed by Copeland
(1951). For each relation \(R_b\) and object \(x\), one
determines the Copeland scores, i.e., the number of objects
\(y\) such that \(y R_b x\), minus the number of objects
\(y\) such that \(x R_b y\). Like the Borda method, these are
then aggregated across relations by weighted averaging. Finally,
objects are ordered according to their aggregated scores.
Note that this may result in a weak order
(i.e., with objects being tied).
One can enforce a linear order by setting the control parameter
L to TRUE, and obtain a relation ensemble with up to
n or all such solutions by additionally setting the control
parameter n to some positive integer or "all",
respectively.
"Condorcet"the consensus method proposed by Condorcet
(1785). For a given ensemble of crisp relations, this minimizes
the criterion function \(L\) with \(d\) as symmetric
difference distance and \(p = 1\) over all possible crisp
relations. In the case of endorelations, consensus is obtained by
weighting voting, such that \(x R y\) if the weighted number of
times that \(x R_b y\) is no less than the weighted number of
times that this is not the case. Even when aggregating linear
orders, this can lead to intransitive consensus solutions
(“effet Condorcet”).
One can obtain a relation ensemble with up to n or all such
solutions consensus relations by setting the control parameter
n to some positive integer or "all", respectively.
"CS"the consensus method of Cook and Seiford (1978)
which determines a linear order minimizing the criterion function
\(L\) with \(d\) as generalized Cook-Seiford (ranking)
distance and \(p = 1\) via solving a linear sum assignment
problem.
One can obtain a relation ensemble with up to n or all such
consensus relations by setting the control parameter n to
some positive integer or "all", respectively.
"SD/F"an exact solver for determining the
consensus relation of an ensemble of crisp endorelations by
minimizing the criterion function \(L\) with \(d\) as
symmetric difference distance (“SD”) and \(p = 1\) over a
suitable class (“Family”) of crisp endorelations as
indicated by F, with values:
Ggeneral (crisp) endorelations.
Aantisymmetric relations.
Ccomplete relations.
Eequivalence relations: reflexive, symmetric, and
transitive.
Llinear orders: complete, reflexive,
antisymmetric, and transitive.
Mmatches: complete and reflexive.
Opartial orders: reflexive, antisymmetric and
transitive.
Ssymmetric relations.
Ttournaments: complete, irreflexive and
antisymmetric (i.e., complete and asymmetric).
Wweak orders (complete preorders, preferences,
“orderings”): complete, reflexive and transitive.
preorderpreorders: reflexive and transitive.
transitivetransitive relations.
Consensus relations are determined by reformulating the consensus
problem as a binary program (for the relation incidences), see
Hornik and Meyer (2007) for details. The solver employed can be
specified via the control argument solver, with currently
possible values "glpk", "lpsolve", "symphony"
or "cplex" or a unique abbreviation thereof, specifying to
use the solvers from packages Rglpk (default),
lpSolve, Rsymphony, or Rcplex, respectively.
Unless control option sparse is false, a sparse formulation
of the binary program is used, which is typically more efficient.For fitting equivalences and weak orders (cases E and
W) it is possible to specify the number of classes \(k\)
using the control parameter k. For fitting weak orders,
one can also specify the number of elements in the classes via
control parameter l.
Additional constraints on the incidences of the consensus solution
can be given via the control parameter constraints, in the
form of a 3-column matrix whose rows give row and column indices
\(i\) and \(j\) and the corresponding incidence \(I_{ij}\).
(I.e., incidences can be constrained to be zero or one on an
object by object basis.)
One can obtain a relation ensemble with up to n or all such
consensus relations by setting the control parameter n to
some positive integer or "all", respectively.
(See the examples.)
"manhattan"the (unrestricted) median of the
ensemble, minimizing \(L\) with \(d\) as Manhattan (symmetric
difference) distance and \(p = 1\) over all (possibly fuzzy)
relations.
"euclidean"the (unrestricted) mean of the ensemble,
minimizing \(L\) with \(d\) as Euclidean distance and
\(p = 2\) over all (possibly fuzzy) relations.
"euclidean/F"an exact solver for determining
the restricted least squares Euclidean consensus relation of an
ensemble of endorelations by minimizing the criterion function
\(L\) with \(d\) as Euclidean difference distance and
\(p = 2\) over a suitable family of crisp endorelations as
indicated by F, with available families and control
parameters as for methods "SD/F".
"majority"a generalized majority method for which the
consensus relation contains of all tuples occurring with a
relative frequency of more than \(100 p\) percent (of 100
percent if \(p = 1\)). The fraction \(p\) can be specified
via the control parameter p. By default, \(p = 1/2\) is
used.
"CKS/F"an exact solver for determining the
consensus relation of an ensemble of crisp endorelations by
minimizing the criterion function \(L\) with \(d\) as
Cook-Kress-Seiford distance (“CKS”) and \(p = 1\) over a
suitable class (“Family”) of crisp endorelations as
indicated by F, with available families and control
parameters as for methods "SD/F".
One can obtain a relation ensemble with up to n or all such
consensus relations by setting the control parameter n to
some positive integer or "all", respectively.