SemiSupervised
Controls various aspects of fitting any ‘SemiSupervised’ object.
SemiSupervised.control(normalize=TRUE,stability=NULL,k=NULL,nok=FALSE,
dissimilar=TRUE,l.eps=1e-5,l.thresh=25L,h.thresh=1e-5,U.as.anchor.thresh=600L,
U.as.anchor=TRUE,sig.est=TRUE,sig.frac=0.5,iter.max=1000L,
anchor.seed=100,sfac=5L,cn=4L,LAE.thresh=100L,LAE.eps=1e-4,
cv.fold=3L,cv.seed=100L,cv.cl=TRUE,cv.type="scv",cv.adjust=0.001)
flags whether or not the normalized or combinatorial graph operator should be used. This is not used by ‘jtharm’. Further, it has no effect on anchor graph based approaches.
stabilization parameter for necessary inverses. A NULL value allows it to be set internally (recommended) but can be set manually.
in fitting a distance graph with the ‘formula’ as y~. this will set the default k-NN graph parameter. It is not used otherwise. In the case of an anchor graph the k parameter is the number of anchors used by k-means.
flags the y~. ‘formula’ call to either fit or not fit a k-NN parameter which overrides the k argument.
the ‘s’ parameter for the LAE method.
the ‘cn’ parameter for the LAE method.
thresh hold for LAE algorithm.
convergence tolerance for LAE algorithm.
maximum number of iterations for kmeans
.
sets the seed for the kmeans
algorithm.
use an internal estimation scheme to estimate the parameter for an rbf kernel.
when ‘sig.est’ is true, the fraction of training data necessary for the computaton is used.
max iteration parameter for the underlying logistic regression algorithm when fitting classification.
threshold for stopping the underlying logistic regression algorithm when fitting classification.
if n>U.as.anchor.thresh then the anchor points are fitted as the unlabeled cases to speed up the approach. This only works in the formula call where the graph is unspecified, i.e., y~.
threshold for determining when the unlabeled are to be anchors.
minimum allowable choice for h in the grid used by CV.
the number of folds used for K-fold CV.
the seed to generate the folds for K-fold CV.
forces in classification at least two distinct responses in each fold and should be set to TRUE.
use "scv" or "cv". The "scv" is faster and should be used.
forces an inverse stabilization in "scv".