plmm_fit: PLMM fit: A function that fits a PLMM using the values returned by `plmm_prep()`

Description

PLMM fit: A function that fits a PLMM using the values returned by plmm_prep()

Usage

plmm_fit(
  prep,
  y,
  std_X_details,
  fbm_flag,
  penalty,
  gamma = 3,
  alpha = 1,
  lambda_min,
  nlambda = 100,
  lambda,
  eps = 1e-04,
  max_iter = 10000,
  init = NULL,
  dfmax = NULL,
  warn = TRUE,
  ...
)

Value

A list which includes 21 items:

y: The outcome vector used in model fitting.
std_scale_beta: The matrix of estimated coefficients on the standardized scale. Rows are predictors (with the first row being the intercept), and columns are values of lambda.
std_Xbeta: A matrix of the linear predictors on the scale of the standardized design matrix. Rows are predictors, columns are values of lambda. Note: std_Xbeta will not include rows for the intercept or for constant features.
centered_y: The centered outcome vector.
s: a vector of the non-zero eigenvalues of the relatedness matrix K (note: K is the kinship matrix for genetic/genomic data; see the article on notation for details)
U: a matrix of the eigenvectors of K associated with s
lambda: A numeric vector of the tuning parameter values used in model fitting.
penalty: A character string indicating the penalty with which the model was fit (e.g., 'MCP')
penalty_factor: A vector of indicators corresponding to each predictor, where 1 = predictor was penalized.
iter: An integer vector with the number of iterations needed in model fitting for each value of lambda
converged: A vector of logical values indicating whether the model fitting converged at each value of lambda
loss: A vector with the numeric values of the loss at each value of lambda (calculated on the ~rotated~ scale)
eta: A double between 0 and 1 representing the estimated proportion of the variance in the outcome attributable to population/correlation structure.
gamma: A numeric value indicating the tuning parameter used for the SCAD or MCP penalties. Not relevant for lasso models.
alpha: A numeric value indicating the elastic net tuning parameter.
nlambda Length of the sequence of lambda.
eps: Convergence threshold. The algorithm iterates until the RMSE for the change in linear predictors for each coefficient is less than eps
max_iter: Maximum number of iterations (total across entire path)
warn: Return warning messages for failures to converge and model saturation?
trace: If set to TRUE, inform the user of progress by announcing the beginning of each step of the modeling process
std_X: If design matrix is filebacked, the descriptor for the filebacked data is returned using bigmemory::describe().

Arguments

prep: A list as returned from plmm_prep
y: The original (not centered) outcome vector. Need this for intercept estimate
std_X_details: A list with components center (values used to center X), scale (values used to scale X), and ns (indices for nonsingular columns of X)
fbm_flag: Logical: is std_X a filebacked big.matrix object? Passed from plmm().
penalty: The penalty to be applied to the model. Either "MCP" (the default), "SCAD", or "lasso".
gamma: The tuning parameter of the MCP/SCAD penalty (see details). Default is 3 for MCP and 3.7 for SCAD.
alpha: Tuning parameter for the Mnet estimator which controls the relative contributions from the MCP/SCAD penalty and the ridge, or L2 penalty. alpha = 1 is equivalent to MCP/SCAD penalty, while alpha = 0 would be equivalent to ridge regression. However, alpha = 0 is not supported; alpha may be arbitrarily small, but not exactly 0.
lambda_min: The smallest value for lambda, as a fraction of the maximum lambda. Default is .001 if the number of observations is larger than the number of covariates and .05 otherwise.
nlambda: Length of the sequence of lambda. Default is 100.
lambda: A user-specified sequence of lambda values. By default, a sequence of values of length nlambda is computed, equally spaced on the log scale.
eps: Convergence threshold. The algorithm iterates until the RMSE for the change in linear predictors for each coefficient is less than eps. Default is 1e-4.
max_iter: Maximum number of iterations (total across entire path). Default is 10000.
init: Initial values for coefficients. Default is 0 for all columns of X.
dfmax: Maximum number of non-zero coefficients that may enter the model. Default is NULL (no maximum).
warn: Return warning messages for failures to converge and model saturation? Default is TRUE.
...: Additional arguments that can be passed to biglasso::biglasso_simple_path()