Optimizes model parameters under the HMM SELON model
SelonHMMOptimize(nuc.data.path, n.partitions = NULL, phy,
edge.length = "optimize", edge.linked = TRUE, nuc.model = "GTR",
global.nucleotide.model = TRUE, diploid = TRUE, verbose = FALSE,
n.cores = 1, max.tol = .Machine$double.eps^0.5, max.evals = 1e+06,
cycle.stage = 12, max.restarts = 10, output.by.restart = TRUE,
output.restart.filename = "restartResult", fasta.rows.to.keep = NULL)
Provides the path to the directory containing the gene specific fasta files that contains the nucleotide data.
The number of partitions to analyze. The order is based on the Unix order of the fasta files in the directory.
The phylogenetic tree to optimize the model parameters.
Indicates whether or not edge lengths should be optimized. By default it is set to "optimize", other option is "fixed", which user-supplied branch lengths.
A logical indicating whether or not edge lengths should be optimized separately for each gene. By default, a single set of each lengths is optimized for all genes.
Indicates what type nucleotide model to use. There are three options: "JC", "GTR", or "UNREST".
assumes nucleotide model is shared among all partitions
A logical indicating whether or not the organism is diploid or not.
Logical indicating whether each iteration be printed to the screen.
The number of cores to run the analyses over.
Supplies the relative optimization tolerance.
Supplies the max number of iterations tried during optimization.
Specifies the number of cycles per restart. Default is 12.
Supplies the number of random restarts.
Logical indicating whether or not each restart is saved to a file. Default is TRUE.
Designates the file name for each random restart.
Indicates which rows to remove in the input fasta files.
SELON stands for SELection On Nucleotides. This function takes a user supplied topology and a set of fasta formatted sequences and optimizes the parameters in the SELON model. Selection is based on selection towards an optimal nucleotide at each site, which is based simply on the majority rule of the observed data. The strength of selection is then varied along sites based on a Taylor series, which scales the substitution rates. Still a work in development, but so far, seems very promising.