Learn R Programming

⚠️There's a newer version (0.1-34) of this package.Take me there.

phyclust

  • License:
  • Download:
  • Author: Wei-Chen Chen and Karin Dorman

Phylogenetic clustering (phyloclustering) is an evolutionary Continuous Time Markov Chain model-based approach to identify population structure from molecular data without assuming linkage equilibrium. The package phyclust (Chen 2011) provides a convenient implementation of phyloclustering for DNA and SNP data, capable of clustering individuals into subpopulations and identifying molecular sequences representative of those subpopulations. It is designed in C for performance, interfaced with R for visualization, and incorporates other popular open source programs including ms (Hudson 2002), seq-gen (Rambaut and Grassly 1997), Hap-Clustering (Tzeng 2005) and PAML baseml (Yang 1997, 2007), for simulating data, additional analyses, and searching the best tree. See the phyclust website for more information, documentations and examples.

Installation

phyclust requires

  • R version 3.0.0 or higher.
  • R package ape.

The package can be installed from the CRAN via the usual install.packages("phyclust"), or via the devtools package:

library(devtools)
install_github("snoweye/phyclust")

Copyright

See phyclust/inst/Documents/ for files in src/msdir/, src/seq-gen/, src/paml_baseml, and R/ttzeng-*.r.

Copy Link

Version

Install

install.packages('phyclust')

Monthly Downloads

1,620

Version

0.1-22

License

GPL (>= 2)

Maintainer

Wei-Chen Chen

Last Published

December 2nd, 2017

Functions in phyclust (0.1-22)

.code.type

Code Types of Dataset and Substitution Models
.boundary.method

Boundary Methods for Population Proportions
.identifier

Identifiers for Evolution Models
.init.method

Initialization Methods for EM Algorithms
.edist.model

Evolution Distance Model
phyclust-package

Phyloclustering -- Phylogenetic Clustering
.Color

Colors for Identifying Clusters in Plots
.EMC

EM Control
.em.method

EM Methods and Algorithms
.se.model

Sequencing Error Model
.EMControl

EM Control Generator
.show.option

Show Available Options
.substitution.model

Substitution Models for Mutation Processes
data.phylip.pony

Great Pony 524 EIAV rev Dataset in the phylip Format
bootstrap.seq

Bootstrap Sequences from a Fitted Model and Star Tree.
data.fasta.pony

Great Pony 625 EIAV rev Dataset in the Fasta Format
bootstrap.star.trees

Bootstrap a Star Tree from a Fitted Model.
file.write

Write Data to Files by Formats
code2nid

Transfer Codes (A, G, C, T, -) and nids (0, 1, 2, 3, 4)
bootstrap.seq.data

Bootstrap a seq.data from a Fitted Model.
code2snp

Transfer Nucleotide Codes / nids and SNPs / sids
find.best

Find the Best Solution of phyclust
.init.procedure

Initialization Procedures for EM Algorithms
gen.unit.K

Generate Comprehensive Trees.
.label.method

Label Method
get.rooted.tree.height

Get a Rooted Tree Height
standard.code

Standard Codes and ids for Nucleotides, SNPs, Codon, Amino Acid and Genetic Code
find.consensus

Find the Consensus Sequence
RRand

Rand Index and Adjusted Rand Index
gen.equal.star.anc.dec

Generate Comprehensive Trees.
as.star.tree

Coerce a Rooted Tree to a Star Tree in Class phylo
ms

Generating Samples under a Wright-Fisher Neutral Model of Genetic Variation
data.phylip.crohn

Crohn's Disease SNP Dataset in the phylip Format
nid.aid.cid

Transfer nids (0, 1, ..., 4) , aids (0, 1, ..., 21) and cids (0, 1, ..., 64)
file.read

Read Data from Files by Formats and Return a seq.data Object
paml.baseml

Phylogenetic Analysis by Maximum Likelihood for Nucleotide Sequences
phyclust.Pt

Transition Probabilities of phyclust Given Time
gen.seq

Generate Sequences Given a Rooted Tree.
phyclust

The Main Function of phyclust
gen.star.tree

Generate a Rooted Tree with a Star Shape
phyclust.e.step

One E-Step of phyclust
bootstrap.star.trees.seq

Bootstrap Sequences from a Fitted Model.
plotnj

Plot an Unrooted Trees.
getcut.fun

Tzeng's Method: Finding the Best Number of Clusters
plotstruct

Struct Plots of Observations Based on Posterior Probabilities
haplo.post.prob

Tzeng's Method: Haplotype Grouping for SNP Sequences
read.seqgen

Read seqgen's Results and Return a seq.data
rescale.rooted.tree

Rescale a Rooted Tree's Height
phyclust.logL

Log-Likelihood of phyclust
all.internal.tzeng

All Internal Functions of Tzeng's Methods
phyclust.m.step

One M-Step of phyclust
print.object

Functions for Printing or Summarizing Objects According to Classes
phyclust.edist

Evolution Distance of Sequences
prune.Mu

Prune the Center Sequences Mu
phyclust.em.step

One EM-step of phyclust
seq.data

A Toy Dataset in Class seq.data
phyclust.update

Update phyclust Results
seqgen

Seq-Gen
plotdots

Dots Plots of Sequences for Visual Comparisons
snp2sid

Transfer SNP codes (1, 2, -) and sids (0, 1, 2)
all.internal

All Internal Functions of phyclust
phyclust.se

The Main Function of phyclust for Sequencing Error Models
plothist

Plot Histogram to Compare Number of Mutations.
phyclust.se.update

Update phyclust Results by the Sequencing Error Model
plotgaps

Gaps Plots of Sequences for Visual Comparisons