Learn R Programming

expands (version 1.7.1)

buildMultiSamplePhylo: Relations between inter- and intra-sample subpopulations

Description

Predicts phylogenetic relations between subpopulations from subpopulation specific copy number and point mutation profiles, while including information about sample origin of each subpopulation. This function differs from buildPhylo in that it integrates the subpoulations predicted in multiple, geographically distinct tumor-samples into one common phylogeny and in that it includes point mutations in addition to copy number variations to infer inter-sample phylogenetic relations.

Usage

buildMultiSamplePhylo(samGr,out,treeAlgorithm="bionjs",ambigSg=F,plotF=1,spRes=1)

Arguments

samGr
List with three fields: cbs - Matrix in which each row corresponds to a copy number segment. CBS is typically the output of a circular binary segmentation algorithm. Columns in CBS must be labeled and must include chr, startpos, endpos and CN_Estim
out
Prefix of file to which multi-sample phylogeny will be saved.
treeAlgorithm
Neighbor joining algorithm used for phylogeny reconstruction (from library ape). Options: bionjs (default), njs.
ambigSg
Input parameter "keepAmbigSeg" for called function: assignQuantityToSP.
plotF
Option for displaying a visual representation of the phylogenetic tree (0 - no display; 1 - display). Default: 1.
spRes
Option on whether or not to ignore the subpopulations calculated for each sample and instead treat every geographical tumor-sample as one single tumor-metapopulation (Default value: 1 - subpopulation resolution; 0 - metapopulation resolution).

Value

  • An object of class "phylo" (library ape).

Details

Reconstructs phylogenetic relationships between subpopulations using neighbor-joining algorithms provided by R-package 'ape'. Pairwise distances between subpopulations $i$ and $j$ are calculated as: $d_{ij}:=(cnv_{i=j} + snv_{i=j})/(cnv_{ij}+snv_{ij})$, where $cnv_{i=j}$ is the number of copy number segments for which subpopulations $i$ and $j$ have the same copy number; $snv_{i=j}$ is the number of point mutations for which subpopulations $i$ and $j$ have the same mutation status and $cnv_{ij}, snv_{ij}$ are the total number of copy number segments and mutations respectively, for which both subpopulations have available information. Subpopulations with insufficient ploidy and point mutations information are excluded from phylogeny.

See Also

buildPhylo