assignMutations(dm, finalSPs, max_PM=6, peakselection='localsum')
clusterCellFrequencies.
cellfrequency_pdf.
cellfrequency_pdf
).cellfrequency_pdf
):
1. $x:=s$ --> Separate fit of SNV and CNV. CNV does not influence ploidy of the SNV, either because CNV occurs before SNV or because SNV and CNV occur independently from each other (i.e. they are never co-propagated during the same clonal expansion)
2. $x:=p$ --> Partial dependency of SNV ploidy on CNV. The SNV is propagated during the expansion of $C$. Subsequently, the CNV is propagated during a clonal expansion of a cell-member of $C$.
3. $x:=j$ --> Joint fit of SNV and CNV, assuming they co-occur together in the same cell and are propagated during the exact same clonal expansion.
If peakselection is set to 'maximum', then the SNV is assigned to subpopulation:
$C:=argmax_C (P_s(f_C), P_p(f_C), P_j(f_C))$.
If peakselection is set to 'localsum', then the SNV is assigned to subpopulation:
$C:=argmax_C (L(P_s, f_C), L(P_p, f_C), L(P_j, f_C))$,where:
$L(P,f_C):= \sum_{f \in peak(f_C)} P(f)$ calculates the sum of all probabilities in individual peaks.
The mutated loci assigned to each subpopulation cluster represent the genetic profile of each predicted subpopulation.
The assignment between subpopulation $C$ and locus $l$ only implies that the SNV at $l$ has been first propagated during the clonal expansion that gave rise to $C$. So SNVs present in $C$ may not be exclusive to $C$ but may also be present in subpopulations smaller than $C$. Whether or not this is the case can sometimes be inferred from the phylogenetic structure of the subpopulation composition. See also buildPhylo
.clusterCellFrequencies