Learn R Programming

doppelgangR (version 1.0.2)

Identify likely duplicate samples from genomic or meta-data

Description

The main function is doppelgangR(), which takes as minimal input a list of ExpressionSet object, and searches all list pairs for duplicated samples. The search is based on the genomic data (exprs(eset)), phenotype/clinical data (pData(eset)), and "smoking guns" - supposedly unique identifiers found in pData(eset).

Copy Link

Version

Version

1.0.2

License

GPL (>=2.0)

Issues

Pull Requests

Stars

Forks

Repository

https://github.com/lwaldron/doppelgangR

Maintainer

Levi Waldron

Last Published

February 15th, 2017

Functions in doppelgangR (1.0.2)

DoppelGang-class

DoppelGang S4 class

Histograms of all pairwise sample correlations, showing identified doppelgangers.

Print a DoppelGang object

Skew-t Distribution

Calculate distance between two vectors, rows of one matrix/dataframe, or rows of two matrices/dataframes.

summary-methods

Summarizes a DoppelGang object

Identifies outliers in a similarity matrix.

vectorWeightedDist

Calculate a weighted distance between two vectors, using pairwise complete observations.

Maximum likelihood estimation for a (multivariate) skew-t distribution

Calculate pair-wise correlations between samples using the expr() slots of a list of two ExpressionSets.

vectorHammingDist

Calculate Hamming Distance between two vectors, using pairwise complete observations.

Calculate pairwise similarities of phenoData between samples for a list containing two ExpressionSets

doppelgangR-package

Identify likely duplicate samples from genomic or meta-data

Show a DoppelGang object

smokingGunFinder

Find doppelgangers based on "smoking gun" phenotypes - those that should be unique to each patient.