Learn R Programming

⚠️There's a newer version (3.0.11) of this package.Take me there.

ICAMS

In-depth Characterization and Analysis of Mutational Signatures (‘ICAMS’)

Purpose

Analysis and visualization of experimentally elucidated mutational signatures – the kind of analysis and visualization in Boot et al., “In-depth characterization of the cisplatin mutational signature in human cell lines and in esophageal and liver tumors”, Genome Research 2018, https://doi.org/10.1101/gr.230219.117. ‘ICAMS’ stands for In-depth Characterization and Analysis of Mutational Signatures. ‘ICAMS’ has functions to read in variant call files (VCFs) and to collate the corresponding catalogs of mutational spectra and to analyze and plot catalogs of mutational spectra and signatures. Handles both “counts-based” and “density-based” catalogs of mutational spectra or signatures.

Installation

Install development version of ICAMS from GitHub with the R command line:

devtools::install_github("steverozen/ICAMS")

Reference manual

https://github.com/steverozen/ICAMS/blob/master/data-raw/ICAMS_2.0.8.pdf

Copy Link

Version

Install

install.packages('ICAMS')

Monthly Downloads

563

Version

2.0.8

License

GPL-3

Issues

Pull Requests

Stars

Forks

Maintainer

Steve Rozen

Last Published

July 24th, 2019

Functions in ICAMS (2.0.8)

GetExomeKmerCounts

Generate exome k-mer abundance from a given reference genome
GetVAF

Extract the VAFs (variant allele frequencies) from a VCF file.
GetGenomeKmerCounts

Generate k-mer abundance from a given genome
CanonicalizeID

Given vectors of insertions and deletions in contexts categorize them.
CatalogRowOrder

Standard order of row names in a catalog.
ICAMS

ICAMS: In-depth Characterization and Analysis of Mutational Signatures
CreatePentanucAbundance

Create pentanucleotide abundance
CreatePPMFromSBSVCFs

Create position probability matrices (PPM) from a list of SBS vcfs
ReadBedRanges

Read chromosome and position information from a bed format file.
PlotPPM

Plot position probability matrix (PPM) for *one* sample from a Variant Call Format (VCF) file.
ReadAndSplitStrelkaSBSVCFs

Read and split Strelka SBS VCF files.
PlotCatalogToPdf

Plot catalog to a PDF file.
CreateDinucAbundance

Create dinucleotide abundance
Canonicalize1ID

Given a single insertion or deletion in context categorize it.
Canonicalize1INS

Given an insertion and its sequence context, categorize it.
CreateExomeStrandedRanges

Create exome transcriptionally stranded regions
FindMaxRepeatDel

Return the number of repeat units in which a deletion is embedded.
FindMaxRepeatIns

Return the number of repeat units in which an insertion is embedded.
AddTranscript

Add transcript information to a data frame with mutation records
Canonicalize1DEL

Given a deletion and its sequence context, categorize it.
CreateTransRanges

Create a transcript range file from the raw GFF3 File
CreateTetranucAbundance

Create tetranucleotide abundance
IsGRCh38

Test if object is BSgenome.Hsapiens.UCSC.hg38.
IsGRCh37

Test if object is BSgenome.Hsapiens.1000genome.hs37d5.
CreateOnePPMFromSBSVCF

Create position probability matrix (PPM) for *one* sample from a Variant Call Format (VCF) file.
CreateOneColSBSMatrix

Create the matrix an SBS catalog for *one* sample from an in-memory VCF.
IsGRCm38

Test if object is BSgenome.Mmusculus.UCSC.mm10.
MakeDataFrameFromMutectVCF

Read in the data lines of a Variant Call Format (VCF) file created by Mutect
SplitListOfMutectVCFs

Split each Mutect VCF into SBS, DBS, and ID VCFs (plus two VCF-like data frame with left-over rows).
RevcSBS96

Reverse complement strings that represent stranded SBSs
CheckSeqContextInVCF

Check that the sequence context information is consistent with the value of the column REF.
RemoveRangesOnBothStrand

Remove ranges that fall on both strands
PlotPPMToPdf

Plot position probability matrices (PPM) to a PDF file
ReadAndSplitMutectVCFs

Read and split Mutect VCF files.
ReadMutectVCFs

Read Mutect VCF files.
RevcDBS144

Reverse complement strings that represent stranded DBSs
ReadMutectVCF

Read in the data lines of a Variant Call Format (VCF) file created by Mutect
AddAndCheckSequenceID

Add sequence context to a data frame with ID (insertion/deletion) mutation records, and confirm that they match the given reference genome.
CollapseCatalog

"Collapse" a catalog.
as.catalog

Create a catalog from a numeric matrix or numeric data.frame.
StrelkaSBSVCFFilesToCatalogAndPlotToPdf

Create SBS and DBS catalogs from Strelka SBS VCF files and plot them to PDF
VCFsToSBSCatalogs

Create SBS catalogs from SBS VCFs
StrelkaSBSVCFFilesToCatalog

Create SBS and DBS catalogs from Strelka SBS VCF files.
VCFsToIDCatalogs

Create ID (insertion and deletion) catalog from ID VCFs
all.abundance

K-mer abundances.
AddSeqContext

Add sequence context to a data frame with mutation records
TranscriptRanges

Transcript ranges data
StopIfNrowIllegal

Stop if the number of rows in object is illegal
StopIfCatalogTypeIllegal

Stop if catalog.type is illegal.
TestMakeCatalogFromStrelkaSBSVCFs

This function is to make catalogs from the sample Strelka SBS VCF files to compare with the expected catalog information.
NormalizeGenomeArg

PlotCatalog

Plot one spectrum or signature.
CreateStrandedDinucAbundance

Create stranded dinucleotide abundance
GenerateEmptyKmerCounts

Generate an empty matrix of k-mer abundance
CreateStrandedTrinucAbundance

Create stranded trinucleotide abundance
GenerateKmer

Generate all possible k-mers of length k.
ReadStrelkaIDVCFs

Read Strelka ID (insertion and deletion) VCF files.
CreateOneColDBSMatrix

Create double base catalog for *one* sample from a Variant Call Format (VCF) file
CreateOneColIDMatrix

Create one column of the matrix for an indel catalog from *one* in-memory VCF.
ReadStrelkaSBSVCF

Read in the data lines of an SBS VCF created by Strelka version 1
FindDelMH

Return the length of microhomology at a deletion.
InferAbundance

Infer abundance given a matrix-like object and additional information.
InferClassOfCatalogForRead

Infer the class of catalog in a file.
CreateTrinucAbundance

Create trinucleotide abundance
SplitStrelkaSBSVCF

Split an in-memory Strelka VCF into SBS, DBS, and variants involving > 2 consecutive bases
StrelkaIDVCFFilesToCatalog

Create ID (indel) catalog from Strelka ID VCF files
TransformCatalog

Transform between counts and density spectrum catalogs and counts and density signature catalogs.
VCFsToDBSCatalogs

Create DBS catalogs from VCFs
StrelkaIDVCFFilesToCatalogAndPlotToPdf

Create ID (indel) catalog from Strelka ID VCF files and plot them to PDF
StandardChromName

Standardize the chromosome name annotations for a data frame.
TestMakeCatalogFromStrelkaIDVCFs

This function is to make catalogs from the sample Strelka ID VCF files to compare with the expected catalog information.
TestMakeCatalogFromMutectVCFs

This function makes catalogs from the sample Mutect VCF file and compares it with the expected catalog information.
GetSequenceKmerCounts

Generate k-mer abundance from given nucleotide sequences
ReadCatalog

Read catalog.
MakeDataFrameFromStrelkaSBSVCF

Read in the data lines of an SBS VCF created by Strelka version 1
StopIfRegionIllegal

Stop if region is illegal.
ReadDukeNUSCat192

Read a 192-channel spectra (or signature) catalog in Duke-NUS format.
ReadTranscriptRanges

Read transcript ranges and strand information from a gff3 format file. Use this one for the new, cut down gff3 file (2018 11 24)
MakeVCFDBSdf

MakeVCFDBSdf Take DBS ranges and the original VCF and generate a VCF with dinucleotide REF and ALT alleles.
ReadStrelkaSBSVCFs

Read Strelka SBS (single base substitutions) VCF files.
GetStrandedKmerCounts

Generate stranded k-mer abundance from a given genome and gene annotation file
StopIfTranscribedRegionIllegal

Stop if region is illegal for an in-transcript catalogs
MutectVCFFilesToCatalogAndPlotToPdf

Create SBS, DBS and Indel catalogs from Mutect VCF files and plot them to PDF
ReadStrelkaIDVCF

Read in the data lines of an ID VCF created by Strelka version 1
MutectVCFFilesToCatalog

Create SBS, DBS and Indel catalogs from Mutect VCF files
ReadStapleGT96SBS

Read a 96-channel spectra (or signature) catalog where rownames are e.g. "A[C>A]T"
SplitListOfStrelkaSBSVCFs

Split a list of in-memory Strelka SBS VCF into SBS, DBS, and variants involving > 2 consecutive bases
SplitOneMutectVCF

Split a mutect2 VCF into SBS, DBS, and ID VCFs, plus a list of other mutations
TCFromDenSigDen

density -> <anything> density.signature -> density.signature, counts.signature
WriteCat

Write a catalog to a file.
TCFromCouSigCou

Source catalog type is counts or counts.signature
WriteCatalog

Write a catalog
revc

Reverse complement every string in string.vec.