Learn R Programming

⚠️There's a newer version (1.13.1) of this package.Take me there.

wrProteo (version 1.5.0)

Proteomics Data Analysis Functions

Description

Data analysis of proteomics experiments by mass spectrometry is supported by this collection of functions mostly dedicated to the analysis of (bottom-up) quantitative (XIC) data. Fasta-formatted proteomes (eg from UniProt Consortium ) can be read with automatic parsing and multiple annotation types (like species origin, abbreviated gene names, etc) extracted. Quantitative proteomics measurements frequently contain multiple NA values, due to physical absence of given peptides in some samples, limitations in sensitivity or other reasons. The functions provided here help to inspect graphically the data to investigate the nature of NA-values via their respective replicate measurements and to help/confirm the choice of NA-replacement by low random values. Dedicated filtering and statistical testing using the framework of package 'limma' can be run, enhanced by multiple rounds of NA-replacements to provide robustness towards rare stochastic events. Multi-species samples, as frequently used in benchmark-tests (eg Navarro et al 2016 , Ramus et al 2016 ), can be run with special options separating the data into sub-groups during normalization and testing. As example the data-set from Ramus et al 2016 ) is provided quantified by MaxQuant (Tyanova et al 2016 ), ProteomeDiscoverer, OpenMS () and Proline (Bouyssie et al 2020 ). Meta-data provided in sdrf format can be integrated to the analysis. Subsequently, ROC curves (Hand and Till 2001 ) can be constructed to compare multiple analysis approaches.

Copy Link

Version

Install

install.packages('wrProteo')

Monthly Downloads

1,294

Version

1.5.0

License

GPL-3

Maintainer

Wolfgang Raffelsberger

Last Published

January 10th, 2022

Functions in wrProteo (1.5.0)

combineMultFilterNAimput

Combine multiple filters on NA-imputed data
VolcanoPlotW2

Volcano-plot (Statistical Test Outcome versus Relative Change)
countNoOfCommonPeptides

AucROC

AUC from ROC-curves
corColumnOrder

Order columns in list of matrixes (or matrix)
cleanListCoNames

Selective batch cleaning of sample- (ie column-) names in list
AAmass

Molecular mass for amino-acids
convAASeq2mass

Molecular mass for amino-acids
extrSpeciesAnnot

Extract species annotation
extractTestingResults

Extract results from moderated t-tests
razorNoFilter

Filter based on either number of total peptides and specific peptides or number of razor petides
readFasta2

Read file of protein sequences in fasta format Read fasta formatted file (from UniProt) to extract (protein) sequences and name. If tableOut=TRUE output may be organized as matrix for separating meta-annotation (eg GeneName, OrganismName, ProteinName) in separate columns.
foldChangeArrow2

Add arrow for expected Fold-Change to VolcanoPlot or MA-plot
isolNAneighb

Isolate NA-neighbours
readMassChroQFile

Read tabulated files imported from MassChroQ
replMissingProtNames

Complement missing EntryNames in annotation
readMaxQuantFile

Read proteinGroups.txt files exported from MaxQuant
summarizeForROC

Summarize statistical test result for plotting ROC-curves
readProtDiscovFile

Read tabulated files exported by ProteomeDiscoverer
matrixNAinspect

Histogram of content of NAs in matrix
massDeFormula

molecular mass from chemical formula
readProlineFile

Read csv or txt files exported from Proline and MS-Angel
readOpenMSFile

Read csv files exported by OpenMS
readPDExport

Read tabulated files imported from Thermo ProteomeDiscoverer
readUniProtExport

Read protein annotation as exported from UniProt batch-conversion
test2grp

t-test each line of 2 groups of data
removeSampleInList

Remove samples/columns from list of matrixes Remove samples (ie columns) from every instance of list of matrixes. Note: This function assumes same order of columns in list-elements 'listElem' !
matrixNAneighbourImpute

Imputation of NA-values based on non-NA replicates
testRobustToNAimputation

Pair-wise testing robust to NA-imputation
readUCSCtable

Read annotation files from UCSC
readSdrf

Read proteomics meta-data as sdrf file This function allows reading proteomics meta-data from sdrf file, as they are provided on https://github.com/bigbio/proteomics-metadata-standard. Thus function requires the package utils being installed.
plotROC

Plot ROC curves