A newer version (2.0.0.2) of this package is available.

wrProteo (version 1.4.1)

Proteomics Data Analysis Functions

Description

This collection of functions supports the analysis of proteomics experiments by mass spectrometry, with a focus on (bottom-up) quantitative (XIC) data. Fasta-formatted proteomes (e.g. from the UniProt Consortium) can be read with automatic parsing, and multiple annotation types (such as species origin, abbreviated gene names, etc.) can be extracted. Quantitative proteomics measurements frequently contain multiple NA values, due to physical absence of given peptides in some samples, limitations in sensitivity, or other reasons. The functions provided here help to inspect the data graphically, to investigate the nature of NA values via their respective replicate measurements, and to guide or confirm the choice of NA replacement by low random values. Dedicated filtering and statistical testing using the framework of the package 'limma' can be run, enhanced by multiple rounds of NA replacement to provide robustness towards rare stochastic events. Multi-species samples, as frequently used in benchmark tests (e.g. Navarro et al 2016, Ramus et al 2016), can be run with special options that separate the data into sub-groups during normalization and testing. As an example, the data set from Ramus et al 2016 is provided, quantified by MaxQuant (Tyanova et al 2016), ProteomeDiscoverer, OpenMS and Proline (Bouyssie et al 2020). Subsequently, ROC curves (Hand and Till 2001) can be constructed to compare multiple analysis approaches.
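
To illustrate the idea of NA replacement by low random values and of repeating the replacement over several rounds, here is a minimal base-R sketch. It does not call wrProteo and is not the package's algorithm: the helper `imputeLowRandom`, its arguments `lowQuant` and `sdFactor`, and the toy matrix `dat` are all hypothetical names and values chosen for illustration only.

```r
# Minimal base-R sketch (NOT wrProteo code): replace NAs by low random values.
set.seed(2021)  # reproducible random draws

# Hypothetical log2 abundances: 6 proteins x 4 samples, with some NAs
dat <- matrix(c(20.1, 19.8, NA,   20.3,
                15.2, NA,   NA,   15.0,
                22.4, 22.1, 22.6, 22.3,
                NA,   NA,   14.1, NA,
                18.0, 18.2, 17.9, 18.1,
                16.5, NA,   16.7, 16.4),
              nrow = 6, byrow = TRUE,
              dimnames = list(paste0("prot", 1:6), paste0("sample", 1:4)))

# Hypothetical helper: draw replacements from a normal distribution centred
# on a low quantile of the observed values (assumed defaults, not from wrProteo)
imputeLowRandom <- function(x, lowQuant = 0.1, sdFactor = 0.3) {
  isNA   <- is.na(x)
  obs    <- as.numeric(x[!isNA])
  centre <- quantile(obs, probs = lowQuant, names = FALSE)  # low end of observed data
  spread <- sdFactor * sd(obs)
  x[isNA] <- rnorm(sum(isNA), mean = centre, sd = spread)   # only NAs are touched
  x
}

# One round of NA replacement
datImp <- imputeLowRandom(dat)

# Several independent rounds; downstream statistics can then be summarised
# across rounds so that a single unlucky draw has less influence
nRounds <- 10
impList <- replicate(nRounds, imputeLowRandom(dat), simplify = FALSE)
datMean <- Reduce(`+`, impList) / nRounds
```

Observed values are left unchanged in every round; only the former NA positions differ between rounds, which is why summarising results over rounds dampens the effect of rare stochastic draws.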

Install

install.packages('wrProteo')

Monthly Downloads

1,515

Version

1.4.1

License

GPL-3

Maintainer

Wolfgang Raffelsberger

Last Published

April 15th, 2021

Functions in wrProteo (1.4.1)

convAASeq2mass

Molecular mass for amino-acids
AucROC

AUC from ROC-curves
cleanListCoNames

Selective batch cleaning of sample- (ie column-) names in list
combineMultFilterNAimput

Combine multiple filters on NA-imputed data
corColumnOrder

Order columns in list of matrixes (or matrix)
AAmass

Molecular mass for amino-acids
extrSpeciesAnnot

Extract species annotation
VolcanoPlotW2

Volcano-plot (Statistical Test Outcome versus Relative Change)
extractTestingResults

Extract results from moderated t-tests
countNoOfCommonPeptides

foldChangeArrow2

Add arrow for fold-change to volcano-plot or MA-plot
isolNAneighb

Isolate NA-neighbours
readProtDiscovFile

Read tabulated files exported by ProteomeDiscoverer
plotROC

Plot ROC curves
readProlineFile

Read csv or txt files exported from Proline and MS-Angel
matrixNAneighbourImpute

Imputation of NA-values based on non-NA replicates
summarizeForROC

Summarize statistical test result for plotting ROC-curves
massDeFormula

Molecular mass from chemical formula
matrixNAinspect

Histogram of content of NAs in matrix
test2grp

t-test each line of 2 groups of data
razorNoFilter

Filter based on either number of total peptides and specific peptides or number of razor peptides
readFasta2

Read file of protein sequences in fasta format. Reads a fasta-formatted file (from UniProt) to extract (protein) sequences and names. If tableOut=TRUE, the output may be organized as a matrix separating meta-annotation (e.g. GeneName, OrganismName, ProteinName) into separate columns.
readOpenMSFile

Read csv files exported by OpenMS
readMaxQuantFile

Read proteinGroups.txt files exported from MaxQuant
removeSampleInList

Remove samples/columns from list of matrixes. Removes samples (i.e. columns) from every instance of a list of matrixes. Note: this function assumes the same order of columns in the list elements 'listElem'.
readUniProtExport

Read protein annotation as exported from UniProt batch-conversion
readMassChroQFile

Read tabulated files imported from MassChroQ
replMissingProtNames

Complement missing EntryNames in annotation
readUCSCtable

Read annotation files from UCSC
testRobustToNAimputation

Pair-wise testing robust to NA-imputation
readPDExport

Read tabulated files imported from Thermo ProteomeDiscoverer
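
The ROC-related entries in the list above (summarizing test results, plotting ROC curves, computing the AUC) rest on a simple calculation that can be sketched in base R. The following is a generic illustration under stated assumptions, not the code behind `summarizeForROC`, `plotROC` or `AucROC`: the helpers `rocPoints` and `aucTrapezoid` and the toy p-values are hypothetical, and ties between p-values are not treated specially.

```r
# Minimal base-R sketch (NOT wrProteo code): ROC points and trapezoidal AUC.

# Hypothetical test outcome for 6 proteins of a benchmark sample, where the
# proteins expected to change (e.g. a spiked-in species) are known
pValue    <- c(0.001, 0.004, 0.03, 0.2, 0.5, 0.8)
isTruePos <- c(TRUE,  TRUE,  FALSE, TRUE, FALSE, FALSE)

# Walk through the proteins from smallest to largest p-value and record the
# true-positive and false-positive rates after each step
rocPoints <- function(pValue, isTruePos) {
  lab <- isTruePos[order(pValue)]
  data.frame(fpr = c(0, cumsum(!lab) / sum(!lab)),
             tpr = c(0, cumsum(lab)  / sum(lab)))
}

# Area under the curve by the trapezoidal rule
aucTrapezoid <- function(roc) {
  sum(diff(roc$fpr) * (head(roc$tpr, -1) + tail(roc$tpr, -1)) / 2)
}

roc <- rocPoints(pValue, isTruePos)
auc <- aucTrapezoid(roc)

# Plot the curve with the diagonal of a random classifier for reference
plot(roc$fpr, roc$tpr, type = "s", xlab = "False positive rate",
     ylab = "True positive rate", main = sprintf("AUC = %.3f", auc))
abline(0, 1, lty = 2)
```

Computing such a curve for each quantification or testing approach on the same benchmark data gives a common scale for comparing them; an AUC closer to 1 means the truly changing proteins are ranked ahead of the unchanged ones more consistently.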