Learn R Programming

⚠️There's a newer version (1.13.1) of this package.Take me there.

wrProteo (version 1.4.2)

Proteomics Data Analysis Functions

Description

Data analysis of proteomics experiments by mass spectrometry is supported by this collection of functions mostly dedicated to the analysis of (bottom-up) quantitative (XIC) data. Fasta-formatted proteomes (eg from UniProt Consortium ) can be read with automatic parsing and multiple annotation types (like species origin, abbreviated gene names, etc) extracted. Quantitative proteomics measurements frequently contain multiple NA values, due to physical absence of given peptides in some samples, limitations in sensitivity or other reasons. The functions provided here help to inspect graphically the data to investigate the nature of NA-values via their respective replicate measurements and to help/confirm the choice of NA-replacement by low random values. Dedicated filtering and statistical testing using the framework of package 'limma' can be run, enhanced by multiple rounds of NA-replacements to provide robustness towards rare stochastic events. Multi-species samples, as frequently used in benchmark-tests (eg Navarro et al 2016 , Ramus et al 2016 ), can be run with special options separating the data into sub-groups during normalization and testing. As example the data-set from Ramus et al 2016 ) is provided quantified by MaxQuant (Tyanova et al 2016 ), ProteomeDiscoverer, OpenMS () and Proline (Bouyssie et al 2020 ). Subsequently, ROC curves (Hand and Till 2001 ) can be constructed to compare multiple analysis approaches.

Copy Link

Version

Install

install.packages('wrProteo')

Monthly Downloads

857

Version

1.4.2

License

GPL-3

Maintainer

Wolfgang Raffelsberger

Last Published

July 5th, 2021

Functions in wrProteo (1.4.2)

corColumnOrder

Order columns in list of matrixes (or matrix)
cleanListCoNames

Selective batch cleaning of sample- (ie column-) names in list
AAmass

Molecular mass for amino-acids
convAASeq2mass

Molecular mass for amino-acids
combineMultFilterNAimput

Combine multiple filters on NA-imputed data
countNoOfCommonPeptides

extrSpeciesAnnot

Extract species annotation
VolcanoPlotW2

Volcano-plot (Statistical Test Outcome versus Relative Change)
AucROC

AUC from ROC-curves
extractTestingResults

Extract results from moderated t-tests
readMassChroQFile

Read tabulated files imported from MassChroQ
foldChangeArrow2

Add arrow for expected Fold-Change to VolcanoPlot or MA-plot
plotROC

Plot ROC curves
matrixNAneighbourImpute

Imputation of NA-values based on non-NA replicates
readMaxQuantFile

Read proteinGroups.txt files exported from MaxQuant
readFasta2

Read file of protein sequences in fasta format Read fasta formatted file (from UniProt) to extract (protein) sequences and name. If tableOut=TRUE output may be organized as matrix for separating meta-annotation (eg GeneName, OrganismName, ProteinName) in separate columns.
razorNoFilter

Filter based on either number of total peptides and specific peptides or number of razor petides
isolNAneighb

Isolate NA-neighbours
readUniProtExport

Read protein annotation as exported from UniProt batch-conversion
matrixNAinspect

Histogram of content of NAs in matrix
massDeFormula

molecular mass from chemical formula
readUCSCtable

Read annotation files from UCSC
readOpenMSFile

Read csv files exported by OpenMS
removeSampleInList

Remove samples/columns from list of matrixes Remove samples (ie columns) from every instance of list of matrixes. Note: This function assumes same order of columns in list-elements 'listElem' !
replMissingProtNames

Complement missing EntryNames in annotation
readPDExport

Read tabulated files imported from Thermo ProteomeDiscoverer
test2grp

t-test each line of 2 groups of data
summarizeForROC

Summarize statistical test result for plotting ROC-curves
testRobustToNAimputation

Pair-wise testing robust to NA-imputation
readProtDiscovFile

Read tabulated files exported by ProteomeDiscoverer
readProlineFile

Read csv or txt files exported from Proline and MS-Angel