Note: a newer version (1.7-5) of this package is available.

protr (version 0.3-1)

Protein Sequence Descriptor Calculation and Similarity Computation with R

Description

The protr package offers a comprehensive toolkit for protein sequence descriptor calculation and similarity computation. The descriptors included in protr are extensively used in bioinformatics and chemogenomics research.

The qualitative descriptors in protr include the Amino Acid Composition descriptors (Amino Acid Composition, Dipeptide Composition, Tripeptide Composition), the Autocorrelation descriptors (Normalized Moreau-Broto Autocorrelation, Moran Autocorrelation, Geary Autocorrelation), the CTD descriptors (Composition, Transition, Distribution), the Conjoint Triad descriptor, the Quasi-sequence Order descriptors (Sequence Order Coupling Number, Quasi-sequence Order Descriptors), and the Pseudo Amino Acid Composition descriptors (Pseudo Amino Acid Composition, Amphiphilic Pseudo Amino Acid Composition).

The quantitative descriptors, intended for Proteochemometric (PCM) Modeling, include Generalized Scales-Based Descriptors derived by Principal Components Analysis, by AA-Properties (AAindex), by 20+ classes of 2D and 3D Molecular Descriptors (Topological, WHIM, VHSE, etc.), by Factor Analysis, and by Multidimensional Scaling, as well as Generalized BLOSUM/PAM Matrix-Derived Descriptors.

The package also provides parallelized similarity computation based on protein sequence alignment, and Gene Ontology (GO) semantic similarity measures between lists of protein sequences, GO terms, or Entrez Gene IDs. ProtrWeb, the web service built on protr, is located at http://cbdd.csu.edu.cn:8080/protrweb/. The protr package is developed by the Computational Biology and Drug Design (CBDD) Group, Central South University.
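
As a rough illustration of the simplest descriptor family named above, amino acid composition is the fraction of each of the 20 standard residue types in a sequence. The sketch below uses only base R. The helper name `aa_composition` and the toy sequence are hypothetical, and this is not protr's own implementation (the package lists `extractAAC` for this descriptor).

```r
# Hypothetical helper: fraction of each of the 20 standard amino acids.
# A base-R sketch for illustration only, not protr's implementation.
aa_composition <- function(seq) {
  aa <- c("A", "R", "N", "D", "C", "E", "Q", "G", "H", "I",
          "L", "K", "M", "F", "P", "S", "T", "W", "Y", "V")
  # Split the sequence string into single-letter residues
  residues <- strsplit(toupper(seq), "")[[1]]
  # Reject empty input and anything outside the 20 standard types
  stopifnot(length(residues) > 0, all(residues %in% aa))
  # Count every type (zero counts included) and normalize by length
  counts <- table(factor(residues, levels = aa))
  freq <- as.numeric(counts) / length(residues)
  names(freq) <- aa
  freq
}

toy <- "MKVLAAGIVG"  # made-up sequence, not real data
comp <- aa_composition(toy)
print(round(comp, 2))
```

The result is a named numeric vector of length 20 whose entries sum to 1, so sequences of different lengths map to feature vectors of the same size.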

Install

install.packages('protr')

Monthly Downloads

581

Version

0.3-1

License

BSD 3-clause License + file LICENSE

Maintainer

Nan Xiao

Last Published

August 23rd, 2014

Functions in protr (0.3-1)

AAPAM120
PAM120 Matrix for 20 Amino Acids

AAGeom
Geometrical Descriptors for 20 Amino Acids calculated by Dragon

AATopo
Topological Descriptors for 20 Amino Acids calculated by Dragon

acc
Auto Cross Covariance (ACC) for Generating Scales-Based Descriptors of the Same Length

AA2DACOR
2D Autocorrelations Descriptors for 20 Amino Acids calculated by Dragon

AAEigIdx
Eigenvalue-Based Indices Descriptors for 20 Amino Acids calculated by Dragon

AAPAM250
PAM250 Matrix for 20 Amino Acids

AABLOSUM100
BLOSUM100 Matrix for 20 Amino Acids

AAGETAWAY
GETAWAY Descriptors for 20 Amino Acids calculated by Dragon

AARandic
Randic Molecular Profiles Descriptors for 20 Amino Acids calculated by Dragon

AAEdgeAdj
Edge Adjacency Indices Descriptors for 20 Amino Acids calculated by Dragon

AABLOSUM50
BLOSUM50 Matrix for 20 Amino Acids

extractCTDD
CTD Descriptors - Distribution

AABurden
Burden Eigenvalues Descriptors for 20 Amino Acids calculated by Dragon

extractAAC
Amino Acid Composition Descriptor

AABLOSUM62
BLOSUM62 Matrix for 20 Amino Acids

AA3DMoRSE
3D-MoRSE Descriptors for 20 Amino Acids calculated by Dragon

extractDescScales
Scales-Based Descriptors with 20+ classes of Molecular Descriptors

extractSOCN
Sequence-Order-Coupling Numbers

AAInfo
Information Indices Descriptors for 20 Amino Acids calculated by Dragon

extractMDSScales
Generalized Scales-Based Descriptors derived by Multidimensional Scaling

AAindex
AAindex Data of 544 Physicochemical and Biological Properties for 20 Amino Acids

protseg
Protein Sequence Segmentation

AATopoChg
Topological Charge Indices Descriptors for 20 Amino Acids calculated by Dragon

parGOSim
Protein Sequence Similarity Calculation based on Gene Ontology (GO) Similarity

extractPropScales
Generalized AA-Properties Based Scales Descriptors

extractPAAC
Pseudo Amino Acid Composition Descriptor

getUniProt
Get Protein Sequences from UniProt by Protein ID

AAMOE2D
2D Descriptors for 20 Amino Acids calculated by MOE 2011.10

protcheck
Check if the protein sequence's amino acid types are in the 20 default types

extractCTDT
CTD Descriptors - Transition

AAPAM30
PAM30 Matrix for 20 Amino Acids

AABLOSUM45
BLOSUM45 Matrix for 20 Amino Acids

AAFGC
Functional Group Counts Descriptors for 20 Amino Acids calculated by Dragon

AAPAM40
PAM40 Matrix for 20 Amino Acids

AAMOE3D
3D Descriptors for 20 Amino Acids calculated by MOE 2011.10

extractGeary
Geary Autocorrelation Descriptor

OptAA3d
OptAA3d.sdf - 20 Amino Acids Optimized with MOE 2011.10 (Semiempirical AM1)

AAConn
Connectivity Indices Descriptors for 20 Amino Acids calculated by Dragon

parSeqSim
Parallelized Protein Sequence Similarity Calculation based on Sequence Alignment

extractAPAAC
Amphiphilic Pseudo Amino Acid Composition Descriptor

extractMoreauBroto
Normalized Moreau-Broto Autocorrelation Descriptor

extractDC
Dipeptide Composition Descriptor

twoSeqSim
Protein Sequence Alignment for Two Protein Sequences

AAWalk
Walk and Path Counts Descriptors for 20 Amino Acids calculated by Dragon

readPDB
Read Protein Sequences in PDB Format

AARDF
RDF Descriptors for 20 Amino Acids calculated by Dragon

twoGOSim
Protein Similarity Calculation based on Gene Ontology (GO) Similarity

AAACF
Atom-Centred Fragments Descriptors for 20 Amino Acids calculated by Dragon

AABLOSUM80
BLOSUM80 Matrix for 20 Amino Acids

AAMolProp
Molecular Properties Descriptors for 20 Amino Acids calculated by Dragon

AACPSA
CPSA Descriptors for 20 Amino Acids calculated by Discovery Studio

protr-package
Protein Sequence Descriptor Calculation and Similarity Computation with R

readFASTA
Read Protein Sequences in FASTA Format

extractCTriad
Conjoint Triad Descriptor

AADescAll
All 2D Descriptors for 20 Amino Acids calculated by Dragon

extractMoran
Moran Autocorrelation Descriptor

extractTC
Tripeptide Composition Descriptor

extractQSO
Quasi-Sequence-Order (QSO) Descriptor

AAPAM70
PAM70 Matrix for 20 Amino Acids

AAMetaInfo
Meta Information for the 20 Amino Acids

AAWHIM
WHIM Descriptors for 20 Amino Acids calculated by Dragon

extractBLOSUM
Generalized BLOSUM and PAM Matrix-Derived Descriptors

extractFAScales
Generalized Scales-Based Descriptors derived by Factor Analysis

extractScales
Generalized Scales-Based Descriptors derived by Principal Components Analysis

AAConst
Constitutional Descriptors for 20 Amino Acids calculated by Dragon

extractCTDC
CTD Descriptors - Composition
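
A few of the functions listed above can be combined into a short descriptor workflow. The sketch below assumes protr is installed and that `protcheck`, `extractAAC`, and `extractDC` each accept a single sequence string; the toy sequence and the variable names are made up for illustration. The protr calls are skipped when the package is not available, so the snippet does not fail without it.

```r
# Made-up toy sequence; real input would typically come from readFASTA()
toy <- "MKVLAAGIVGSTRPLLWEQ"

# Run the protr calls only when the package is available (assumption:
# the function names match the listing above)
has_protr <- requireNamespace("protr", quietly = TRUE)

if (has_protr) {
  # protcheck() reports whether all residues are among the 20 default types
  stopifnot(protr::protcheck(toy))

  aac <- protr::extractAAC(toy)  # amino acid composition
  dc  <- protr::extractDC(toy)   # dipeptide composition

  # Concatenate both descriptors into one feature vector
  features <- c(aac, dc)
  print(length(features))
}
```

Checking the sequence first matters because the composition descriptors are defined over the 20 standard amino acid types only.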