Learn R Programming

canprot (version 0.1.2)

protcomp: Protein Compositions

Description

Get amino acid and chemical compositions of proteins.

Usage

protcomp(uniprot = NULL, ip = NULL, basis = "QEC",
           aa_file = NULL, updates_file = NULL)

Arguments

uniprot

character, UniProt IDs of proteins

ip

numeric, indices of active proteins in CHNOSZ

basis

character, keyword for basis species to use

aa_file

character, file name

updates_file

character, file name

Value

The function returns a list with elements protein.formula (elemental compositions of the proteins), ZC (average oxidation state of carbon), protein.basis (compositions of the proteins in terms of the basis species), protein.length (lengths of the amino acid sequences), residue.basis (per-residue compositions of the proteins in terms of the basis species), residue.formula (per-residue elemental compositions of the proteins), and aa (amino acid compositions of the proteins).

Details

This function retrieves the amino acid compositions of one or more proteins specified by uniprot or ip, then calculates some chemical compositional properties using functions provided by CHNOSZ. The basis argument is used to select the basis species using a keyword (see basis). For example, use CHNOS for , , , , and , or QEC (the default) for glutamine, glutamic acid, cysteine, , and .

This function depends on the amino acid compositions of human proteins, which are stored in the canprot environment when the package is attached. If aa_file is specified, additional amino acid compositions to be considered are read from this file, which should be in the same format as e.g. human_extra.csv (see also thermo$protein). If updates_file is specified, these ID mappings are included alongside the packaged uniprot_updates.

Examples

Run this code
# NOT RUN {
library(CHNOSZ)
protcomp("P24298")
# }

Run the code above in your browser using DataLab