Get amino acid and chemical compositions of proteins.
protcomp(uniprot = NULL, ip = NULL, basis = "QEC",
aa_file = NULL, updates_file = NULL)
character, UniProt IDs of proteins
numeric, indices of active proteins in CHNOSZ
character, keyword for basis species to use
character, file name
character, file name
The function returns a list with elements protein.formula
(elemental compositions of the proteins), ZC
(average oxidation state of carbon), protein.basis
(compositions of the proteins in terms of the basis species), protein.length
(lengths of the amino acid sequences), residue.basis
(per-residue compositions of the proteins in terms of the basis species), residue.formula
(per-residue elemental compositions of the proteins), and aa
(amino acid compositions of the proteins).
This function retrieves the amino acid compositions of one or more proteins specified by uniprot
or ip
, then calculates some chemical compositional properties using functions provided by CHNOSZ.
The basis
argument is used to select the basis species using a keyword (see basis
).
For example, use CHNOS for , , , , and , or QEC (the default) for glutamine, glutamic acid, cysteine, , and .
This function depends on the amino acid compositions of human proteins, which are stored in the canprot
environment when the package is attached.
If aa_file
is specified, additional amino acid compositions to be considered are read from this file, which should be in the same format as e.g. human_extra.csv
(see also thermo$protein
).
If updates_file
is specified, these ID mappings are included alongside the packaged uniprot_updates
.
# NOT RUN {
library(CHNOSZ)
protcomp("P24298")
# }
Run the code above in your browser using DataLab