Learn R Programming


title: "README" author: "Jose V Die" date: "28/2/2019" output: html_document

geneHummus

An Automated Pipeline to Study Plant Gene Families based on Protein Domain Organization using Auxin Response Factors in chickpea and other Legumes as an Example. The pipeline is convenient if you are interested in plant gene families characterization, or you'd like to perform some downstream analysis such as phylogenetic construction or gene expression profiling, to name a few examples.

Accessing the package

A stable version of this package is available on CRAN and can be installed directly from there:

install.packages("geneHummus")

The lastest development version of the package can also be loaded directly from GitHub using the devtools package:

library(devtools)
install_github("NCBI-Hackathons/GeneHummus")
library(GeneHummus)

Publication

Die JV, Elmassry MM, Leblanc KH, Awe OI, Dillman A, Busby B (2018) GeneHummus: A pipeline to define gene families and their expression in legumes and beyond, bioRxiv 436659; doi: https://doi.org/10.1101/436659

Copy Link

Version

Install

install.packages('geneHummus')

Monthly Downloads

179

Version

1.0.11

License

MIT + file LICENSE

Issues

Pull Requests

Stars

Forks

Maintainer

Jose V. Die

Last Published

April 4th, 2019

Functions in geneHummus (1.0.11)

getArch_labels

Get the description label for a protein architecture identifier
geneHummus

genehummus: A pipeline to define gene families in Legumes and beyond
proteins_warning

Get RefSeq protein identifiers for the given taxonomic species
sizeIds

Build a list containing N elements per element list
accessions_warning

Get acessions and organism for each protein identifier
archids_warning

Get architecture identifiers for the conserved domains
accessions_by_spp

Compute the total number of accession proteins per species
getSparcleArchs

Get the electronic architecture for a conserved domain
getProtlinks

Get the protein identifiers for a given architecture
accessions_from_spp

Extract the accession ids (XP accession) for a given organism
getAccessions

Get the acessions ids and the organism for each protein identifier
getArch_ids

Get the potential architecture identifiers for the conserved domains
extract_proteins

Get the protein identifiers
filterArch_ids

Filter the protein architectures based on conserved domains
get_spp

Get the species name from the description sequence
labels_warning

Get description label for a protein architecture identifier
legumesIds

NCBI taxonomy ids for the legume family
my_legumes

ARF proteins per legume specie
filterarchids_warning

Filter protein architectures based on conserved domains
getProteins_from_tax_ids

Get the RefSeq protein identifiers for the given taxonomic species