
getGenome(db = "refseq", kingdom, organism, path = file.path("_ncbi_downloads", "genomes"))
refseq
or genbank
.
Right now only the ref seq database is included. Later version of biomartr will also allow
sequence retrieval from additional databases.path
= file.path("_ncbi_downloads","genomes")
.refseq: ftp://ftp.ncbi.nlm.nih.gov/genomes/refseq/
genbank: ftp://ftp.ncbi.nlm.nih.gov/genomes/genbank/ and creates a directory '_ncbi_downloads/genomes' to store the genome of interest as fasta file for future processing. In case the corresponding fasta file already exists within the '_ncbi_downloads/genomes' folder and is accessible within the workspace, no download process will be performed.
getProteome
, getCDS
, meta.retrieval
, read_genome
## Not run:
#
# # download the genome of Arabidopsis thaliana from refseq
# # and store the corresponding genome file in '_ncbi_downloads/genomes'
# getGenome( db = "refseq",
# kingdom = "plant",
# organism = "Arabidopsis thaliana",
# path = file.path("_ncbi_downloads","genomes"))
#
# file_path <- file.path("_ncbi_downloads","genomes","Arabidopsis_thaliana_genomic.fna.gz")
# Ath_genome <- read_genome(file_path, format = "fasta")
#
#
# # download the genome of Arabidopsis thaliana from genbank
# # and store the corresponding genome file in '_ncbi_downloads/genomes'
# getGenome( db = "genbank",
# kingdom = "plant",
# organism = "Arabidopsis thaliana",
# path = file.path("_ncbi_downloads","genomes"))
#
# file_path <- file.path("_ncbi_downloads","genomes","Arabidopsis_thaliana_genomic.fna.gz")
# Ath_genome <- read_genome(file_path, format = "fasta")
# ## End(Not run)
Run the code above in your browser using DataLab