- sqlFile
character string giving the file location to store the SQLite database
- tmpDir
location for storing the downloaded files from NCBI. (Note that it may be useful to store these somewhere convenient to avoid redownloading)
- getAccessions
if TRUE download the very large accesssion2taxid files necessary to convert accessions to taxonomic IDs
- vocal
if TRUE output messages describing progress
- ...
Arguments passed on to getNamesAndNodes
, getAccession2taxid
, read.accession2taxid
url
the url where taxdump.tar.gz is located
fileNames
the filenames desired from the tar.gz file
baseUrl
the url of the directory where accession2taxid.gz files are located
types
the types if accession2taxid.gz files desired where type is the prefix of xxx.accession2taxid.gz. The default is to download all nucl_ accessions. For protein accessions, try types=c('prot')
.
extraSqlCommand
for advanced use. A string giving a command to be called on the SQLite database before loading data. A couple potential uses:
indexTaxa
if TRUE add an index for taxa ID. This would only be necessary if you want to look up accessions by taxa ID e.g. getAccessions
overwrite
If TRUE, delete accessionTaxa table in database if present and regenerate