A dataset containing NCBI information on 1000 eukaryotes. The variables are as follows:
data(LM_eukaryotes)
A lifemap object, i.e. a list containing the basemap used to fetch the data and df, a data frame with 2760 rows and 26 variables:
Organism name at the species level
NCBI taxid
BioProject Accession number (from BioProject database)
BioProject ID
Commonly used organism groups: Animals, Fungi, Plants, Protists
NCBI Taxonomy level below group: Mammals, Birds, Fishes, Flatworms, Insects, Amphibians, Reptiles, Roundworms, Ascomycetes, Basidiomycetes, Land Plants, Green Algae, Apicomplexans, Kinetoplasts
Total length of DNA submitted for the project
Percent of nitrogenous bases (guanine or cytosine) in DNA submitted for the project
Name of the genome assembly (from NCBI Assembly database)
Number of replicons in the assembly
Four-letter Accession prefix followed by version as defined in WGS division of GenBank/INSDC
Number of scaffolds in the assembly
Number of Genes annotated in the assembly
Number of Proteins annotated in the assembly
First public sequence release for the project
Sequence modification date for the project
Highest level of assembly:
Chromosomes: one or more chromosomes are assembled
Scaffolds or contigs: sequence assembled but no chromosomes
Origin of the sample
BioSample Accession number
Longitude of taxids on a specific basemap
Latitude of taxids on a specific basemap
Scientific name of taxids
Zoom of taxids on a specific basemap
The list of all ancestors of taxids on a specific basemap
Either "requested" if the taxid was given, or "ancestor" if it was retrieved from the database
The direct ancestor of taxids on a specific basemap
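
The structure described above can be inspected directly in R. The following is a minimal sketch, not an authoritative usage example. It assumes the dataset ships with a package named LifemapR (the package name is not stated here) and that the column holding the "requested"/"ancestor" flag is called type (also an assumption, so the code checks for it before using it).

```r
# Assumption: the dataset is provided by a package named "LifemapR".
data(LM_eukaryotes, package = "LifemapR")

# The lifemap object is a list; list its components.
names(LM_eukaryotes)

# df is the data frame documented above.
eukaryotes <- LM_eukaryotes$df
dim(eukaryotes)    # number of rows and variables
names(eukaryotes)  # actual column names

# Assumption: the requested/ancestor flag is stored in a column named "type".
# Count how many rows were requested and how many were added as ancestors.
if ("type" %in% names(eukaryotes)) {
  print(table(eukaryotes$type))
}
```

Listing the column names first, rather than hard-coding them, keeps the sketch valid even if the actual names differ from the assumed one.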