The living atlases store content on hundreds of different fields, and users
often require thousands or millions of records at a time. To reduce time taken
to download data, and limit complexity of the resulting data.frame, it is
sensible to restrict the fields returned by atlas_occurrences().
This function allows easy selection of fields, or commonly-requested groups
of columns, following syntax shared with dplyr::select().
galah_select(..., group = c("basic", "event", "assertions"))zero or more individual column names to include
string: (optional) name of one or more column groups to
include. Valid options are "basic", "event" and
"assertion"
An object of class data.frame and galah_select
specifying the name and type of each column to include in the
call to atlas_counts() or atlas_occurrences().
Download occurrence records of Perameles taken in 2001, only returning scientific name and event date
galah_config(email = "your-email@email.com")
galah_call() |>
galah_identify("perameles")|>
galah_filter(year == 2001) |>
galah_select(scientificName, eventDate) |>
atlas_occurrences()
Download occurrence record of Perameles taken in 2001, returning the "basic" group of columns plus the Basis of Record
galah_call() |>
galah_identify("perameles") |>
galah_filter(year == 2001) |>
galah_select(group = "basic", basisOfRecord) |>
atlas_occurrences()
Calling the argument group = "basic" returns the following columns:
decimalLatitude
decimalLongitude
eventDate
scientificName
taxonConceptID
recordID
dataResourceName
Using group = "event" returns the following columns:
eventRemarks
eventTime
eventID
eventDate
samplingEffort
samplingProtocol
Using group = "assertions" returns all quality assertion-related
columns. The list of assertions is shown by search_fields(type = "assertions").
search_taxa(), galah_filter() and
galah_geolocate() for other ways to restrict the information returned
by atlas_occurrences() and related functions; atlas_counts()
for how to get counts by levels of variables returned by galah_select.