Learn R Programming

naturaList

The goal of the {naturaList} package is providing tools for checking the identification reliability in species occurrence datasets. The main functionality of the {naturaList} package is providing an automated way to identify for the taxon of interest, which records has the most reliable level of classification, i.e, those records identified by specialists. In addition other characteristics of the records could be used to derive up to six levels of confidence.

Installation

You can install the package from CRAN:

install.packages("naturaList")

Or install the latest released development version from github using:

install.packages("devtools")
devtools::install_github("avrodrigues/naturaList")

Levels of confidence

The package allows to classify the occurrence records in confidence levels through the function classify_occ(), that comprises the main function of naturaList package. The most reliable identification of a specimen is made by a specialist in the taxa. The other levels are derived from information contained in the occurrence dataset. The default order of confidence levels used in classification process are:

  • Level 1 - species was identified by a specialist, if not;
  • Level 2 - who identified the species was not a specialist name, if not;
  • Level 3 - occurrence record has an image associated, if not;
  • Level 4 - the specimen is preserved in a scientific collection, if not;
  • Level 5 - the identification was done in filed observation, if not;
  • Level 6 - no criteria was met.

The user can alter this order, depending on his/her objectives, except for the Level 1 that is always a species determined by a specialist.

Basics of naturaList package

An extensive explanation of all the features of naturaList package is provided through vignette articles. To conduct a basic classification process through classify_occ function the user must provide only two data frames. The first containing the occurrence records and the second with a list of specialists. The classify_occ() function add a new column in the occurrences dataset named naturaList_levels, which contains the classification.

library(naturaList)
data("A.setosa")
data("speciaLists")

occ.cl <- classify_occ(A.setosa, speciaLists)

Naturalist also offer an interactive module that allows to visualize occurrence in space, get information by pointing the occurrence of interest and manually edit occurrence records by point and click. This interactive module is activate through function map_module. An article explaining all features of map_module function can be accessed in this article

Other resources

Auxiliary functions that allows the user to access the effects of filtering procedures based on classification levels are clean_eval and grid_filter functions. A complete example of the usage of these functions can be found in this article.

See vignette for all articles describing the functionalities of naturaList package.

Copy Link

Version

Install

install.packages('naturaList')

Monthly Downloads

238

Version

0.5.2

License

MIT + file LICENSE

Issues

Pull Requests

Stars

Forks

Maintainer

Arthur Vinicius Rodrigues

Last Published

February 6th, 2024

Functions in naturaList (0.5.2)

reduce.df

Internal function of naturaList - reduce data.frame of occurrence for a minimal column length
map_module

Check the occurrence records in a interactive map module
rm.coord.dup

Internal function of naturaList - Remove duplicate occurrence
func.det.by.esp

Internal function of naturaList - Detect if a string has a specialist name
r.temp.prec

Raster of temperature and precipitation
get_det_names

Get the names in the 'identified.by' column
verify.specialist

Internal function of naturaList - Verify if a string has unambiguous specialist name
spec_names_ex

Example of specialist names with accent marks
specialist.conference

Internal function of naturaList - Confirm if an occurrence record was identified by a specialist without ambiguity
speciaLists

Specialists of ferns and lycophytes of Brazil
classify_occ

Classify occurrence records in levels of confidence in species identification
create_spec_df

Create specialist data frame from character vector
A.setosa

Occurrence records of Alsophila setosa downloaded from Global Biodiversity Information Facility (GBIF).
pttn.all.specialist

Internal function of naturaList - Return specialists names in a collapsed string
cyathea.br

Occurrence records of Cyathea species in Brazil downloaded from Global Biodiversity Information Facility (GBIF).
check.spec

Internal function of naturaList - Manual check of ambiguity in specialist's name
abrev.pttn

Internal function of naturaList - Return abbreviation collapsed
clean_eval

Evaluate the cleaning of occurrences records
grid_filter

Filter the occurrence with most confidence in species identification inside grid cells
define_env_space

Define environmental space for species occurrence
has.det.ID

Internal function of naturaList - Identifies if a occurrence has a name for the identifier of the specimen
make.polygon

Internal function of naturaList - Create SpatialPolygons from a list of coordinates
BR

Brazil boundary
pol.coords

Internal function of naturaList - Get coordinates from polygons created in leaflet map
%>%

Pipe operator
env_grid_filter

Filter occurrences in environmental space