Learn R Programming

rgbif (version 0.9.6)

dataset_search: Search datasets in GBIF.

Description

This function does not search occurrence data, only metadata on the datasets that contain occurrence data.

Usage

dataset_search(query = NULL, country = NULL, type = NULL, keyword = NULL, owningOrg = NULL, publishingOrg = NULL, hostingOrg = NULL, publishingCountry = NULL, decade = NULL, facet = NULL, facetMincount = NULL, facetMultiselect = NULL, limit = 100, start = NULL, pretty = FALSE, return = "all", ...)

Arguments

query
Query term(s) for full text search. The value for this parameter can be a simple word or a phrase. Wildcards can be added to the simple word parameters only, e.g. q=*puma*
country
NOT YET IMPLEMENTED. Filters by country as given in isocodes$gbif_name, e.g. country=CANADA.
type
Type of dataset, options include OCCURRENCE, etc.
keyword
Keyword to search by. Datasets can be tagged by keywords, which you can search on. The search is done on the merged collection of tags, the dataset keywordCollections and temporalCoverages. SEEMS TO NOT BE WORKING ANYMORE AS OF 2016-09-02.
owningOrg
Owning organization. A uuid string. See organizations
publishingOrg
Publishing organization. A uuid string. See organizations
hostingOrg
Hosting organization. A uuid string. See organizations
publishingCountry
Publishing country. See options at isocodes$gbif_name
decade
Decade, e.g., 1980. Filters datasets by their temporal coverage broken down to decades. Decades are given as a full year, e.g. 1880, 1960, 2000, etc, and will return datasets wholly contained in the decade as well as those that cover the entire decade or more. Facet by decade to get the break down, e.g. /search?facet=DECADE&facet_only=true (see example below)
facet
A list of facet names used to retrieve the 100 most frequent values for a field. Allowed facets are: datasetKey, highertaxonKey, rank, status, extinct, habitat, and nameType. Additionally threat and nomenclaturalStatus are legal values but not yet implemented, so data will not yet be returned for them.
facetMincount
Used in combination with the facet parameter. Set facetMincount=# to exclude facets with a count less than #, e.g. http://bit.ly/1bMdByP only shows the type value 'ACCEPTED' because the other statuses have counts less than 7,000,000
facetMultiselect
Used in combination with the facet parameter. Set facetMultiselect=true to still return counts for values that are not currently filtered, e.g. http://bit.ly/19YLXPO still shows all status values even though status is being filtered by status=ACCEPTED
limit
Number of records to return. Default: 100. Maximum: 1000.
start
Record number to start at. Default: 0. Use in combination with limit to page through results.
pretty
Print informative metadata using cat. Not easy to manipulate output though.
return
What to return. One of meta, descriptions, data, facets, or all (Default).
...
Further named parameters, such as query, path, etc, passed on to modify_url within GET call. Unnamed parameters will be combined with config.

Value

A data.frame, list, or message printed to console (using pretty=TRUE).

References

http://www.gbif.org/developer/registry#datasetSearch

Examples

Run this code
## Not run: 
# # Gets all datasets of type "OCCURRENCE".
# dataset_search(type="OCCURRENCE", limit = 10)
# 
# # Fulltext search for all datasets having the word "amsterdam" somewhere in
# # its metadata (title, description, etc).
# dataset_search(query="amsterdam", limit = 10)
# 
# # Limited search
# dataset_search(type="OCCURRENCE", limit=2)
# dataset_search(type="OCCURRENCE", limit=2, start=10)
# 
# # Return just descriptions
# dataset_search(type="OCCURRENCE", return="descriptions", limit = 10)
# 
# # Return metadata in a more human readable way (hard to manipulate though)
# dataset_search(type="OCCURRENCE", pretty=TRUE, limit = 10)
# 
# # Search by country code. Lookup isocodes first, and use US for United States
# isocodes[agrep("UNITED", isocodes$gbif_name),]
# dataset_search(country="US", limit = 10)
# 
# # Search by decade
# dataset_search(decade=1980, limit = 10)
# 
# # Faceting
# ## just facets
# dataset_search(facet="decade", facetMincount="10", limit=0)
# 
# ## data and facets
# dataset_search(facet="decade", facetMincount="10", limit=2)
# 
# ## httr options
# library('httr')
# dataset_search(facet="decade", facetMincount="10", limit=2, config=verbose())
# ## End(Not run)

Run the code above in your browser using DataLab