Learn R Programming

⚠️There's a newer version (4.0.1) of this package.Take me there.

blscrapeR

Designed to be a tidy API wrapper for the Bureau of Labor Statistics (BLS.) The package has additional functions to help parse, analyze and visualize the data. The package utalizes "tidyverse" concepts for internal functionality and encourages the use of those concepts with the output data.

Install

  • Stable version from CRAN:
install.packages("blscrapeR")
  • The latest development version from GitHub:
devtools::install_github("keberwein/blscrapeR")

Before getting started, you’ll probably want to head over to the BLS and get set up with an API key. While an API key is not required to use the package, the query limits are much higher if you have a key and you’ll have access to more data. Plus, it’s free (as in beer), so why not?

Basic Usage

For “quick and dirty” type of analysis, the package has some quick functions that will pull metrics from the API without series numbers. These quick functions include unemployment, employment, and civilian labor force on a national level.

library(blscrapeR)
# Grab the Unemployment Rate (U-3) 
df <- quick_unemp_rate()
head(df, 5)
#> # A tibble: 5 x 6
#>    year    period periodName value footnotes  seriesID
#>   <dbl>    <list>     <list> <dbl>    <list>    <list>
#> 1  2017 <chr [1]>  <chr [1]>   4.3 <chr [1]> <chr [1]>
#> 2  2017 <chr [1]>  <chr [1]>   4.4 <chr [1]> <chr [1]>
#> 3  2017 <chr [1]>  <chr [1]>   4.3 <chr [1]> <chr [1]>
#> 4  2017 <chr [1]>  <chr [1]>   4.4 <chr [1]> <chr [1]>
#> 5  2017 <chr [1]>  <chr [1]>   4.5 <chr [1]> <chr [1]>

DISCLAIMER: Some working knowledge of BLS series numbers are required here. The BLS claims that they “do not currently have a catalog of series IDs.” The BLS Data Finder website is a good place to nail down the series numbers we're looking for.

API Keys

You should consider getting an API key form the BLS. The package has a function to install your key in your .Renviron so you’ll only have to worry about it once. Plus, it will add extra security by not having your key hard-coded in your scripts for all the world to see.

From the BLS:

ServiceVersion 2.0 (Registered)Version 1.0 (Unregistered)
Daily query limit50025
Series per query limit5025
Years per query limit2010
Net/Percent ChangesYesNo
Optional annual averagesYesNo
Series descriptionsYesNo

Key Install

library(blscrapeR)
set_bls_key("YOUR_KEY_IN_QUOTATIONS")
# First time, reload your enviornment so you can use the key without restarting R.
readRenviron("~/.Renviron")
# You can check it with:
Sys.getenv("BLS_KEY")

Note: The above script will add a line to your .Renviron file to be re-used when ever you are in the package. If you are not comfortable with that, you can add the following line to your .Renviron file manually to produce the same result.

BLS_KEY='YOUR_KEY_IN_SINGLE_QUOTES'

Advanced Usage

Now that you have an API key installed, you can call your key in the package’s function arguments with "BLS_KEY". Don't forget the quotes! If you just HAVE to have your key hard-coded in your scripts, you can also pass they key as a string.

Download Multiple BLS Series at Once

library(blscrapeR)

# Grab several data sets from the BLS at onece.
# NOTE on series IDs: 
# EMPLOYMENT LEVEL - Civilian labor force - LNS12000000
# UNEMPLOYMENT LEVEL - Civilian labor force - LNS13000000
# UNEMPLOYMENT RATE - Civilian labor force - LNS14000000
df <- bls_api(c("LNS12000000", "LNS13000000", "LNS14000000"),
              startyear = 2008, endyear = 2017, registrationKey = "BLS_KEY") %>%
    # Add time-series dates
    dateCast()
# Plot employment level
library(ggplot2)
gg1200 <- subset(df, seriesID=="LNS12000000")
library(ggplot2)
ggplot(gg1200, aes(x=date, y=value)) +
    geom_line() +
    labs(title = "Employment Level - Civ. Labor Force")

Median Weekly Earnings

library(blscrapeR)
library(tidyverse)
# Median Usual Weekly Earnings by Occupation, Unadjusted Second Quartile.
# In current dollars
df <- bls_api(c("LEU0254530800", "LEU0254530600"), startyear = 2000, endyear = 2016, registrationKey = Sys.getenv("BLS_KEY")) %>%
    spread(seriesID, value) %>% dateCast()
# A little help from ggplot2!
library(ggplot2)
ggplot(data = df, aes(x = date)) + 
    geom_line(aes(y = LEU0254530800, color = "Database Admins.")) +
    geom_line(aes(y = LEU0254530600, color = "Software Devs.")) + 
    labs(title = "Median Weekly Earnings by Occupation") + ylab("value") +
    theme(legend.position="top", plot.title = element_text(hjust = 0.5)) 

For more advanced usage, please see the package vignettes.

Basic Mapping

Like the the “quick functions” for requesting API data, there are two "quick" map functions, bls_map_county() and bls_map_state(). These functions are designed to work with two helper functions get_bls_county() and get get_bls_state(). Each helper function downloads recent data for unemployment rate, unemployment level, employment rate, employment level and civilian labor force. These functions do not pull data from the API, rather they pull data from text files and do not count against daily query limits.

Note: Even though the get_bls functions return data in the correct formats, the bls_map functions can be used with any data set that includes FIPS codes.

The example below maps the current unemployment rate by county. Alaska and Hawaii have to re-located to save space.

library(blscrapeR)
# Grap the data in a pre-formatted data frame.
# If no argument is passed to the function it will load the most recent month's data.
df <- get_bls_county()

#Use map function with arguments.
map_bls(map_data = df, fill_rate = "unemployed_rate", 
               labtitle = "Unemployment Rate by County")

Advanced Mapping

What's R mapping without some interactivity? Below we’re using two additional packages, leaflet, which is popular for creating interactive maps and tigris, which allows us to download the exact shape files we need for these data and includes a few other nice tools.

# Leaflet map
library(blscrapeR)
library(tigris)
library(leaflet)
map.shape <- counties(cb = TRUE, year = 2015)
df <- get_bls_county()

# Slice the df down to only the variables we need and rename "fips" colunm
# so I can get a cleaner merge later.
df <- df[, c("unemployed_rate", "fips")]
colnames(df) <- c("unemployed_rate", "GEOID")

# Merge df with spatial object.
leafmap <- geo_join(map.shape, df, by="GEOID")

# Format popup data for leaflet map.
popup_dat <- paste0("<strong>County: </strong>", 
                    leafmap$NAME, 
                    "<br><strong>Value: </strong>", 
                    leafmap$unemployed_rate)

pal <- colorQuantile("YlOrRd", NULL, n = 20)
# Render final map in leaflet.
leaflet(data = leafmap) %>% addTiles() %>%
    addPolygons(fillColor = ~pal(unemployed_rate), 
                fillOpacity = 0.8, 
                color = "#BDBDC3", 
                weight = 1,
                popup = popup_dat)

Note: This is just a static image since the full map would not be as portable for the purpose of documentation.

Copy Link

Version

Install

install.packages('blscrapeR')

Monthly Downloads

428

Version

3.0.1

License

MIT + file LICENSE

Issues

Pull Requests

Stars

Forks

Maintainer

Kris Eberwein

Last Published

October 5th, 2017

Functions in blscrapeR (3.0.1)

%<>%

Compound_pipe
county_fips

Dataset containing FIPS codes for mapping and lookup.
area_titles

Dataset containing FIPS codes for counties, states and MSAs.
bls_api

Basic Request Mechanism for BLS Tables
county_map_data

Dataset for mapping U.S. counties
dateCast

Cast a date column to data frame returned by the bls_api() function
bls_map_county

choropleth mapping of BLS data
bls_map_state

choropleth mapping of BLS data
get_bls_county

A function that returns county-level labor statistics
get_bls_state

A function that returns county-level labor statistics
niacs

Dataset containing NIACS codes for industry lookups.
%>%

Pipe operator
state_fips

Dataset with the lat. / long. of county FIPS codes used for mapping.
inflation_adjust

Convert the Value of a US Dollar to a Given Year
map_bls

choropleth mapping of BLS data
urlExists

urlExists
state_map_data

Dataset for mapping U.S. states
quick_employed_rate

Quick employed rate
quick_laborForce_level

Quick Civilian Labor Force Level
quick_unemp_level

Quick unemployment level function
quick_unemp_rate

Quick unemployment rate function
qcew_api

Request data from the Quarterly Census of Employment and Wages.
quick_employed_level

Quick employed level
set_bls_key

Install a BLS API Key in Your .Renviron File for Repeated Use
quick_laborForce_rate

Quick Civilian Labor Force Rate
quick_nonfarm_employed

Quick total nonfarm employment
size_titles

Dataset containing size codes for US industries by size.