MLBC (version 0.2.1)

SD_data: Job postings dataset

Description

A subset of job postings data from the Lightcast platform, included to demonstrate bias correction methods for regressions that use ML-generated variables.

Usage

SD_data

Format

SD_data

A data frame with 16315 rows and 7 columns:

city_name

Character. City of the job posting

naics_2022_2

Character. Type of business (NAICS industry classification)

id

Integer. Unique identifier of the job posting

salary

Numeric. Salary offered (response variable)

wfh_wham

Numeric. Binary label generated via ML, indicating whether remote work is offered (subject to measurement error)

soc_2021_2

Character. Occupation code (SOC classification)

employment_type_name

Character. Employment type (part time or full time)
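The column list above can be turned into a quick structural check. The sketch below is illustrative only: `check_sd_schema()` is a hypothetical helper that is not part of MLBC, the `toy` data frame holds made-up values, and the expected classes are simply transcribed from the Format section (the real object may store some columns differently, for example `id` as a double).

```r
# Expected column names and base R classes, transcribed from the Format section.
expected_cols <- c(
  city_name            = "character",
  naics_2022_2         = "character",
  id                   = "integer",
  salary               = "numeric",
  wfh_wham             = "numeric",
  soc_2021_2           = "character",
  employment_type_name = "character"
)

# Hypothetical helper (not part of MLBC): TRUE if `df` contains every
# documented column and each column's first class matches the expected one.
check_sd_schema <- function(df, expected = expected_cols) {
  if (!all(names(expected) %in% names(df))) return(FALSE)
  actual <- vapply(df[names(expected)], function(x) class(x)[1], character(1))
  all(actual == expected)
}

# Tiny synthetic stand-in with made-up values, used only to exercise the helper.
toy <- data.frame(
  city_name            = c("City A", "City B"),
  naics_2022_2         = c("54", "62"),
  id                   = 1:2,
  salary               = c(50000, 72000),
  wfh_wham             = c(0, 1),
  soc_2021_2           = c("15", "29"),
  employment_type_name = c("Full time", "Part time"),
  stringsAsFactors     = FALSE
)
check_sd_schema(toy)

# With the package installed, the same check can be applied to the real data.
if (requireNamespace("MLBC", quietly = TRUE)) {
  data("SD_data", package = "MLBC")
  check_sd_schema(SD_data)
}
```

A `FALSE` result on the real data would indicate only that a stored class differs from the transcription above, not that the data are faulty.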

Examples

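if (FALSE) {
library(MLBC)  # provides SD_data and ols_bca()
data(SD_data)
# Bias-corrected OLS of log salary on the ML-generated remote-work label,
# with occupation and industry codes as controls
fit <- ols_bca(log(salary) ~ wfh_wham + soc_2021_2 + naics_2022_2,
               data = SD_data, fpr = 0.009, m = 1000)
}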
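
To illustrate why a label that is "subject to measurement error" calls for a correction at all, the following base R simulation may help. It is a sketch under stated assumptions, not a description of how MLBC works: all data are simulated, misclassification is assumed to consist of false positives only, and the names `x_true`, `x_hat`, and `fpr_sim` are invented for this example. It does not call `ols_bca()`.

```r
set.seed(1)
n <- 20000

# True remote-work indicator (unobserved in practice): about 10% of postings.
x_true <- rbinom(n, size = 1, prob = 0.10)

# Outcome with a true coefficient of 0.5 on the indicator.
beta <- 0.5
y <- 1 + beta * x_true + rnorm(n)

# ML-style label: every true 1 is kept, and each true 0 is flipped to 1
# with probability `fpr_sim` (false positives only, by assumption).
fpr_sim <- 0.05
x_hat <- ifelse(x_true == 1, 1, rbinom(n, size = 1, prob = fpr_sim))

# Slope from a regression on the true label versus the noisy label.
coef_true  <- unname(coef(lm(y ~ x_true))["x_true"])
coef_noisy <- unname(coef(lm(y ~ x_hat))["x_hat"])

# Share of predicted 1s that are real 1s; with false positives only, the
# naive slope is approximately beta times this share.
precision <- mean(x_true[x_hat == 1])

round(c(true_label = coef_true, noisy_label = coef_noisy,
        beta_times_precision = beta * precision), 3)
```

In this setup the regression on the noisy label understates the coefficient, because some postings labelled as remote are not, which pulls the labelled group's mean outcome toward the unlabelled group's.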