fastDummies (version 1.0.0)

dummy_cols: Fast creation of dummy variables

Description

dummy_cols() quickly creates dummy (binary) columns from character and factor type columns in the inputted data. This function is useful for statistical analysis when you want binary columns rather than character columns.

Usage

dummy_cols(data, select_columns = NULL, remove_first_dummy = FALSE)

Arguments

data

An object with the data set you want to make dummy columns from.

select_columns

Vector of column names that you want to create dummy variables from. If NULL (default), uses all character and factor columns.

remove_first_dummy

Removes the first dummy of every variable that only n-1 Dummies remain. This avoids multicollinearity issues in models.

Value

A data.frame with same number of rows as inputted data and original columns plus the newly created dummy columns.

See Also

dummy_rows For creating dummy rows

Other dummy functions: dummy_columns, dummy_rows

Examples

Run this code
# NOT RUN {
crime <- data.frame(city = c("SF", "SF", "NYC"),
    year = c(1990, 2000, 1990),
    crime = 1:3)
dummy_cols(crime)
# Include year column
dummy_cols(crime, select_columns = c("city", "year"))
# Remove first dummy for each pair of dummy columns made
dummy_cols(crime, select_columns = c("city", "year"),
    remove_first_dummy = TRUE)
# }

Run the code above in your browser using DataLab