RemixAutoML (version 0.2.8)

CreateHolidayVariables: Create Holiday Count Columns

Description

CreateHolidayVariables rapidly creates holiday count variables from the date columns you provide.

Usage

CreateHolidayVariables(
  data,
  DateCols = NULL,
  HolidayGroups = c("USPublicHolidays", "EasterGroup", "ChristmasGroup",
    "OtherEcclesticalFeasts"),
  Holidays = NULL,
  GroupingVars = NULL,
  Print = FALSE
)

Arguments

data

The data.table that contains your date columns

DateCols

Column names or column numbers of the date columns to use for creating the holiday variables

HolidayGroups

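Character vector of holiday groups to count. The default selects all four groups shown in Usage: "USPublicHolidays", "EasterGroup", "ChristmasGroup", and "OtherEcclesticalFeasts"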

Holidays

Names of individual holidays to count. Defaults to NULL

GroupingVars

Names of the grouping variables in data, such as the factor columns used in the Examples section. Defaults to NULL

Print

Set to TRUE to print the iteration number to the console. Defaults to FALSE

Value

Returns your data.table with the holiday count variables added

See Also

Other Feature Engineering: AutoDataPartition(), AutoHierarchicalFourier(), AutoLagRollStatsScoring(), AutoLagRollStats(), AutoTransformationCreate(), AutoTransformationScore(), AutoWord2VecModeler(), ContinuousTimeDataGenerator(), CreateCalendarVariables(), DT_GDL_Feature_Engineering(), DummifyDT(), H2oAutoencoder(), ModelDataPrep(), Partial_DT_GDL_Feature_Engineering(), TimeSeriesFill()

Examples

# NOT RUN {
# Create fake data with a Date----
data <- RemixAutoML::FakeDataGenerator(
  Correlation = 0.75,
  N = 25000L,
  ID = 2L,
  ZIP = 0L,
  FactorCount = 4L,
  AddDate = TRUE,
  Classification = FALSE,
  MultiClass = FALSE)
# Append 20 more batches of 25,000 rows (525,000 rows in total)
for(i in seq_len(20L)) {
  print(i)
  data <- data.table::rbindlist(list(data,
  RemixAutoML::FakeDataGenerator(
    Correlation = 0.75,
    N = 25000L,
    ID = 2L,
    ZIP = 0L,
    FactorCount = 4L,
    AddDate = TRUE,
    Classification = FALSE,
    MultiClass = FALSE)))
}
# Run function and time it
runtime <- system.time(
  data <- RemixAutoML::CreateHolidayVariables(
    data,
    DateCols = "DateTime",
    HolidayGroups = c("USPublicHolidays","EasterGroup",
      "ChristmasGroup","OtherEcclesticalFeasts"),
    Holidays = NULL,
    GroupingVars = c("Factor_1","Factor_2","Factor_3","Factor_4"),
    Print = FALSE))
head(data)
print(runtime)
# }
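
To make the idea of a holiday count column concrete, here is a minimal base R sketch. It rests on an assumption that this page does not confirm: that the count for a row is the number of holidays falling after the previous observation date and on or before the current one. The helper count_holidays() and the two holiday dates are hypothetical and are not part of RemixAutoML; the package's actual method may differ.

```r
# Hypothetical helper, not part of RemixAutoML.
# Assumption: a row's holiday count is the number of holidays in the
# interval (previous date, current date].
count_holidays <- function(dates, holidays) {
  dates <- as.Date(dates)
  holidays <- as.Date(holidays)
  # Previous observation date; the first row has no predecessor,
  # so its window is just the single day of the first date
  prev <- c(dates[1L] - 1, dates[-length(dates)])
  # For each row, count holidays after `prev` and on or before `dates`
  vapply(seq_along(dates), function(i) {
    sum(holidays > prev[i] & holidays <= dates[i])
  }, integer(1L))
}

# Weekly observations spanning the end of 2020
dates <- as.Date(c("2020-12-21", "2020-12-28", "2021-01-04"))
# Illustrative holiday dates: Christmas Day and New Year's Day
holidays <- as.Date(c("2020-12-25", "2021-01-01"))

counts <- count_holidays(dates, holidays)
print(data.frame(Date = dates, HolidayCount = counts))
```

With grouped data, a sketch like this would be applied separately within each group so that the previous date always comes from the same series, which is presumably the role GroupingVars plays in the function documented here.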
