h2o (version 2.8.4.4)

h2o.ignoreColumns: Return the names of columns in a parsed H2O data object that are recommended to be ignored in an analysis

Description

Returns the names of the columns in a parsed H2OParsedData object that either have a high count of NA entries or contain only one value. The threshold for what counts as a high NA count can be set by the user through max_na.

Usage

h2o.ignoreColumns(data, max_na = 0.2)

Arguments

data
An H2OParsedData object.
max_na
A numeric value between 0 and 1 giving the proportion of NAs in a column that is used as the threshold for a high NA count. Defaults to 0.2.

Value

  • Returns a vector containing the names of the columns that are recommended to be ignored.

Examples

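library(h2o)
# Start (or connect to) a local H2O instance
localH2O <- h2o.init()
# Import the sample airlines data set into H2O under the key "airlines.hex"
airlinesURL <- "https://s3.amazonaws.com/h2o-airlines-unpacked/allyears2k.csv"
airlines.hex <- h2o.importFile(localH2O, path = airlinesURL, key = "airlines.hex")
# List the columns recommended to be ignored, using the default max_na = 0.2
h2o.ignoreColumns(airlines.hex)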
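
The selection rule described above can also be illustrated without an H2O cluster. The following is a minimal base-R sketch on an ordinary data.frame, not the h2o implementation. The function name ignore_columns_sketch and the toy data are hypothetical. It also assumes that a column is flagged when its NA proportion is strictly greater than max_na, and that NAs are excluded when checking for a single value; the actual h2o behaviour at these boundaries may differ.

```r
# Hypothetical base-R analogue of the rule described for h2o.ignoreColumns.
# This is an illustration only and does not call into h2o.
ignore_columns_sketch <- function(df, max_na = 0.2) {
  # Proportion of NA entries in each column
  na_frac <- vapply(df, function(col) mean(is.na(col)), numeric(1))
  # Number of distinct non-NA values in each column
  n_distinct <- vapply(df, function(col) length(unique(col[!is.na(col)])), integer(1))
  # Assumption: flag a column when its NA share exceeds max_na,
  # or when it holds at most one distinct non-NA value
  names(df)[na_frac > max_na | n_distinct <= 1L]
}

# Toy data: one mostly-NA column, one constant column, two ordinary columns
toy <- data.frame(
  id        = 1:6,
  mostly_na = c(NA, NA, NA, NA, 5, 6),
  constant  = rep("a", 6),
  value     = c(1.5, 2.5, NA, 4.5, 5.5, 6.5),
  stringsAsFactors = FALSE
)

flagged <- ignore_columns_sketch(toy)
print(flagged)
```

With the default threshold the sketch flags mostly_na and constant; lowering max_na to 0.1 would also flag value, which has one NA in six rows.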