H2OIsolationForest

data: The data.table containing the columns to analyze.

Features: A character vector of column names to use in the isolation forest.

IDcols: A character vector of column names that are not used in the isolation forest but are returned with the output data. Unused columns that are not listed here are removed.

ModelID: Name of the model that is saved to file when SavePath is supplied and valid.

SavePath: Directory in which to store the saved model.

Threshold: Quantile used to compute the cutoff value for classifying outliers.

MaxMem: Amount of memory to allocate to H2O, e.g. "28G".

NThreads: Number of threads to use (e.g. cores * 2).

NTrees: Number of decision trees to build.

MaxDepth

MinRows: Minimum number of rows allowed per leaf.

RowSampleRate

ColSampleRate

ColSampleRatePerLevel

ColSampleRatePerTree

CategoricalEncoding: One of "AUTO", "Enum", "OneHotInternal", "OneHotExplicit", "Binary", "Eigen", "LabelEncoder", "SortByResponse", "EnumLimited".

Debug
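
To illustrate how a quantile-based Threshold can turn anomaly scores into outlier flags, here is a minimal base R sketch. It does not call H2OIsolationForest or H2O. The `scores` vector is made-up data standing in for model output, and the rule that rows scoring above the cutoff are flagged is an assumption for illustration, not a confirmed description of the package's internals.

```r
# Hypothetical anomaly scores: 95 ordinary rows followed by 5 unusual rows.
# In practice these would come from a fitted isolation forest.
set.seed(42)
scores <- c(runif(95, min = 0.30, max = 0.50),
            runif(5,  min = 0.80, max = 0.95))

# Quantile level, playing the role of the Threshold argument (value assumed).
Threshold <- 0.95

# Score value at the chosen quantile.
cutoff <- unname(quantile(scores, probs = Threshold))

# Assumed rule: rows scoring above the cutoff are flagged as outliers.
is_outlier <- scores > cutoff

sum(is_outlier)
```

Under this rule, a higher Threshold flags fewer rows, because the cutoff moves further into the upper tail of the score distribution.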

H2OIsolationForestScoring for dimensionality reduction and / or anomaly detection

R package for the automation of machine learning, forecasting, feature engineering, model evaluation, model interpretation, data generation, and recommenders. Built using data.table for all tabular data-related tasks.
