The H2OAutoML class
S3 Group Generic Functions for H2O
The H2OCoxPHModel object.
The H2OFrame class
H2O Grid
The H2OClusteringModel object.
H2OConnectionMutableState
The H2OConnectionMutableState class
The H2OConnection class.
H2OCoxPHModelSummary-class
The H2OCoxPHModelSummary object.
Extract or Replace Parts of an H2OFrame Object
The H2OLeafNode class.
The H2OModel object.
wrapper function for instantiating H2OInfogram
H2OInfogram class
Logical or for H2OFrames
Accessor Methods for H2OModel Object
H2O Segment Models
The H2ONode class.
Starting H2O For examples
The H2OModelMetrics Object.
H2O Future Model
Logical and for H2OFrames
The H2OSplitNode class.
H2OSegmentModelsFuture-class
H2O Future Segment Models
The H2OTree class.
Converts parsed H2O data into an R data frame
Stop with a user friendly message if a user is missing the ggplot2 package or has an old version of it.
TODO: No objects in this file are being used. Either remove file or use objects.
Convert an H2OFrame to a String
Create H2OFrame
Decryption Endpoints
Convert H2O Data to Factors
Apply on H2O Datasets
Convert an H2OFrame to a matrix
Convert H2O Data to Numeric
Get a mapping between columns and their domains
Get feature count sorted by the count descending.
H2OFrame Manipulation
Removal Endpoints
Export Files Endpoint Generator
.h2o.__SEGMENT_MODELS_BUILDERS
Segment Models Builder Endpoint Generator
Word2Vec Endpoints
as.data.frame.H2OSegmentModels
Converts a collection of Segment Models to a data.frame
Virtual Keyed class
case_insensitive_match_arg
Works like match.arg but ignores case
Returns the column names of an H2OFrame
Administrative Endpoints
Log and Echo Endpoint
Perform a safe (i.e. error-checked) HTTP POST request to an H2O cluster.
Check if Progress Bar is Enabled
.is_plotting_to_rnotebook
Check if we are plotting in to r notebook.
Just like doRawGET but fills in the default h2oRestApiVersion if none is provided
.h2o.__checkConnectionHealth
Check H2O Server Health
Is the model
a Tree-based H2O Model?
Just like doRawPOST but fills in the default h2oRestApiVersion if none is provided
Is the model
an H2O model?
Enhance leaderboard with per-model predictions.
Perform a low-level HTTP GET operation on an H2O instance
Model Builder Endpoint Generator
Has the model variable importance?
Model Endpoint
Helper Collapse Function
Get the seed from H2OModel which was used during training.
If a user does not set the seed parameter before training, the seed is autogenerated.
It returns seed as the string if the value is bigger than the integer.
For example, an autogenerated seed is always long so that the seed in R is a string.
Convert an H2OFrame to a vector
H2O R Interface
Australia Coastal Data
Is the model considered to be interpretable, i.e., simple enough.
.process_models_or_automl
Do basic validation and transform object
to a "standardized" list containing models, and
their properties such as x
, y
, whether it is a (multinomial) clasification or not etc.
Compute the arc cosine of x
Retrieve the Akaike information criterion (AIC) value
Build an Aggregated Frame
Shortens model ids if possible (iff there will be same amount of unique model_ids as before)
Retrieve the default AECU (Average Excess Cumulative Uplift = area between AUUC and random AUUC)
Consolidate variable importances
Import/Export Endpoints
Model Metrics Endpoint
Locate a file given the pattern <bucket>/<path/to/file>
e.g. h2o:::.h2o.locate("smalldata/iris/iris22.csv") returns the absolute path to iris22.csv
Perform a REST API request to a previously connected server.
Check H2OFrame columns for factors
Parse Endpoints
Inspect/Summary Endpoints
Return the respective bias vector
Extracts the final training average objective function of a GLM model.
H2O bottomN
Get the between cluster sum of squares
Internal function that calculates a precise AUC from given
probabilities and actual responses.
.find_appropriate_column_name
Tries to match a fuzzy_col_name
with a column name that exists in cols
.
Get the algoritm used by the model_or_model_id
H2O <-> R Communication and Utility Methods
Returns the Dimensions of an H2OFrame
Convert to quantiles when provided with numeric vector.
When col is a factor vector assign uniformly value between 0 and 1 to each level.
Get first of family models
Capabilities endpoints
Retrieve centroid statistics
Connect to a running H2O instance.
Retrieve the Model Centers STD
Correlation of columns.
Map of operations known to H2O
Has the model
coefficients?
Access H2O Confusion Matrices
The H2O Package Environment
Min-max normalization.
Column names of an H2OFrame
Compute weighted gram matrix.
Plot variable importances with ggplot2
CHeck to see if the column names/indices entered is valid for the dataframe given. This is an internal function
Given a set of logical vectors, are all of the values true?
Anomaly Detection via H2O Deep Learning Model
feature_frequencies.H2OModel
Retrieve the number of occurrences of each feature for given observations
Available for GBM, Random Forest and Isolation Forest models.
Convert H2O Data to Characters
Rename an H2O object.
Convert H2O Data to Numerics
h2o.cross_validation_predictions
Retrieve the cross-validation predictions
h2o.cross_validation_models
Retrieve the cross-validation models
Retrieve the all types of AUUC in a table
Convert H2O Data to Factors
Retrieve normalized AUUC
Determine if an H2O cluster is up or not
Cut H2O Numeric Data to Factor
Convert Milliseconds to Day of Month in H2O Datasets
Feature Generation via H2O Deep Learning
Return the status of the cluster
Return coefficients fitted on the standardized data (requires standardize = True, which is on by default). These coefficients can be used to evaluate variable importance.
Return the cumulative max over a column or across a row
Return the coefficients table with coefficients, standardized coefficients, p-values, z-values and std-error for GLM models
H2O Description of A Dataset
Build a Deep Neural Network model using CPUs
Used to verify data, x, y and turn into the appropriate things
Get variable importance in a standardized way.
Retrieve HGLM ModelMetrics
Get Model Ids
Rapids Endpoint
Retrieve the all types of AECU (average excess cumulative uplift) value in a table
Perform a safe (i.e. error-checked) HTTP GET request to an H2O cluster.
A helper function that makes it easier to override/add params in a function call.
Perform a low-level HTTP POST operation on an H2O instance
Create a leaderboard like data frame for models
Split H2O Dataset, Apply Function, and Return Results
Return the cumulative min over a column or across a row
Create a frame containing aggregations of intersectional fairness across the models.
Setup a Decryption Tool
H2O Package Constants
Retrieve an aggregated frame from an Aggregator model
H2O ANOVAGLM is used to calculate Type III SS which is used to evaluate the contributions of individual predictors
and their interactions to a model. Predictors or interactions with negligible contributions to the model will have
high p-values while those with more contributions will have low p-values.
Compute the absolute value of x
h2o.extendedIsolationForest
Trains an Extended Isolation Forest model
Sorts an H2O frame by columns
Take a single numeric argument and return a numeric vector with the smallest integers
Export a Model to HDFS
Conduct a lag 1 transform on a numeric H2OFrame column
Export an H2O Data Frame (H2OFrame) to a File or to a collection of Files.
Given a set of logical vectors, is at least one of the values true?
Compute a pairwise distance measure between all rows of two numeric H2OFrames.
Open H2O Flow
Drops duplicated rows.
Download the Scoring POJO (Plain Old Java Object) of an H2O Model
Access H2O Gains/Lift Tables
fillNA
Filter NA Columns
Get an R Reference to an H2O Dataset, that will NOT be GC'd by default
Retrieve an H2O Connection
h2o.find_threshold_by_max_metric
Find the threshold, give the max metric
Take a single numeric argument and return a numeric vector with the largest integers
Build gradient boosted classification or regression trees
Imports a generic model into H2O. Such model can be used then used for scoring and obtaining
additional information about the model. The imported model has to be supported by H2O.
Partial dependence plot per protected group.
Get an R object that is a subclass of H2OAutoML
Get h2o version
Retrieve the Model Centers
Get a grid object from H2O distributed K/V store.
Retrieve the leaderboard from the AutoML instance.
Extracts the knot locations from model output if it is enabled.
h2o.getGLMFullRegularizationPath
Extract full regularization path from a GLM model
Automatic Machine Learning
Extract the maximum lambda value used during lambda search from glm model.
Extract the minimum lambda value calculated during lambda search from glm model.
Note that due to early stop, this minimum lambda value may not be used in the actual lambda search.
h2o.get_predictors_removed_per_step
Extracts the predictor removed to model at each step.
h2o.get_regression_influence_diagnostics
Extracts a list of H2OFrames containing regression influence diagnostics for predictor subsets of various sizes or
just one H2OFrame containing regression influence diagnostics for predictor subsets of one fixed size
Print H2O cluster info
Trains a Cox Proportional Hazards Model (CoxPH) on an H2O dataset
Retrieve AUUC
Delete All H2O R Logs
Compute the cosine of x
Retrieve the cluster sizes
Return the coefficients that can be applied to the non-standardized data.
Data H2OFrame Creation in H2O
Compute the hyperbolic cosine of x
Download H2O Data to Disk
Imports a model under given path, creating a Generic model with it.
Plot PR curve per protected group.
Plot ROC curve per protected group.
Extract best alpha value found from glm model.
Download H2O Log Files to Disk
Retrieves an instance of H2OSegmentModels for a given id.
Convert between character representations and objects of Date class
Get an R reference to an H2O model
Return the Head or Tail of an H2O Dataset.
Compute A Histogram
Retrieve the AUC
Fetchces a single tree of a H2O model. This function is intended to be used on Gradient Boosting Machine models or Distributed Random Forest models.
Extracts the best R2 values for all predictor subset size.
Initialize and Connect to H2O
Insert Missing Values into an H2OFrame
Return the cumulative product over a column or across a row
Import SQL Table into H2O
Combine H2O Datasets by Columns
h2o.calculate_fairness_metrics
Calculate intersectional fairness metrics.
Basic Imputation of H2O Vectors
h2o.cross_validation_holdout_predictions
Retrieve the cross-validation holdout predictions
Retrieve the AUCPR (Area Under Precision Recall Curve)
Return column names of an H2OFrame
Convert Milliseconds to Day of Week in H2O Datasets
Obtain a list of columns that are specified by `coltype`
h2o.cross_validation_fold_assignment
Retrieve the cross-validation fold assignment
Compute DCT of an H2OFrame
Download the model in MOJO format.
Shannon entropy
Compute the exponential function of x
Download the model in binary format.
The owner of the file saved is the user by which python session was executed.
Return the cumulative sum over a column or across a row
SHAP summary plot for one feature with protected groups on y-axis.
Feature interactions and importance, leaf statistics and split value histograms in a tabular form.
Available for XGBoost and GBM.
H2O Grid Support
h2o.get_variable_inflation_factors
Return the variable inflation factors associated with numerical predictors for GLM models.
Group and Apply by Column
Import Files into H2O
Import Hive Table into H2O
H2O Apply Conditional Statement
Import SQL table that is result of SELECT SQL query into H2O
Retrieve the GINI Coefficcient
Imports a MOJO under given path, creating a Generic model with it.
Fit a generalized linear model
Generate Model Explanations for a single row
Generate Model Explanations
h2o.find_row_by_threshold
Find the threshold, give the max metric. No duplicate thresholds allowed
h2o.gains_lift_plot,H2OModelMetrics-method
Plot Gains/Lift curves
Column names of an H2OFrame
h2o.gains_lift_plot,H2OModel-method
Plot Gains/Lift curves
h2o.get_gam_knot_column_names
Extracts the gam column names corresponding to the knot locations from model output if it is enabled.
Plot Individual Conditional Expectation (ICE) for each decile
Compute the log1p of x
Returns the number of rows and columns for an H2OFrame object.
Find synonyms using a word2vec model.
Get back-end distributed key/value store id from an H2OFrame.
Check if numeric
Plot Gains/Lift curves
Get the Time Zone on the H2O cluster
Returns a string
Build an Isotonic Regression model
Extract best lambda value found from glm model.
Method on Keyed
objects allowing to obtain their key.
Get an list of all model ids present in the cluster
Trains an Isolation Forest model
Fit a General Additive Model
Get the types-per-column
Generalized low rank decomposition of an H2O data frame
Loads previously saved grid with all it's models from the same folder
Performs k-means clustering on an H2O dataset
Kolmogorov-Smirnov metric for binomial models
Get best model of a given family/algorithm for a given criterion from an AutoML object.
h2o.get_best_model_predictors
Extracts the subset of predictor names that yield the best R2 value for each predictor subset size.
Model Prediction Correlation
Load frame previously stored in H2O's native format.
Compute msec since the Unix Epoch
Load H2O Model from HDFS or Local Disk
H2O ModelSelection is used to build the best model with one predictor, two predictors, ... up to max_predictor_number
specified in the algorithm parameters when mode=allsubsets. The best model is the one with the highest R2 value. When
mode=maxr, the model returned is no longer guaranteed to have the best R2 value.
Produce a k-fold column vector.
Dump the stack into the JVM's stdout.
Search for matches to an argument pattern
Convert Milliseconds to Hour of Day in H2O Datasets
Retrieve the Hit Ratios
Plot partial dependencies for a variable across multiple models
Check Client Mode Connection
Disable Progress Bar
h2o.get_predictors_added_per_step
Extracts the predictor added to model at each step.
Get the number of factor levels for this frame.
h2o.model_correlation_heatmap
Model Prediction Correlation Heatmap
Retrieve actual number of trees for tree algorithms
iSAX
Calculates Friedman and Popescu's H statistics, in order to test for the presence of an interaction between specified variables in h2o gbm and xgb models.
H varies from 0 to 1. It will have a value of 0 if the model exhibits no interaction between specified variables and a correspondingly larger value for a
stronger interaction effect between them. NaN is returned if a computation is spoiled by weak main effects and rounding errors.
String Global Substitute
Return the levels from the column requested column.
Plot partial dependence for a variable
List Keys on an H2O Cluster
List all of the Time Zones Acceptable by the H2O cluster.
H2O Model Metric Accessor Functions
H2O Data Parsing
h2o.multinomial_aucpr_table
Retrieve the all PR AUC values in a table (One to Rest, One to One, macro and weighted average)
for mutlinomial classification.
Retrieve the Mean Absolute Error Value
Strip set from left
Returns the minima of the input values.
Set betas of an existing H2O GLM Model
Plot Pareto front
h2o.multinomial_auc_table
Retrieve the all AUC values in a table (One to Rest, One to One, macro and weighted average)
for mutlinomial classification.
Check if character
Check if factor
H2O Infogram
List all H2O registered extensions
List registered API extensions
h2o.infogram_train_subset_models
Train models over subsets selected using infogram
Column names of an H2OFrame
Compute naive Bayes probabilities on an H2O dataset.
h2o.permutation_importance
Calculate Permutation Feature Importance.
Model Performance Metrics in H2O
This function will add a new column rank where the ranking is produced as follows:
1. sorts the H2OFrame by columns sorted in by columns specified in group_by_cols and sort_cols in the directions
specified by the ascending for the sort_cols. The sort directions for the group_by_cols are ascending only.
2. A new rank column is added to the frame which will contain a rank assignment performed next. The user can
choose to assign a name to this new column. The default name is New_Rank_column.
3. For each groupby groups, a rank is assigned to the row starting from 1, 2, ... to the end of that group.
4. If sort_cols_sorted is TRUE, a final sort on the frame will be performed frame according to the sort_cols and
the sort directions in ascending. If sort_cols_sorted is FALSE (by default), the frame from step 3 will be
returned as is with no extra sort. This may provide a small speedup if desired.
H2O Prediction from R without having H2O running
Execute a Rapids expression.
Retrieve the default Qini value
Trains a Support Vector Machine model on an H2O dataset
h2o.permutation_importance_plot
Plot Permutation Variable Importances.
Pivot a frame
Reconstruct Training Data via H2O GLRM Model
Convert Archetypes to Features from H2O GLRM Model
Return the product of all the values present in its arguments.
Evaluates validity of the given rules on the given data. Returns a frame with a column per each input rule id,
representing a flag whether given rule is applied to the observation or not.
Combine H2O Datasets by Rows
Reorders levels of factor columns by the frequencies for the individual levels.
Retrieves Root Mean Squared Error Value
Retrieve the Log Loss Value
Log a message on the server-side logs
Return list of jobs performed by the H2O cluster
Converts a frame to key-value representation while optionally skipping NA values.
Inverse operation to h2o.pivot.
Retrieve the Root Mean Squared Log Error
Reorders levels of an H2O factor, similarly to standard R's relevel.
List registered core extensions
h2o.inspect_model_fairness
Produce plots and dataframes related to a single model fairness.
Compute the log2 of x
Categorical Interaction Feature Creation in H2O
Create a leaderboard from a list of models, grids and/or automls.
Create Model Metrics from predicted and actual values in H2O
H2O Median
Remove Rows With NAs
h2o.mean_residual_deviance
Retrieve the Mean Residual Deviance value
Merge Two H2O Data Frames
Return the number of rows present in x.
Retrieves Mean Squared Error Value
Convert Milliseconds to Months in H2O Datasets
Retrieve the null degrees of freedom
h2o.predicted_vs_actual_by_variable
Calculates per-level mean of predicted value vs actual value for a given variable.
Retrieve the null deviance
Retrieve the number of iterations.
Delete Columns from an H2OFrame
Remove All Objects on the H2O Cluster
Print An H2OFrame
Resume previously stopped grid training.
Retrieve the standard deviations of principal components
Standard Deviation of a column of data.
Deprecated - use h2o.save_mojo instead. Save an H2O Model Object as Mojo to Disk
Count of NAs per column
h2o.residual_analysis_plot
Residual Analysis
h2o.shap_explain_row_plot
SHAP Local Explanation
Retrieve the residual deviance
Save contents of this data frame into a Hive table
Skewness of a column
Triggers auto-recovery resume - this will look into configured recovery dir and resume and
tasks that were interrupted by unexpected cluster stopping.
Retrieve the residual degrees of freedom
Save an H2O Model Object as Mojo to Disk
Creates a new Amazon S3 client internally with specified credentials.
Compute the sine of x
Split an H2O Data Set
String Substitute
Tokenize String
H2O Segmented-Data Bulk Model Training
String Split
Compute the square root of x
Convert strings to uppercase
Convert strings to lowercase
String length
Store frame data in H2O's native format.
Upload a binary model from the provided local path to the H2O cluster.
(H2O model can be saved in a binary form either by saveModel() or by download_model() function.)
Delete Objects In H2O
Substring
Compute the frame's sum by-column (or by-row).
H2O topBottomN
H2O topN
Compute the log10 of x
Kurtosis of a column
Value Matching in H2O
Compute the logarithm of x
Learning Curve Plot
Return the number of columns present in x.
SHAP Summary Plot
Reset model threshold and return old threshold value.
Retrieve the R2 value
Quantiles of H2O Frames.
Replicate Elements of Vectors or Lists into H2O
Build a RuleFit Model
Partial Dependence Plots
Get a parse setup back for the staged data.
Scaling and Centering of an H2OFrame
This function returns the table with estimated coefficients and language representations (in case it is a rule)
for each of the significant baselearners.
Imports a MOJO from a local filesystem, creating a Generic model with it.
Plot an H2O Tabulate Heatmap
Plot an H2O Model
View Network Traffic Speed
H2O Prediction from R without having H2O running
Retrieve the mean per class error
H2O Prediction from R without having H2O running
Compute the frame's mean by-column (or by-row).
Returns the maxima of the input values.
h2o.negative_log_likelihood
Extracts the final training negative log likelihood of a GLM model.
Get the total within cluster sum of squares.
Save an H2O Model Object to Disk
Retrieve Model Score History
Plot Variable Importances
Retrieve per-variable split information for a given Isolation Forest model.
Output will include:
- count - The number of times a variable was used to make a split.
- aggregated_split_ratios - The split ratio is defined as "abs(#left_observations - #right_observations) / #before_split".
Even splits (#left_observations approx the same as #right_observations) contribute
less to the total aggregated split ratio value for the given feature;
highly imbalanced splits (eg. #left_observations >> #right_observations) contribute more.
- aggregated_split_depths - The sum of all depths of a variable used to make a split. (If a variable is used
on level N of a tree, then it contributes with N to the total aggregate.)
Enable Progress Bar
Edgar Anderson's Iris Data
h2o.transform,H2OTargetEncoderModel-method
Applies target encoding to a given dataset
Stop Writing H2O R Logs
Apply Target Encoding Map to Frame
Retrieve the variable importance.
Variable Importance Heatmap across multiple models
Plot Standardized Coefficient Magnitudes
Set the Time Zone on the H2O cluster
Set Levels of H2O Factor Column
Needed to be able to memoise the models
h2o.transform,H2OWordEmbeddingModel-method
Transform words (or sequences of words) to vectors using a word2vec model.
Check if character
Truncate values in x toward 0
Cross Tabulation and Table Creation in H2O
Create Target Encoding Map
Trim Space
Retrieve GLM Model Score History buried in GAM model
Save an H2O Model Details
Scree Plot
Count of substrings >= 2 chars that are contained in file
Variance of a column or covariance of columns.
predict_leaf_node_assignment.H2OModel
Predict the Leaf Node Assignment on an H2O Model
plot,H2OParetoFront-method
Plot Pareto front
Check if factor
Is H2O Frame object
Muscular Actuations for Walking Subject
predict_contributions.H2OModel
Predict feature contributions - SHAP values on an H2O Model (only DRF, GBM, XGBoost models and equivalent imported MOJOs).
Use optional package
Plot an H2O Infogram
Principal component analysis of an H2O data frame
Column names of an H2OFrame
summary,H2OCoxPHModel-method
Summary method for H2OCoxPHModel objects
Format AutoML object in user-friendly way
View H2O R Logs
show,H2OParetoFront-method
Show H2OParetoFront
Format AutoML object in user-friendly way
Predict on an H2O Model
Round doubles/floats to the given number of decimal places.
Build a Random Forest model
Returns a vector containing the minimum and maximum of all the given arguments.
Strip set from right
Produce a Vector of Random Uniform Numbers
Saves an existing Grid of models into a given folder.
Summarizes the columns of an H2OFrame.
Round doubles/floats to the given number of significant digits.
Shut Down H2O Instance
Singular value decomposition of an H2O data frame using the power method
Start Writing H2O R Logs
Compute element-wise string distances between two H2OFrames
Display the structure of an H2OFrame object
Builds a Stacked Ensemble
Format grid object in user-friendly way
Print the Model Summary
Which indice contains the max value?
Build an eXtreme Gradient Boosting model
Which indices are TRUE?
Predict on an AutoML object
Compute the hyperbolic tangent of x
Predict on an H2O Model
h2o.varimp,H2OAutoML-method
Retrieve the variable importance.
Compute the tangent of x
Get the total sum of squares.
Tabulation between Two Columns of an H2OFrame
Computes TF-IDF values for each word in given documents.
h2o.varimp,H2OFrame-method
Retrieve the variable importance.
Transformation of a categorical variable with a mean value of the target variable
Convert a word2vec model into an H2OFrame
h2o.varimp,H2OModel-method
Retrieve the variable importance.
h2o.thresholds_and_metric_scores
Retrieve the thresholds and metric scores table
H2O Unique
Print method for H2OTable objects
Trains a word2vec model on a String column of an H2O data frame
Print An H2OFrame
Determines whether an XGBoost model can be built
Check if numeric
Convert Milliseconds to Years in H2O Datasets
Overrides the behavior of length() function on H2OTree class. Returns number of nodes in an H2OTree
Which index contains the min value?
row_to_tree_assignment.H2OModel
Output row to tree assignment for the model and provided training data.
Transform words (or sequences of words) to vectors using a word2vec model.
Get the Within SS
Range of an H2O Column
Use H2O Transformation model and apply the underlying transformation
Convert Milliseconds to Week of Week Year in H2O Datasets
Build a Uplift Random Forest model
Scaling and Centering of an H2OFrame
Retrieve the respective weight matrix
Prostate Cancer Study
initialize,H2OInfogram-method
Method on H2OInfogram
object which in this case is to instantiate and initialize it
United States Congressional Voting Records 1984
staged_predict_proba.H2OModel
Predict class probabilities at each stage of an H2O Model
Display the structure of an H2OFrame object
Shutdown H2O cluster after examples run