The H2OAutoML class
The H2OCoxPHModel object.
S3 Group Generic Functions for H2O
The H2OConnection class.
H2OCoxPHModelSummary-class
The H2OCoxPHModelSummary object.
H2OConnectionMutableState
The H2OConnectionMutableState class
The H2OClusteringModel object.
H2O Grid
The H2OSplitNode class.
H2OSegmentModelsFuture-class
H2O Future Segment Models
The H2OLeafNode class.
Extract or Replace Parts of an H2OFrame Object
The H2OFrame class
H2OInfogram class
wrapper function for instantiating H2OInfogram
Starting H2O For examples
The H2ONode class.
Accessor Methods for H2OModel Object
Logical or for H2OFrames
Logical and for H2OFrames
The H2OModel object.
case_insensitive_match_arg
Works like match.arg but ignores case
H2O Future Model
Returns the column names of an H2OFrame
TODO: No objects in this file are being used. Either remove file or use objects.
Get first of family models
H2O Segment Models
The H2OTree class.
Stop with a user friendly message if a user is missing the ggplot2 package or has an old version of it.
Apply on H2O Datasets
as.data.frame.H2OSegmentModels
Converts a collection of Segment Models to a data.frame
Removal Endpoints
Map of operations known to H2O
H2OFrame Manipulation
Decryption Endpoints
Has the model
coefficients?
Capabilities endpoints
The H2OModelMetrics Object.
Virtual Keyed class
Column names of an H2OFrame
Converts parsed H2O data into an R data frame
Returns the Dimensions of an H2OFrame
Convert an H2OFrame to a String
Has the model variable importance?
Rapids Endpoint
Word2Vec Endpoints
Import/Export Endpoints
Is the model considered to be interpretable, i.e., simple enough.
H2O <-> R Communication and Utility Methods
Convert to quantiles when provided with numeric vector.
When col is a factor vector assign uniformly value between 0 and 1 to each level.
Retrieve the Akaike information criterion (AIC) value
Convert an H2OFrame to a matrix
Given a set of logical vectors, are all of the values true?
Convert an H2OFrame to a vector
Convert H2O Data to Factors
Create H2OFrame
Perform a safe (i.e. error-checked) HTTP GET request to an H2O cluster.
Convert H2O Data to Numeric
Consolidate variable importances
Australia Coastal Data
Helper Collapse Function
Rename an H2O object.
Retrieve Average Treatment Effect on the Control
Take a single numeric argument and return a numeric vector with the smallest integers
Automatic Machine Learning
Get the seed from H2OModel which was used during training.
If a user does not set the seed parameter before training, the seed is autogenerated.
It returns seed as the string if the value is bigger than the integer.
For example, an autogenerated seed is always long so that the seed in R is a string.
.h2o.__checkConnectionHealth
Check H2O Server Health
Create a leaderboard like data frame for models
H2O Package Constants
Locate a file given the pattern <bucket>/<path/to/file>
e.g. h2o:::.h2o.locate("smalldata/iris/iris22.csv") returns the absolute path to iris22.csv
.find_appropriate_column_name
Tries to match a fuzzy_col_name
with a column name that exists in cols
.
Retrieve the Model Centers
Get the algoritm used by the model_or_model_id
Administrative Endpoints
A helper function that makes it easier to override/add params in a function call.
Get a mapping between columns and their domains
Retrieve HGLM ModelMetrics
Return coefficients fitted on the standardized data (requires standardize = True, which is on by default). These coefficients can be used to evaluate variable importance.
Internal function that calculates a precise AUC from given
probabilities and actual responses.
Get feature count sorted by the count descending.
.is_plotting_to_rnotebook
Check if we are plotting in to r notebook.
Enhance leaderboard with per-model predictions.
Convert H2O Data to Factors
Given a set of logical vectors, is at least one of the values true?
Return the coefficients table with coefficients, standardized coefficients, p-values, z-values and std-error for GLM models
Export Files Endpoint Generator
Inspect/Summary Endpoints
Check H2OFrame columns for factors
Retrieve the default AECU (Average Excess Cumulative Uplift = area between AUUC and random AUUC)
Compute DCT of an H2OFrame
Retrieve the all types of AECU (average excess cumulative uplift) value in a table
Model Metrics Endpoint
Return the respective bias vector
Compute weighted gram matrix.
Parse Endpoints
Retrieve the Model Centers STD
Log and Echo Endpoint
Model Endpoint
Model Builder Endpoint Generator
Just like doRawPOST but fills in the default h2oRestApiVersion if none is provided
Combine H2O Datasets by Columns
feature_frequencies.H2OModel
Retrieve the number of occurrences of each feature for given observations
Available for GBM, Random Forest and Isolation Forest models.
h2o.cross_validation_fold_assignment
Retrieve the cross-validation fold assignment
Retrieve centroid statistics
Check if Progress Bar is Enabled
h2o.cross_validation_holdout_predictions
Retrieve the cross-validation holdout predictions
Column names of an H2OFrame
Return the cumulative product over a column or across a row
Return the cumulative sum over a column or across a row
.h2o.__SEGMENT_MODELS_BUILDERS
Segment Models Builder Endpoint Generator
Perform a low-level HTTP POST operation on an H2O instance
Access H2O Confusion Matrices
Convert Milliseconds to Day of Week in H2O Datasets
Perform a safe (i.e. error-checked) HTTP POST request to an H2O cluster.
Retrieve the all types of AUUC in a table
Return the cumulative min over a column or across a row
H2O R Interface
Compute the arc cosine of x
Perform a REST API request to a previously connected server.
Split H2O Dataset, Apply Function, and Return Results
Build an AdaBoost model
Is the model
an H2O model?
Is the model
a Tree-based H2O Model?
Compute the hyperbolic cosine of x
Create a frame containing aggregations of intersectional fairness across the models.
Just like doRawGET but fills in the default h2oRestApiVersion if none is provided
Feature Generation via H2O Deep Learning
Setup a Decryption Tool
Data H2OFrame Creation in H2O
CHeck to see if the column names/indices entered is valid for the dataframe given. This is an internal function
Download the model in binary format.
The owner of the file saved is the user by which python session was executed.
Retrieve normalized AUUC
Compute the absolute value of x
Convert between character representations and objects of Date class
Min-max normalization.
Get Model Ids
Convert H2O Data to Characters
Compute the exponential function of x
Get variable importance in a standardized way.
Partial dependence plot per protected group.
Download H2O Data to Disk
Plot PR curve per protected group.
Perform a low-level HTTP GET operation on an H2O instance
Generate Model Explanations
h2o.get_best_model_predictors
Extracts the subset of predictor names that yield the best R2 value for each predictor subset size.
Return the cumulative max over a column or across a row
Retrieve an aggregated frame from an Aggregator model
Used to verify data, x, y and turn into the appropriate things
The H2O Package Environment
Obtain a list of columns that are specified by `coltype`
Export a Model to HDFS
Trains a Cox Proportional Hazards Model (CoxPH) on an H2O dataset
Generate Model Explanations for a single row
.process_models_or_automl
Do basic validation and transform object
to a "standardized" list containing models, and
their properties such as x
, y
, whether it is a (multinomial) clasification or not etc.
Sorts an H2O frame by columns
h2o.calculate_fairness_metrics
Calculate intersectional fairness metrics.
Plot variable importances with ggplot2
Return column names of an H2OFrame
Extract best lambda value found from glm model.
Shortens model ids if possible (iff there will be same amount of unique model_ids as before)
Build an Aggregated Frame
Anomaly Detection via H2O Deep Learning Model
Export an H2O Data Frame (H2OFrame) to a File or to a collection of Files.
h2o.gains_lift_plot,H2OModelMetrics-method
Plot Gains/Lift curves
Extract best alpha value found from glm model.
Get the types-per-column
Retrieve an H2O Connection
Get h2o version
Take a single numeric argument and return a numeric vector with the largest integers
Retrieve the GINI Coefficcient
Retrieve AUUC
H2O ANOVAGLM is used to calculate Type III SS which is used to evaluate the contributions of individual predictors
and their interactions to a model. Predictors or interactions with negligible contributions to the model will have
high p-values while those with more contributions will have low p-values.
Open H2O Flow
Search for matches to an argument pattern
Build a Decision Tree model
Extract the minimum lambda value calculated during lambda search from glm model.
Note that due to early stop, this minimum lambda value may not be used in the actual lambda search.
Plot Gains/Lift curves
Compute a pairwise distance measure between all rows of two numeric H2OFrames.
Return the status of the cluster
Get a grid object from H2O distributed K/V store.
h2o.get_variable_inflation_factors
Return the variable inflation factors associated with numerical predictors for GLM models.
Download H2O Log Files to Disk
H2O Apply Conditional Statement
h2o.extendedIsolationForest
Trains an Extended Isolation Forest model
Convert H2O Data to Numerics
Retrieve Average Treatment Effect
Retrieve Average Treatment Effect on the Treated
h2o.get_regression_influence_diagnostics
Extracts a list of H2OFrames containing regression influence diagnostics for predictor subsets of various sizes or
just one H2OFrame containing regression influence diagnostics for predictor subsets of one fixed size
Get an R reference to an H2O model
Retrieve the AUC
Plot Individual Conditional Expectation (ICE) for each decile
H2O bottomN
Retrieve the AUCPR (Area Under Precision Recall Curve)
Convert Milliseconds to Hour of Day in H2O Datasets
Delete All H2O R Logs
Print H2O cluster info
h2o.getGLMFullRegularizationPath
Extract full regularization path from a GLM model
Build a Deep Neural Network model using CPUs
Find synonyms using a word2vec model.
Extracts the final training average objective function of a GLM model.
Import Files into H2O
Get an R Reference to an H2O Dataset, that will NOT be GC'd by default
Get the between cluster sum of squares
Retrieve the cluster sizes
Cut H2O Numeric Data to Factor
Convert Milliseconds to Day of Month in H2O Datasets
H2O Description of A Dataset
Filter NA Columns
Plot ROC curve per protected group.
Retrieves an instance of H2OSegmentModels for a given id.
Shannon entropy
Return the coefficients that can be applied to the non-standardized data.
Determine if an H2O cluster is up or not
Basic Imputation of H2O Vectors
H2O Infogram
Get back-end distributed key/value store id from an H2OFrame.
SHAP summary plot for one feature with protected groups on y-axis.
Correlation of columns.
h2o.gains_lift_plot,H2OModel-method
Plot Gains/Lift curves
Connect to a running H2O instance.
Import SQL table that is result of SELECT SQL query into H2O
Fetchces a single tree of a H2O model. This function is intended to be used on Gradient Boosting Machine models or Distributed Random Forest models.
Returns the number of rows and columns for an H2OFrame object.
h2o.cross_validation_models
Retrieve the cross-validation models
Check if factor
Insert Missing Values into an H2OFrame
Extract the maximum lambda value used during lambda search from glm model.
Fit a General Additive Model
H2O Grid Support
Dump the stack into the JVM's stdout.
Fit a generalized linear model
Check if numeric
Compute the cosine of x
h2o.find_row_by_threshold
Find the threshold, give the max metric. No duplicate thresholds allowed
Conduct a lag 1 transform on a numeric H2OFrame column
Retrieve the leaderboard from the AutoML instance.
h2o.cross_validation_predictions
Retrieve the cross-validation predictions
Get the Time Zone on the H2O cluster
Returns a string
Strip set from left
Retrieve actual number of trees for tree algorithms
Drops duplicated rows.
Generalized low rank decomposition of an H2O data frame
h2o.infogram_train_subset_models
Train models over subsets selected using infogram
Build an Isotonic Regression model
Value Matching in H2O
Download the model in MOJO format.
List all of the Time Zones Acceptable by the H2O cluster.
Convert Milliseconds to Months in H2O Datasets
Return list of jobs performed by the H2O cluster
fillNA
h2o.get_gam_knot_column_names
Extracts the gam column names corresponding to the knot locations from model output if it is enabled.
h2o.inspect_model_fairness
Produce plots and dataframes related to a single model fairness.
Compute msec since the Unix Epoch
H2O ModelSelection is used to build the best model with one predictor, two predictors, ... up to max_predictor_number
specified in the algorithm parameters when mode=allsubsets. The best model is the one with the highest R2 value. When
mode=maxr, the model returned is no longer guaranteed to have the best R2 value.
Performs k-means clustering on an H2O dataset
Initialize and Connect to H2O
Feature interactions and importance, leaf statistics and split value histograms in a tabular form.
Available for XGBoost and GBM.
View H2O R Logs
Access H2O Gains/Lift Tables
Extracts the best R2 values for all predictor subset size.
Trains an Isolation Forest model
h2o.mean_residual_deviance
Retrieve the Mean Residual Deviance value
Get a parse setup back for the staged data.
List all H2O registered extensions
Loads previously saved grid with all it's models from the same folder
h2o.multinomial_auc_table
Retrieve the all AUC values in a table (One to Rest, One to One, macro and weighted average)
for mutlinomial classification.
Get an list of all model ids present in the cluster
Create a leaderboard from a list of models, grids and/or automls.
List Keys on an H2O Cluster
Return the product of all the values present in its arguments.
Plot Pareto front
Imports a MOJO under given path, creating a Generic model with it.
Download the Scoring POJO (Plain Old Java Object) of an H2O Model
Compute the log1p of x
Group and Apply by Column
Compute the log10 of x
Plot partial dependencies for a variable across multiple models
Count of substrings >= 2 chars that are contained in file
H2O Median
h2o.multinomial_aucpr_table
Retrieve the all PR AUC values in a table (One to Rest, One to One, macro and weighted average)
for mutlinomial classification.
h2o.negative_log_likelihood
Extracts the final training negative log likelihood of a GLM model.
Execute a Rapids expression.
View Network Traffic Speed
List registered API extensions
Compute A Histogram
Create Model Metrics from predicted and actual values in H2O
h2o.get_predictors_added_per_step
Extracts the predictor added to model at each step.
Categorical Interaction Feature Creation in H2O
String Global Substitute
Convert Archetypes to Features from H2O GLRM Model
Imports a generic model into H2O. Such model can be used then used for scoring and obtaining
additional information about the model. The imported model has to be supported by H2O.
Log a message on the server-side logs
Import SQL Table into H2O
List registered core extensions
Triggers auto-recovery resume - this will look into configured recovery dir and resume and
tasks that were interrupted by unexpected cluster stopping.
Quantiles of H2O Frames.
Converts a frame to key-value representation while optionally skipping NA values.
Inverse operation to h2o.pivot.
Imports a model under given path, creating a Generic model with it.
Merge Two H2O Data Frames
Returns the maxima of the input values.
Load frame previously stored in H2O's native format.
Compute the logarithm of x
Get an R object that is a subclass of H2OAutoML
Delete Columns from an H2OFrame
H2O Model Metric Accessor Functions
Partial Dependence Plots
Get best model of a given family/algorithm for a given criterion from an AutoML object.
Check Client Mode Connection
Model Prediction Correlation
Return the number of columns present in x.
String length
Method on Keyed
objects allowing to obtain their key.
h2o.find_threshold_by_max_metric
Find the threshold, give the max metric
h2o.model_correlation_heatmap
Model Prediction Correlation Heatmap
Retrieve the mean per class error
Start Writing H2O R Logs
Enable Progress Bar
Retrieve the default Qini value
Shut Down H2O Instance
Remove Rows With NAs
h2o.get_predictors_removed_per_step
Extracts the predictor removed to model at each step.
Principal component analysis of an H2O data frame
Import Hive Table into H2O
Retrieve the R2 value
Store frame data in H2O's native format.
Compute the log2 of x
Retrieve the Hit Ratios
Build gradient boosted classification or regression trees
Calculates Friedman and Popescu's H statistics, in order to test for the presence of an interaction between specified variables in h2o gbm and xgb models.
H varies from 0 to 1. It will have a value of 0 if the model exhibits no interaction between specified variables and a correspondingly larger value for a
stronger interaction effect between them. NaN is returned if a computation is spoiled by weak main effects and rounding errors.
This function will add a new column rank where the ranking is produced as follows:
1. sorts the H2OFrame by columns sorted in by columns specified in group_by_cols and sort_cols in the directions
specified by the ascending for the sort_cols. The sort directions for the group_by_cols are ascending only.
2. A new rank column is added to the frame which will contain a rank assignment performed next. The user can
choose to assign a name to this new column. The default name is New_Rank_column.
3. For each groupby groups, a rank is assigned to the row starting from 1, 2, ... to the end of that group.
4. If sort_cols_sorted is TRUE, a final sort on the frame will be performed frame according to the sort_cols and
the sort directions in ascending. If sort_cols_sorted is FALSE (by default), the frame from step 3 will be
returned as is with no extra sort. This may provide a small speedup if desired.
Produce a k-fold column vector.
Extracts the knot locations from model output if it is enabled.
h2o.permutation_importance_plot
Plot Permutation Variable Importances.
Remove All Objects on the H2O Cluster
Retrieve the Mean Absolute Error Value
iSAX
Strip set from right
Save an H2O Model Object as Mojo to Disk
Pivot a frame
Compute the frame's sum by-column (or by-row).
Return the Head or Tail of an H2O Dataset.
Print An H2OFrame
Compute the frame's mean by-column (or by-row).
Trains a Support Vector Machine model on an H2O dataset
Learning Curve Plot
Saves an existing Grid of models into a given folder.
H2O topN
Return the levels from the column requested column.
H2O Prediction from R without having H2O running
This function returns the table with estimated coefficients and language representations (in case it is a rule)
for each of the significant baselearners.
Create Target Encoding Map
Model Performance Metrics in H2O
h2o.shap_explain_row_plot
SHAP Local Explanation
Reorders levels of an H2O factor, similarly to standard R's relevel.
Retrieve the log likelihood value
Save an H2O Model Object to Disk
Plot Standardized Coefficient Magnitudes
Compute naive Bayes probabilities on an H2O dataset.
H2O Prediction from R without having H2O running
Kolmogorov-Smirnov metric for binomial models
H2O Prediction from R without having H2O running
Compute the hyperbolic tangent of x
Column names of an H2OFrame
Set Levels of H2O Factor Column
h2o.predicted_vs_actual_by_variable
Calculates per-level mean of predicted value vs actual value for a given variable.
h2o.varimp,H2OFrame-method
Retrieve the variable importance.
Scree Plot
Get the number of factor levels for this frame.
Retrieve the Log Loss Value
Check if character
H2O Data Parsing
Singular value decomposition of an H2O data frame using the power method
Substring
Plot partial dependence for a variable
Kurtosis of a column
Retrieve GLM Model Score History buried in GAM model
Which index contains the min value?
Save an H2O Model Details
Apply Target Encoding Map to Frame
Deprecated - use h2o.save_mojo instead. Save an H2O Model Object as Mojo to Disk
Compute the sine of x
Disable Progress Bar
Round doubles/floats to the given number of significant digits.
h2o.permutation_importance
Calculate Permutation Feature Importance.
Returns the minima of the input values.
Set betas of an existing H2O GLM Model
Count of NAs per column
Retrieve the respective weight matrix
Load H2O Model from HDFS or Local Disk
Return the number of rows present in x.
Predict on an AutoML object
Format AutoML object in user-friendly way
Resume previously stopped grid training.
Retrieve the null degrees of freedom
Retrieves Mean Squared Error Value
Convert strings to lowercase
Retrieve the null deviance
H2O topBottomN
Use GRLM to transform a frame.
Transform words (or sequences of words) to vectors using a word2vec model.
Retrieve the number of iterations.
String Substitute
United States Congressional Voting Records 1984
Retrieve the residual degrees of freedom
Cross Tabulation and Table Creation in H2O
h2o.varimp,H2OModel-method
Retrieve the variable importance.
Delete Objects In H2O
Replicate Elements of Vectors or Lists into H2O
H2O Unique
Print An H2OFrame
Variance of a column or covariance of columns.
Transformation of a categorical variable with a mean value of the target variable
Get the total sum of squares.
Convert Milliseconds to Week of Week Year in H2O Datasets
Convert strings to uppercase
Plot an H2O Model
Predict on an H2O Model
Retrieve the standard deviations of principal components
Reorders levels of factor columns by the frequencies for the individual levels.
initialize,H2OInfogram-method
Method on H2OInfogram
object which in this case is to instantiate and initialize it
Retrieves Root Mean Squared Error Value
Returns a vector containing the minimum and maximum of all the given arguments.
Save contents of this data frame into a Hive table
summary,H2OCoxPHModel-method
Summary method for H2OCoxPHModel objects
Skewness of a column
Scaling and Centering of an H2OFrame
show,H2OParetoFront-method
Show H2OParetoFront
SHAP Summary Plot
Which indices are TRUE?
Retrieve the results to view the best predictor subsets.
plot,H2OParetoFront-method
Plot Pareto front
Build an eXtreme Gradient Boosting model
Build a Random Forest model
Tabulation between Two Columns of an H2OFrame
Trim Space
h2o.varimp,H2OAutoML-method
Retrieve the variable importance.
h2o.residual_analysis_plot
Residual Analysis
Compute element-wise string distances between two H2OFrames
Split an H2O Data Set
Convert Milliseconds to Years in H2O Datasets
Get the Within SS
Plot Variable Importances
Determines whether an XGBoost model can be built
Print method for H2OTable objects
predict_leaf_node_assignment.H2OModel
Predict the Leaf Node Assignment on an H2O Model
Edgar Anderson's Iris Data
h2o.thresholds_and_metric_scores
Retrieve the thresholds and metric scores table
Reset model threshold and return old threshold value.
Computes TF-IDF values for each word in given documents.
String Split
Creates a new Amazon S3 client internally with specified credentials.
Combine H2O Datasets by Rows
Build a RuleFit Model
H2O Segmented-Data Bulk Model Training
Compute the tangent of x
Plot an H2O Infogram
Evaluates validity of the given rules on the given data. Returns a frame with a column per each input rule id,
representing a flag whether given rule is applied to the observation or not.
Format grid object in user-friendly way
Reconstruct Training Data via H2O GLRM Model
staged_predict_proba.H2OModel
Predict class probabilities at each stage of an H2O Model
Retrieve the residual deviance
Variable Importance Heatmap across multiple models
Produce a Vector of Random Uniform Numbers
Round doubles/floats to the given number of decimal places.
Standard Deviation of a column of data.
Range of an H2O Column
Shutdown H2O cluster after examples run
predict_contributions.H2OModel
Predict feature contributions - SHAP values on an H2O Model (only DRF, GBM, XGBoost models and equivalent imported MOJOs).
h2o.transform,H2OWordEmbeddingModel-method
Transform words (or sequences of words) to vectors using a word2vec model.
Set the Time Zone on the H2O cluster
Retrieve the Root Mean Squared Log Error
Retrieve Model Score History
Prostate Cancer Study
Get the total within cluster sum of squares.
Imports a MOJO from a local filesystem, creating a Generic model with it.
Use H2O Transformation model and apply the underlying transformation
h2o.transform,H2OTargetEncoderModel-method
Applies target encoding to a given dataset
Predict on an H2O Model
Upload a binary model from the provided local path to the H2O cluster.
(H2O model can be saved in a binary form either by saveModel() or by download_model() function.)
Summarizes the columns of an H2OFrame.
Which indice contains the max value?
Trains a word2vec model on a String column of an H2O data frame
Build a Uplift Random Forest model
Column names of an H2OFrame
Compute the square root of x
Truncate values in x toward 0
Display the structure of an H2OFrame object
Use optional package
Builds a Stacked Ensemble
Check if factor
Retrieve per-variable split information for a given Isolation Forest model.
Output will include:
- count - The number of times a variable was used to make a split.
- aggregated_split_ratios - The split ratio is defined as "abs(#left_observations - #right_observations) / #before_split".
Even splits (#left_observations approx the same as #right_observations) contribute
less to the total aggregated split ratio value for the given feature;
highly imbalanced splits (eg. #left_observations >> #right_observations) contribute more.
- aggregated_split_depths - The sum of all depths of a variable used to make a split. (If a variable is used
on level N of a tree, then it contributes with N to the total aggregate.)
Check if character
Convert a word2vec model into an H2OFrame
Tokenize String
Stop Writing H2O R Logs
Print the Model Summary
Is H2O Frame object
Retrieve the variable importance.
Display the structure of an H2OFrame object
Plot an H2O Tabulate Heatmap
row_to_tree_assignment.H2OModel
Output row to tree assignment for the model and provided training data.
Format AutoML object in user-friendly way
Check if numeric
Overrides the behavior of length() function on H2OTree class. Returns number of nodes in an H2OTree
Muscular Actuations for Walking Subject
Needed to be able to memoise the models
Scaling and Centering of an H2OFrame