S3 Group Generic Functions for H2O
H2OConnectionMutableState
The H2OConnectionMutableState class
Extract or Replace Parts of an H2OFrame Object
H2O Grid
The H2OClusteringModel object.
The H2OConnection class.
The H2OCoxPHModel object.
The H2OAutoML class
H2OSegmentModelsFuture-class
H2O Future Segment Models
The H2OFrame class
H2OInfogram class
Converts parsed H2O data into an R data frame
H2O Segment Models
Australia Coastal Data
Convert an H2OFrame to a vector
The H2ONode class.
The H2OTree class.
Column names of an H2OFrame
The H2OSplitNode class.
Virtual Keyed class
wrapper function for instantiating H2OInfogram
The H2OLeafNode class.
.find_appropriate_column_name
Tries to match a fuzzy_col_name
with a column name that exists in cols
.
A helper function that makes it easier to override/add params in a function call.
The H2OModel object.
Starting H2O For examples
The H2OModelMetrics Object.
TODO: No objects in this file are being used. Either remove file or use objects.
H2O Future Model
H2OCoxPHModelSummary-class
The H2OCoxPHModelSummary object.
Stop with a user friendly message if a user is missing the ggplot2 package or has an old version of it.
Convert H2O Data to Numeric
Logical and for H2OFrames
Convert an H2OFrame to a matrix
as.data.frame.H2OSegmentModels
Converts a collection of Segment Models to a data.frame
Get the algoritm used by the model_or_model_id
Enhance leaderboard with per-model predictions.
Compute the absolute value of x
Given a set of logical vectors, is at least one of the values true?
Administrative Endpoints
.is_plotting_to_rnotebook
Check if we are plotting in to r notebook.
Retrieve HGLM ModelMetrics
H2O ANOVAGLM is used to calculate Type III SS which is used to evaluate the contributions of individual predictors
and their interactions to a model. Predictors or interactions with negligible contributions to the model will have
high p-values while those with more contributions will have low p-values.
.process_models_or_automl
Do basic validation and transform object
to a "standardized" list containing models, and
their properties such as x
, y
, whether it is a (multinomial) clasification or not etc.
Import/Export Endpoints
Get a mapping between columns and their domains
Consolidate variable importances
Rapids Endpoint
Internal function that calculates a precise AUC from given
probabilities and actual responses.
Log and Echo Endpoint
Model Endpoint
Parse Endpoints
Map of operations known to H2O
Convert H2O Data to Characters
Compute the arc cosine of x
Retrieve the default AECU (Average Excess Cumulative Uplift = area between AUUC and random AUUC)
Helper Collapse Function
Logical or for H2OFrames
Accessor Methods for H2OModel Object
Apply on H2O Datasets
Convert H2O Data to Factors
Compute the hyperbolic cosine of x
Capabilities endpoints
Return coefficients fitted on the standardized data (requires standardize = True, which is on by default). These coefficients can be used to evaluate variable importance.
Convert an H2OFrame to a String
.h2o.__SEGMENT_MODELS_BUILDERS
Segment Models Builder Endpoint Generator
H2OFrame Manipulation
H2O Package Constants
Returns the column names of an H2OFrame
Check H2OFrame columns for factors
H2O bottomN
Inspect/Summary Endpoints
Retrieve the Model Centers STD
Create a leaderboard like data frame for models
Convert H2O Data to Factors
Removal Endpoints
Perform a low-level HTTP POST operation on an H2O instance
fillNA
Create H2OFrame
Decryption Endpoints
Perform a REST API request to a previously connected server.
Combine H2O Datasets by Columns
Perform a safe (i.e. error-checked) HTTP GET request to an H2O cluster.
Get the between cluster sum of squares
Returns the Dimensions of an H2OFrame
Automatic Machine Learning
Retrieve the Model Centers
Get first of family models
Get feature count sorted by the count descending.
Model Builder Endpoint Generator
Imports a generic model into H2O. Such model can be used then used for scoring and obtaining
additional information about the model. The imported model has to be supported by H2O.
Return the cumulative product over a column or across a row
Retrieve AUUC
Get Model Ids
Perform a safe (i.e. error-checked) HTTP POST request to an H2O cluster.
Return the coefficients table with coefficients, standardized coefficients, p-values, z-values and std-error for GLM models
feature_frequencies.H2OModel
Retrieve the number of occurrences of each feature for given observations
Available for GBM, Random Forest and Isolation Forest models.
Min-max normalization.
Retrieve the cluster sizes
Is the model
a Tree-based H2O Model?
.h2o.__checkConnectionHealth
Check H2O Server Health
Delete All H2O R Logs
Return the cumulative min over a column or across a row
CHeck to see if the column names/indices entered is valid for the dataframe given. This is an internal function
Word2Vec Endpoints
Print H2O cluster info
Extract best alpha value found from glm model.
Return the respective bias vector
Model Metrics Endpoint
Check if Progress Bar is Enabled
Download H2O Data to Disk
Connect to a running H2O instance.
Feature Generation via H2O Deep Learning
h2o.find_row_by_threshold
Find the threshold, give the max metric. No duplicate thresholds allowed
Correlation of columns.
Download the model in binary format.
The owner of the file saved is the user by which python session was executed.
Return the coefficients that can be applied to the non-standardized data.
Build a Deep Neural Network model using CPUs
Retrieve the AUC
Export Files Endpoint Generator
h2o.find_threshold_by_max_metric
Find the threshold, give the max metric
Retrieve normalized AUUC
Just like doRawGET but fills in the default h2oRestApiVersion if none is provided
Just like doRawPOST but fills in the default h2oRestApiVersion if none is provided
Get back-end distributed key/value store id from an H2OFrame.
Get h2o version
H2O Grid Support
Search for matches to an argument pattern
Get a grid object from H2O distributed K/V store.
Get the types-per-column
Is the model considered to be interpretable, i.e., simple enough.
Perform a low-level HTTP GET operation on an H2O instance
Get the seed from H2OModel which was used during training.
If a user does not set the seed parameter before training, the seed is autogenerated.
It returns seed as the string if the value is bigger than the integer.
For example, an autogenerated seed is always long so that the seed in R is a string.
Import SQL table that is result of SELECT SQL query into H2O
Is the model
an H2O model?
Sorts an H2O frame by columns
Locate a file given the pattern <bucket>/<path/to/file>
e.g. h2o:::.h2o.locate("smalldata/iris/iris22.csv") returns the absolute path to iris22.csv
Shortens model ids if possible (iff there will be same amount of unique model_ids as before)
H2O <-> R Communication and Utility Methods
Convert to quantiles when provided with numeric vector.
When col is a factor vector assign uniformly value between 0 and 1 to each level.
Retrieves Mean Squared Error Value
Convert between character representations and objects of Date class
Data H2OFrame Creation in H2O
h2o.cross_validation_models
Retrieve the cross-validation models
The H2O Package Environment
Has the model variable importance?
Return the cumulative max over a column or across a row
Create Model Metrics from predicted and actual values in H2O
Retrieve the all types of AECU (average excess cumulative uplift) value in a table
Get variable importance in a standardized way.
Plot variable importances with ggplot2
Used to verify data, x, y and turn into the appropriate things
Access H2O Confusion Matrices
Retrieve the all types of AUUC in a table
Retrieve an aggregated frame from an Aggregator model
h2o.cross_validation_fold_assignment
Retrieve the cross-validation fold assignment
Extract best lambda value found from glm model.
h2o.mean_residual_deviance
Retrieve the Mean Residual Deviance value
Get an R reference to an H2O model
Retrieve the mean per class error
Anomaly Detection via H2O Deep Learning Model
Given a set of logical vectors, are all of the values true?
Return the status of the cluster
Trains a Cox Proportional Hazards Model (CoxPH) on an H2O dataset
Filter NA Columns
Download the model in MOJO format.
H2O Description of A Dataset
Conduct a lag 1 transform on a numeric H2OFrame column
Download the Scoring POJO (Plain Old Java Object) of an H2O Model
h2o.cross_validation_predictions
Retrieve the cross-validation predictions
Rename an H2O object.
Categorical Interaction Feature Creation in H2O
Import SQL Table into H2O
Convert Milliseconds to Day of Month in H2O Datasets
h2o.gains_lift_plot,H2OModelMetrics-method
Plot Gains/Lift curves
Compute weighted gram matrix.
Compute a pairwise distance measure between all rows of two numeric H2OFrames.
Retrieve the AUCPR (Area Under Precision Recall Curve)
Retrieve centroid statistics
Drops duplicated rows.
H2O R Interface
Access H2O Gains/Lift Tables
Compute the cosine of x
Determine if an H2O cluster is up or not
Fit a General Additive Model
Retrieve the leaderboard from the AutoML instance.
Find synonyms using a word2vec model.
Build gradient boosted classification or regression trees
iSAX
Return the cumulative sum over a column or across a row
Shannon entropy
Check Client Mode Connection
Method on Keyed
objects allowing to obtain their key.
Trains an Isolation Forest model
Basic Imputation of H2O Vectors
Cut H2O Numeric Data to Factor
Build an Aggregated Frame
Generate Model Explanations
Extract the minimum lambda value calculated during lambda search from glm model.
Note that due to early stop, this minimum lambda value may not be used in the actual lambda search.
List registered API extensions
Imports a MOJO under given path, creating a Generic model with it.
Export a Model to HDFS
Check if character
h2o.extendedIsolationForest
Trains an Extended Isolation Forest model
Take a single numeric argument and return a numeric vector with the largest integers
Retrieve the Akaike information criterion (AIC) value
H2O Prediction from R without having H2O running
Retrieves an instance of H2OSegmentModels for a given id.
Load H2O Model from HDFS or Local Disk
Convert H2O Data to Numerics
Load frame previously stored in H2O's native format.
Retrieve actual number of trees for tree algorithms
Returns the number of rows and columns for an H2OFrame object.
Model Prediction Correlation
Get an R object that is a subclass of H2OAutoML
Column names of an H2OFrame
h2o.gains_lift_plot,H2OModel-method
Plot Gains/Lift curves
Calculates Friedman and Popescu's H statistics, in order to test for the presence of an interaction between specified variables in h2o gbm and xgb models.
H varies from 0 to 1. It will have a value of 0 if the model exhibits no interaction between specified variables and a correspondingly larger value for a
stronger interaction effect between them. NaN is returned if a computation is spoiled by weak main effects and rounding errors.
Initialize and Connect to H2O
Kurtosis of a column
Insert Missing Values into an H2OFrame
Return the Head or Tail of an H2O Dataset.
Retrieve an H2O Connection
Learning Curve Plot
h2o.get_best_model_predictors
Extracts the subset of predictor names that yield the best R2 value for each predictor subset size.
Retrieve the GINI Coefficcient
Split H2O Dataset, Apply Function, and Return Results
Compute the logarithm of x
Compute naive Bayes probabilities on an H2O dataset.
H2O Median
Group and Apply by Column
H2O ModelSelection is used to build the best model with one predictor, two predictors, ... up to max_predictor_number
specified in the algorithm parameters when mode=allsubsets. The best model is the one with the highest R2 value. When
mode=maxr, the model returned is no longer guaranteed to have the best R2 value.
Get best model of a given family/algorithm for a given criterion from an AutoML object.
Retrieve the Hit Ratios
Compute A Histogram
String Global Substitute
Loads previously saved grid with all it's models from the same folder
Compute the log10 of x
Imports a model under given path, creating a Generic model with it.
Evaluates validity of the given rules on the given data. Returns a frame with a column per each input rule id,
representing a flag whether given rule is applied to the observation or not.
Setup a Decryption Tool
Compute the log1p of x
Compute the log2 of x
List all of the Time Zones Acceptable by the H2O cluster.
Get an list of all model ids present in the cluster
h2o.residual_analysis_plot
Residual Analysis
Delete Objects In H2O
Return the levels from the column requested column.
H2O Infogram
Extract the maximum lambda value used during lambda search from glm model.
Retrieve the R2 value
List Keys on an H2O Cluster
Import Hive Table into H2O
Generate Model Explanations for a single row
Download H2O Log Files to Disk
Remove Rows With NAs
Strip set from left
h2o.multinomial_aucpr_table
Retrieve the all PR AUC values in a table (One to Rest, One to One, macro and weighted average)
for mutlinomial classification.
Take a single numeric argument and return a numeric vector with the smallest integers
h2o.cross_validation_holdout_predictions
Retrieve the cross-validation holdout predictions
Return column names of an H2OFrame
Compute the exponential function of x
Compute DCT of an H2OFrame
Return the number of columns present in x.
Obtain a list of columns that are specified by `coltype`
Return the number of rows present in x.
View Network Traffic Speed
Export an H2O Data Frame (H2OFrame) to a File or to a collection of Files.
Feature interactions and importance, leaf statistics and split value histograms in a tabular form.
Available for XGBoost and GBM.
Returns the minima of the input values.
Compute msec since the Unix Epoch
H2O Segmented-Data Bulk Model Training
h2o.getGLMFullRegularizationPath
Extract full regularization path from a GLM model
Open H2O Flow
Convert Milliseconds to Day of Week in H2O Datasets
Log a message on the server-side logs
Get the total within cluster sum of squares.
h2o.multinomial_auc_table
Retrieve the all AUC values in a table (One to Rest, One to One, macro and weighted average)
for mutlinomial classification.
Partial Dependence Plots
Retrieve the Log Loss Value
Get an R Reference to an H2O Dataset, that will NOT be GC'd by default
Converts a frame to key-value representation while optionally skipping NA values.
Inverse operation to h2o.pivot.
Returns a vector containing the minimum and maximum of all the given arguments.
Count of NAs per column
Retrieves Root Mean Squared Error Value
List registered core extensions
Fetchces a single tree of a H2O model. This function is intended to be used on Gradient Boosting Machine models or Distributed Random Forest models.
Plot Gains/Lift curves
Convert Milliseconds to Hour of Day in H2O Datasets
Extracts the best R2 values for all predictor subset size.
Get the Time Zone on the H2O cluster
Returns a string
Count of substrings >= 2 chars that are contained in file
View H2O R Logs
Check if factor
Performs k-means clustering on an H2O dataset
Kolmogorov-Smirnov metric for binomial models
Generalized low rank decomposition of an H2O data frame
Fit a generalized linear model
Disable Progress Bar
Returns the maxima of the input values.
Compute the frame's mean by-column (or by-row).
Plot Individual Conditional Expectation (ICE) for each decile
Column names of an H2OFrame
Retrieve the Mean Absolute Error Value
H2O Model Metric Accessor Functions
Check if numeric
Set betas of an existing H2O GLM Model
Merge Two H2O Data Frames
Print An H2OFrame
Plot partial dependencies for a variable across multiple models
Produce a k-fold column vector.
Dump the stack into the JVM's stdout.
Compute the frame's sum by-column (or by-row).
Quantiles of H2O Frames.
h2o.thresholds_and_metric_scores
Retrieve the thresholds and metric scores table
Execute a Rapids expression.
Return the product of all the values present in its arguments.
Reconstruct Training Data via H2O GLRM Model
Combine H2O Datasets by Rows
Retrieve GLM Model Score History buried in GAM model
Triggers auto-recovery resume - this will look into configured recovery dir and resume and
tasks that were interrupted by unexpected cluster stopping.
Convert Milliseconds to Months in H2O Datasets
Shut Down H2O Instance
Predict on an H2O Model
List all H2O registered extensions
Retrieve the residual deviance
H2O Data Parsing
Builds a Stacked Ensemble
H2O Prediction from R without having H2O running
Singular value decomposition of an H2O data frame using the power method
Reorders levels of factor columns by the frequencies for the individual levels.
H2O Apply Conditional Statement
Import Files into H2O
Reset model threshold and return old threshold value.
Which indice contains the max value?
Needed to be able to memoise the models
Start Writing H2O R Logs
h2o.varimp,H2OAutoML-method
Retrieve the variable importance.
Summarizes the columns of an H2OFrame.
Which indices are TRUE?
This function will add a new column rank where the ranking is produced as follows:
1. sorts the H2OFrame by columns sorted in by columns specified in group_by_cols and sort_cols in the directions
specified by the ascending for the sort_cols. The sort directions for the group_by_cols are ascending only.
2. A new rank column is added to the frame which will contain a rank assignment performed next. The user can
choose to assign a name to this new column. The default name is New_Rank_column.
3. For each groupby groups, a rank is assigned to the row starting from 1, 2, ... to the end of that group.
4. If sort_cols_sorted is TRUE, a final sort on the frame will be performed frame according to the sort_cols and
the sort directions in ascending. If sort_cols_sorted is FALSE (by default), the frame from step 3 will be
returned as is with no extra sort. This may provide a small speedup if desired.
Variance of a column or covariance of columns.
United States Congressional Voting Records 1984
h2o.predicted_vs_actual_by_variable
Calculates per-level mean of predicted value vs actual value for a given variable.
Reorders levels of an H2O factor, similarly to standard R's relevel.
Retrieve the number of iterations.
String length
Round doubles/floats to the given number of significant digits.
Retrieve the null degrees of freedom
Delete Columns from an H2OFrame
H2O Unique
Scaling and Centering of an H2OFrame
Round doubles/floats to the given number of decimal places.
Replicate Elements of Vectors or Lists into H2O
H2O topBottomN
initialize,H2OInfogram-method
Method on H2OInfogram
object which in this case is to instantiate and initialize it
Return list of jobs performed by the H2O cluster
Retrieve the Root Mean Squared Log Error
Pivot a frame
Get a parse setup back for the staged data.
SHAP Summary Plot
Value Matching in H2O
Get the number of factor levels for this frame.
h2o.model_correlation_heatmap
Model Prediction Correlation Heatmap
H2O Prediction from R without having H2O running
Retrieve the null deviance
Store frame data in H2O's native format.
Convert strings to uppercase
Retrieve the default Qini value
Resume previously stopped grid training.
Saves an existing Grid of models into a given folder.
Produce a Vector of Random Uniform Numbers
Column names of an H2OFrame
Retrieve the residual degrees of freedom
Is H2O Frame object
Principal component analysis of an H2O data frame
Build a Random Forest model
Strip set from right
Build a RuleFit Model
h2o.varimp,H2OFrame-method
Retrieve the variable importance.
Apply Target Encoding Map to Frame
Save an H2O Model Object as Mojo to Disk
Retrieve Model Score History
Scree Plot
h2o.shap_explain_row_plot
SHAP Local Explanation
String Split
Get the Within SS
Format AutoML object in user-friendly way
String Substitute
H2O topN
Retrieve per-variable split information for a given Isolation Forest model.
Output will include:
- count - The number of times a variable was used to make a split.
- aggregated_split_ratios - The split ratio is defined as "abs(#left_observations - #right_observations) / #before_split".
Even splits (#left_observations approx the same as #right_observations) contribute
less to the total aggregated split ratio value for the given feature;
highly imbalanced splits (eg. #left_observations >> #right_observations) contribute more.
- aggregated_split_depths - The sum of all depths of a variable used to make a split. (If a variable is used
on level N of a tree, then it contributes with N to the total aggregate.)
Which index contains the min value?
Tokenize String
Model Performance Metrics in H2O
h2o.permutation_importance_plot
Plot Permutation Variable Importances.
Trains a Support Vector Machine model on an H2O dataset
Print method for H2OTable objects
h2o.permutation_importance
Calculate Permutation Feature Importance.
Trim Space
Check if numeric
Convert Archetypes to Features from H2O GLRM Model
Plot partial dependence for a variable
Truncate values in x toward 0
h2o.transform,H2OWordEmbeddingModel-method
Transform words (or sequences of words) to vectors using a word2vec model.
Set Levels of H2O Factor Column
Print the Model Summary
Build a Uplift Random Forest model
Save an H2O Model Details
Save an H2O Model Object to Disk
Plot Standardized Coefficient Magnitudes
Cross Tabulation and Table Creation in H2O
Use optional package
Overrides the behavior of length() function on H2OTree class. Returns number of nodes in an H2OTree
staged_predict_proba.H2OModel
Predict class probabilities at each stage of an H2O Model
Standard Deviation of a column of data.
Compute the sine of x
Save contents of this data frame into a Hive table
Set the Time Zone on the H2O cluster
Tabulation between Two Columns of an H2OFrame
predict_leaf_node_assignment.H2OModel
Predict the Leaf Node Assignment on an H2O Model
Retrieve the variable importance.
Computes TF-IDF values for each word in given documents.
Stop Writing H2O R Logs
Transformation of a categorical variable with a mean value of the target variable
h2o.transform,H2OTargetEncoderModel-method
Applies target encoding to a given dataset
Enable Progress Bar
Suppresses h2o progress output from expr
Creates a new Amazon S3 client internally with specified credentials.
Display the structure of an H2OFrame object
Use H2O Transformation model and apply the underlying transformation
Split an H2O Data Set
Compute the square root of x
Trains a word2vec model on a String column of an H2O data frame
Compute the hyperbolic tangent of x
Compute the tangent of x
Determines whether an XGBoost model can be built
Build an eXtreme Gradient Boosting model
h2o.varimp,H2OModel-method
Retrieve the variable importance.
Deprecated - use h2o.save_mojo instead. Save an H2O Model Object as Mojo to Disk
Remove All Objects on the H2O Cluster
Substring
Convert strings to lowercase
Plot an H2O Model
Convert Milliseconds to Years in H2O Datasets
Transform words (or sequences of words) to vectors using a word2vec model.
Convert a word2vec model into an H2OFrame
Variable Importance Heatmap across multiple models
Display the structure of an H2OFrame object
Plot an H2O Infogram
Retrieve the standard deviations of principal components
Format AutoML object in user-friendly way
Skewness of a column
Check if character
Create Target Encoding Map
Get the total sum of squares.
Plot an H2O Tabulate Heatmap
Convert Milliseconds to Week of Week Year in H2O Datasets
Scaling and Centering of an H2OFrame
Shutdown H2O cluster after examples run
Print An H2OFrame
Imports a MOJO from a local filesystem, creating a Generic model with it.
Compute element-wise string distances between two H2OFrames
Predict on an AutoML object
Retrieve the respective weight matrix
Range of an H2O Column
Check if factor
Edgar Anderson's Iris Data
Upload a binary model from the provided local path to the H2O cluster.
(H2O model can be saved in a binary form either by saveModel() or by download_model() function.)
Plot Variable Importances
Format grid object in user-friendly way
summary,H2OCoxPHModel-method
Summary method for H2OCoxPHModel objects
Prostate Cancer Study
predict_contributions.H2OModel
Predict feature contributions - SHAP values on an H2O Model (only DRF, GBM, XGBoost models and equivalent imported MOJOs).
Predict on an H2O Model
Muscular Actuations for Walking Subject