h2o.splitFrame

data

An H2OFrame object representing the dataset to split.

ratios

A numeric value or array indicating the fraction of total rows
contained in each split. The values must sum to less than 1.

destination_frames

An array of frame IDs whose length equals the number of ratios
specified plus one.

seed

A random seed that makes the probabilistic split reproducible.
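
The arguments above can be combined as in the following minimal sketch. It assumes the h2o package is installed and a local cluster can be started with h2o.init(); the frame IDs and the seed value are illustrative choices, not required names.

```r
library(h2o)

# Start (or connect to) a local H2O cluster; requires Java and the h2o package.
h2o.init()

# Upload the built-in iris data set as an H2OFrame.
iris_hf <- as.h2o(iris)

# Two ratios produce three splits: roughly 60%, 20%, and the remaining ~20%.
splits <- h2o.splitFrame(
  data = iris_hf,
  ratios = c(0.6, 0.2),
  # Illustrative frame IDs: one per ratio, plus one for the remainder.
  destination_frames = c("iris_train", "iris_valid", "iris_test"),
  seed = 1234  # arbitrary value, chosen only for reproducibility
)

train <- splits[[1]]
valid <- splits[[2]]
test  <- splits[[3]]

# Row counts per split; these are approximate, not an exact 60/20/20 division.
sapply(splits, nrow)
```

The result is a list of H2OFrame objects, so the individual splits are accessed by position.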

Split an existing H2O data set according to user-specified ratios. The number of
subsets is always 1 more than the number of given ratios. Note that this does not give
an exact split. H2O is designed to be efficient on big data using a probabilistic
splitting method rather than an exact split. For example, when specifying a split of
0.75/0.25, H2O will produce a train/test split with an expected value of 0.75/0.25
rather than exactly 0.75/0.25. On small datasets, the sizes of the resulting splits
will deviate from the expected value more than on big data, where they will be very
close to exact.
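
To illustrate why a probabilistic split is only approximately exact, here is a small base R sketch. It is one plausible way to picture the idea (one uniform random draw per row, bucketed by cumulative ratios) and is not necessarily how H2O implements it internally; probabilistic_split is a hypothetical helper, not part of the h2o package.

```r
# Hypothetical helper for illustration only; not part of the h2o package.
# Returns, for each row, the index of the split it is assigned to.
probabilistic_split <- function(n_rows, ratios, seed = 42) {
  stopifnot(all(ratios > 0), sum(ratios) < 1)
  set.seed(seed)
  u <- runif(n_rows)                  # one uniform draw per row
  breaks <- c(0, cumsum(ratios), 1)   # bucket boundaries on [0, 1]
  # findInterval maps each draw to a bucket index in 1..(length(ratios) + 1)
  findInterval(u, breaks, rightmost.closed = TRUE)
}

# Observed fraction of rows in each split for a given data set size.
split_fractions <- function(n_rows, ratios) {
  assignment <- probabilistic_split(n_rows, ratios)
  tabulate(assignment, nbins = length(ratios) + 1) / n_rows
}

# On a small data set the fractions can drift noticeably from 0.75/0.25;
# on a large one they land very close to the requested ratios.
split_fractions(100, 0.75)
split_fractions(1e6, 0.75)
```

Because each row is assigned independently, the split sizes are random with the requested ratios as their expected values, and the relative deviation shrinks as the number of rows grows.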

R interface for 'H2O', the scalable open source machine learning
platform that offers parallelized implementations of many supervised and
unsupervised machine learning algorithms such as Generalized Linear
Models, Gradient Boosting Machines (including XGBoost), Random Forests,
Deep Neural Networks (Deep Learning), Stacked Ensembles, Naive Bayes, Cox
Proportional Hazards, K-Means, PCA, Word2Vec, as well as a fully automatic
machine learning algorithm (AutoML).

Erin LeDell

Navdeep Gill

Spencer Aiello

Anqi Fu

Arno Candel

Cliff Click

Tom Kraljevic

Tomas Nykodym

Patrick Aboyoun

Michal Kurka

Michal Malohlava

Ludi Rehak

Eric Eckstrand

Brandon Hill

Sebastian Vidrio

Surekha Jadhawani

Amy Wang

Raymond Peck

Wendy Wong

Jan Gorecki

Matt Dowle

Yuan Tang

Lauren DiPerna

H2O.ai 

Split an H2O Data Set

