ml-tuning

ml_cross_validator

ml_train_validation_split

A <code>spark_connection</code>, <code>ml_pipeline</code>, or a <code>tbl_spark</code>.

A <code>ml_estimator</code> object.

estimator

A named list of stages and hyper-parameter sets to tune. See details.

estimator_param_maps

A <code>ml_evaluator</code> object, see <a rd-options="" href="/link/ml_evaluator?package=sparklyr&version=0.7.0" data-mini-rdoc="sparklyr::ml_evaluator">ml_evaluator</a>.

evaluator

Number of folds for cross validation. Must be &gt;= 2. Default: 3

num_folds

A random seed. Set this value if you need your results to be
reproducible across repeated calls.

seed

A character string used to uniquely identify the ML estimator.

Optional arguments; currently unused.

Ratio between train and validation data. Must be between 0 and 1. Default: 0.75

train_ratio

Perform hyper-parameter tuning using either K-fold cross validation or train-validation split.

R interface to Apache Spark, a fast and general engine for big data
processing, see <http://spark.apache.org>. This package supports connecting to
local and remote Apache Spark clusters, provides a 'dplyr' compatible back-end,
and provides an interface to Spark's built-in machine learning algorithms.

Javier Luraschi

sparklyr

R Interface to Apache Spark

Kevin Kuo

Kevin Ushey

JJ Allaire

 RStudio

 The Apache Software Foundation

ml-tuning function

A <code>ml_evaluator</code> object, see <a rd-options='' href='ml_evaluator'>ml_evaluator</a>.

ml-tuning: Spark ML -- Tuning

Description

Usage

Arguments

Value

Details