ft_standard_scaler

A <code>spark_connection</code>, <code>ml_pipeline</code>, or a <code>tbl_spark</code>.

input_col

output_col

Whether to center the data with mean before scaling. It will
build a dense output, so take care when applying to sparse input. Default: FALSE

with_mean

Whether to scale the data to unit standard deviation. Default: TRUE

with_std

(Optional) A <code>tbl_spark</code>. If provided, eagerly fit the (estimator)
feature "transformer" against <code>dataset</code>. See details.

dataset

A character string used to uniquely identify the feature transformer.

Optional arguments; currently unused.

Standardizes features by removing the mean and scaling to unit variance using
 column summary statistics on the samples in the training set. The "unit std"
 is computed using the corrected sample standard deviation, which is computed
 as the square root of the unbiased sample variance.

R interface to Apache Spark, a fast and general engine for big data
processing, see <http://spark.apache.org>. This package supports connecting to
local and remote Apache Spark clusters, provides a 'dplyr' compatible back-end,
and provides an interface to Spark's built-in machine learning algorithms.

ft_standard_scaler: Feature Tranformation -- StandardScaler (Estimator)

Description

Usage

Arguments

Value

Details

See Also