ft_pca

ml_pca

A <code>spark_connection</code>, <code>ml_pipeline</code>, or a <code>tbl_spark</code>.

input_col

output_col

The number of principal components

A character string used to uniquely identify the feature transformer.

Optional arguments; currently unused.

The columns to use in the principal components
analysis. Defaults to all columns in <code>x</code>.

features

Length-one character vector used to prepend names of components.

pc_prefix

PCA trains a model to project vectors to a lower dimensional space of the top k principal components.

R interface to Apache Spark, a fast and general engine for big data
processing, see <http://spark.apache.org>. This package supports connecting to
local and remote Apache Spark clusters, provides a 'dplyr' compatible back-end,
and provides an interface to Spark's built-in machine learning algorithms.

Yitao Li

sparklyr

R Interface to Apache Spark

ft_pca: Feature Transformation -- PCA (Estimator)

Description

Usage

Arguments

Value

Details

See Also

Examples