ft_ngram

x

A <code>spark_connection</code>, <code>ml_pipeline</code>, or a <code>tbl_spark</code>.

input_col

The name of the input column.

output_col

The name of the output column.

n

Minimum n-gram length, greater than or equal to 1. Default: 2 (bigram features).

uid

A character string used to uniquely identify the feature transformer.

...

Optional arguments; currently unused.

A feature transformer that converts the input array of strings into an array of n-grams. Null values in the input array are ignored. It returns an array of n-grams where each n-gram is represented by a space-separated string of words.
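The n-gram construction described above can be illustrated without a Spark connection. The following is a minimal plain-R sketch, not the Spark implementation: `ngrams_sketch` is a hypothetical helper name introduced only for this example, R's `NA` stands in for Spark null values, and returning an empty vector when the input holds fewer than `n` strings is an assumption based on the documented behaviour of Spark's NGram transformer.

```r
# Plain R sketch of n-gram construction; NOT the Spark implementation.
# `ngrams_sketch` is a hypothetical helper used only for illustration.
ngrams_sketch <- function(tokens, n = 2L) {
  stopifnot(n >= 1)

  # Drop missing values, mirroring "null values in the input array are ignored".
  tokens <- tokens[!is.na(tokens)]

  # Assumption: fewer than n tokens yields no n-grams.
  if (length(tokens) < n) {
    return(character(0))
  }

  # Slide a window of width n over the tokens and join each window with spaces.
  starts <- seq_len(length(tokens) - n + 1L)
  vapply(
    starts,
    function(i) paste(tokens[i:(i + n - 1L)], collapse = " "),
    character(1)
  )
}

bigrams <- ngrams_sketch(c("the", "quick", "brown", "fox"), n = 2)
print(bigrams)
```

With `n = 2`, each output element joins two adjacent words with a single space, which matches the space-separated representation described above. Setting `n = 1` returns the non-missing tokens unchanged.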

R interface to Apache Spark, a fast and general engine for big data
processing, see <http://spark.apache.org>. This package supports connecting to
local and remote Apache Spark clusters, provides a 'dplyr' compatible back-end,
and provides an interface to Spark's built-in machine learning algorithms.

Javier Luraschi

ft_ngram: Feature Transformation -- NGram (Transformer)
