sparklyr (version 0.5.6)

ft_tokenizer: Feature Transformation -- Tokenizer

Description

A tokenizer that converts the input string to lowercase and then splits it on whitespace.

Usage

ft_tokenizer(x, input.col = NULL, output.col = NULL, ...)

Arguments

x

An object (usually a spark_tbl) coercible to a Spark DataFrame.

input.col

The name of the input column.

output.col

The name of the output column.

...

Optional arguments; currently unused.

See Also

See http://spark.apache.org/docs/latest/ml-features.html for more information on the set of transformations available for DataFrame columns in Spark.

Other feature transformation routines: ft_binarizer, ft_bucketizer, ft_discrete_cosine_transform, ft_elementwise_product, ft_index_to_string, ft_one_hot_encoder, ft_quantile_discretizer, ft_regex_tokenizer, ft_sql_transformer, ft_string_indexer, ft_vector_assembler, sdf_mutate
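Examples

A minimal sketch of using ft_tokenizer against a local Spark connection. The sample sentences and the column names "text" and "words" are illustrative assumptions, not part of the function's documentation.

library(sparklyr)
library(dplyr)

# Connect to a local Spark instance (assumes Spark is installed locally)
sc <- spark_connect(master = "local")

# Copy a small illustrative data frame to Spark
sentences_tbl <- copy_to(sc, data.frame(
  text = c("Hello Spark", "Tokenizers split TEXT on whitespace"),
  stringsAsFactors = FALSE
))

# Lowercase the 'text' column and split it on whitespace,
# producing a new 'words' column of token arrays
tokenized_tbl <- ft_tokenizer(sentences_tbl,
                              input.col = "text",
                              output.col = "words")

spark_disconnect(sc)

For regular-expression-based splitting rather than plain whitespace, see ft_regex_tokenizer.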