ft_binarizer

0th

Percentile

Feature Transformation -- Binarizer

Apply thresholding to a column, such that values less than or equal to the threshold are assigned the value 0.0, and values greater than the threshold are assigned the value 1.0. Column output is numeric for compatibility with other modeling functions.

Usage
ft_binarizer(x, input.col, output.col, threshold = 0.5, ...)
Arguments
x

An object (usually a spark_tbl) coercable to a Spark DataFrame.

input.col

The name of the input column(s).

output.col

The name of the output column.

threshold

The numeric threshold.

...

Optional arguments; currently unused.

See Also

See http://spark.apache.org/docs/latest/ml-features.html for more information on the set of transformations available for DataFrame columns in Spark.

Other feature transformation routines: ft_bucketizer, ft_count_vectorizer, ft_discrete_cosine_transform, ft_elementwise_product, ft_index_to_string, ft_one_hot_encoder, ft_quantile_discretizer, ft_regex_tokenizer, ft_stop_words_remover, ft_string_indexer, ft_tokenizer, ft_vector_assembler, sdf_mutate

Aliases
  • ft_binarizer
Documentation reproduced from package sparklyr, version 0.6.3, License: Apache License 2.0 | file LICENSE

Community examples

kevin@fergs.com at Aug 4, 2017 sparklyr v0.6.0