sparklyr (version 0.3.9)

sdf_with_unique_id: Add a Unique ID Column to a Spark DataFrame

Description

Add a unique ID column to a Spark DataFrame. The Spark monotonicallyIncreasingId function is used to produce these and is guaranteed to produce unique, monotonically increasing ids; however, there is no guarantee that these IDs will be sequential.

Usage

sdf_with_unique_id(x, id = "id")

Arguments

x
An object coercable to a Spark DataFrame (typically, a tbl_spark).
id
The name of the column to host the generated IDs.