sdf_with_unique_id

0th

Percentile

Add a Unique ID Column to a Spark DataFrame

Add a unique ID column to a Spark DataFrame. The Spark monotonicallyIncreasingId function is used to produce these and is guaranteed to produce unique, monotonically increasing ids; however, there is no guarantee that these IDs will be sequential. The table is persisted immediately after the column is generated, to ensure that the column is stable -- otherwise, it can differ across new computations.

Usage
sdf_with_unique_id(x, id = "id")
Arguments
x

An object coercable to a Spark DataFrame (typically, a tbl_spark).

id

The name of the column to host the generated IDs.

Aliases
  • sdf_with_unique_id
Documentation reproduced from package sparklyr, version 0.5.1, License: Apache License 2.0 | file LICENSE

Community examples

Looks like there are no examples yet.