sdf_with_sequential_id
From sparklyr v0.7.0
by Javier Luraschi
Add a Sequential ID Column to a Spark DataFrame
Add a sequential ID column to a Spark DataFrame. The Spark
zipWithIndex
function is used to produce these. This differs from
sdf_with_unique_id
in that the IDs generated are independent of
partitioning.
Usage
sdf_with_sequential_id(x, id = "id", from = 1L)
Arguments
- x
A
spark_connection
,ml_pipeline
, or atbl_spark
.- id
The name of the column to host the generated IDs.
- from
The starting value of the id column
Community examples
Looks like there are no examples yet.