Given a numeric column within a Spark DataFrame, compute approximate quantiles (to some relative error).
sdf_quantile(
x,
column,
probabilities = c(0, 0.25, 0.5, 0.75, 1),
relative.error = 1e-05
)
A spark_connection
, ml_pipeline
, or a tbl_spark
.
The column for which quantiles should be computed.
A numeric vector of probabilities, for which quantiles should be computed.
The relative error -- lower values imply more precision in the computed quantiles.