Computes the connected component membership of each vertex and returns a DataFrame of vertex information with each vertex assigned a component ID.
gf_connected_components(x, broadcast_threshold = 1000000L, algorithm = c("graphframes", "graphx"), checkpoint_interval = 2L, ...)
x: An object coercible to a GraphFrame (typically, a gf_graphframe).
broadcast_threshold: Broadcast threshold in propagating component assignments.
algorithm: One of 'graphframes' or 'graphx'.
checkpoint_interval: Checkpoint interval in terms of number of iterations.
...: Optional arguments, currently not used.
# NOT RUN {
# checkpoint directory is required for gf_connected_components()
spark_set_checkpoint_dir(sc, tempdir())
g <- gf_friends(sc)
gf_connected_components(g)
# }
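To illustrate what "component membership" means without a Spark connection, here is a minimal base R sketch on a toy graph. The helper `toy_connected_components()` and the toy vertex and edge data are hypothetical and are not part of graphframes. The sketch uses simple min-label propagation, which is not necessarily the algorithm that gf_connected_components() runs, and the `component` column name is an assumption about the shape of the result.

```r
# Toy undirected graph with two components: {a, b, c} and {d, e}.
vertices <- c("a", "b", "c", "d", "e")
edges <- data.frame(
  src = c("a", "b", "d"),
  dst = c("b", "c", "e"),
  stringsAsFactors = FALSE
)

# Hypothetical helper (not a graphframes function): give every vertex the
# smallest vertex index found in its component by repeatedly propagating
# the minimum label across each edge until nothing changes.
toy_connected_components <- function(vertices, edges) {
  label <- seq_along(vertices)
  names(label) <- vertices
  repeat {
    previous <- label
    for (i in seq_len(nrow(edges))) {
      s <- edges$src[i]
      d <- edges$dst[i]
      m <- min(label[[s]], label[[d]])
      label[[s]] <- m
      label[[d]] <- m
    }
    # Stop once a full pass over the edges changes no label.
    if (identical(label, previous)) break
  }
  data.frame(id = vertices, component = unname(label), stringsAsFactors = FALSE)
}

result <- toy_connected_components(vertices, edges)
print(result)
```

In the result, vertices a, b and c share one component ID while d and e share another, which is the kind of per-vertex assignment described above.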