disk.frame (version 0.1.1)

group_by.disk.frame: Group by within each disk.frame

Description

The disk.frame group-by operation performs the grouping WITHIN each chunk, so the same group key can appear in more than one chunk. This is often done for performance reasons. If the user wishes to perform a group-by across all chunks, they may use the `hard_group_by` function, which is expensive because it reorganizes the chunks by the shard key.
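
The chunk-wise behaviour described above can be illustrated with a minimal sketch that uses plain dplyr on an in-memory data frame rather than the disk.frame API. The `chunk_id` column, the toy data, and the variable names are assumptions made for illustration only; they are not part of disk.frame.

```r
library(dplyr)

# Toy data standing in for a disk.frame. `chunk_id` is a hypothetical
# column that marks which chunk each row would live in.
df <- data.frame(
  chunk_id = c(1, 1, 1, 2, 2, 2),
  key      = c("a", "a", "b", "a", "b", "b"),
  value    = c(1, 2, 3, 4, 5, 6)
)

# Split into chunks, then group and summarise each chunk on its own.
# This mimics grouping WITHIN each chunk.
chunks <- split(df, df$chunk_id)
per_chunk <- bind_rows(lapply(chunks, function(chunk) {
  chunk %>%
    group_by(key) %>%
    summarise(total = sum(value))
}))

# Each key appears once per chunk that contains it, so `per_chunk`
# has one row per (chunk, key) pair rather than one row per key.
print(per_chunk)

# A second aggregation over the per-chunk results gives one row per key.
combined <- per_chunk %>%
  group_by(key) %>%
  summarise(total = sum(total))
print(combined)
```

Re-aggregating the per-chunk results works here because sums can be combined across chunks. Statistics such as a mean or median cannot be combined this simple way, which is one situation where regrouping the data by key first, as `hard_group_by` does, may be the better choice.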

Usage

# S3 method for disk.frame
group_by(.data, ...)

Arguments

.data

a disk.frame

...

the grouping variables, specified in the same way as for dplyr::group_by

See Also

hard_group_by

Other dplyr verbs: select.disk.frame