ddfply

ddfdir

(character vector) Columns names to used to split the data(if 
missing, <code>fun</code> is applied on each chunk)

groupby

(object of class <em>function</em>) function to apply on each subset
after the split

(string) Collect the result as <code>list</code> or <code>dataframe</code>
or <code>none</code>. <code>none</code> keeps the resulting ddo on disk.

collect

(string) Path where intermediary files are kept

temploc

(positive integer) Number of directories into which the 
distributed dataframe (ddf) or distributed data object (ddo) is distributed

nbins

(positive integer) Number of rows of the file to be read at a 
time

chunk

(positive integer) Maximum number of rows of any subset 
resulting from split

spill

(positive integer) Number of cores to be used in parallel

cores

(positive integer) Size of batches of key-value pairs to be 
passed to the map OR Size of the batches of key-value pairs to flush to 
intermediate storage from the map output OR Size of the batches of 
key-value pairs to send to the reduce

buffer

Arguments to be passed to <code>data.table</code> function asis.

performs chunk processing or split-apply-combine on the data
  in a distributed data frame(ddf)

Perform chunk processing or split-apply-combine on data in a
delimited file (example: CSV) and Distributed Dataframes (DDF) across multiple
cores of a single machine with low memory footprint. These functions are a
convenient wrapper over the versatile package 'datadr'.

ddfply: ddfply

Description

Usage

Arguments

Value

Details

Examples