data_frame: Create a data_frame object
Description
Creates a data_frame object, a table that is held on disk. The
object is written to a folder on disk, where the data is stored
in blocks (chunks) of rows. A C++ function writes each chunk in
binary format as purely numerical data, and a mapping back to the
table is held in a ".meta_data" file in the folder. The table
object accommodates numeric, factor, and character columns
(character columns are converted to factor).
Usage
data_frame(
data = stop("data must be supplied"),
chunkSize = stop("chunkSize must be specified, a good number is 50000"),
path = stop("path must be specified"),
nCores = parallel::detectCores(),
...
)
Arguments
data
data.frame object to be converted into a data_frame object
chunkSize
number of rows to be used in each chunk
path
character string giving the path to the folder where the object will be created
nCores
the number of cores to use; defaults to parallel::detectCores()
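Examples
The on-disk layout described under Description can be pictured with a small base-R sketch. This is not the package's C++ implementation. The helpers write_chunks() and read_chunk(), the chunk file names, and the use of saveRDS() for the ".meta_data" file are all assumptions made for illustration. The sketch only shows the general idea: factors are reduced to integer codes, rows are written in blocks of chunkSize, and the factor levels are kept in a separate metadata file.

```r
# Illustrative base-R sketch only; NOT the package's C++ writer.
# write_chunks() and read_chunk() are hypothetical helpers.
# Assumes `data` has at least one row.
write_chunks <- function(data, chunkSize, path) {
  dir.create(path, recursive = TRUE, showWarnings = FALSE)

  # Character columns become factors, mirroring the conversion described above.
  for (col in names(data)) {
    if (is.character(data[[col]])) data[[col]] <- factor(data[[col]])
  }

  # Keep the factor levels so the integer codes can be mapped back later.
  # (Storing this with saveRDS() is an assumption; the real format is not documented here.)
  meta <- list(names = names(data), levels = lapply(data, levels),
               nrow = nrow(data), chunkSize = chunkSize)
  saveRDS(meta, file.path(path, ".meta_data"))

  # Purely numerical representation: factors are stored as their integer codes.
  num <- data.matrix(data)
  storage.mode(num) <- "double"

  # Write each block of rows to its own binary file.
  starts <- seq(1, nrow(data), by = chunkSize)
  for (i in seq_along(starts)) {
    rows <- starts[i]:min(starts[i] + chunkSize - 1, nrow(data))
    con <- file(file.path(path, sprintf("chunk_%d.bin", i)), "wb")
    writeBin(as.vector(num[rows, , drop = FALSE]), con)
    close(con)
  }
  invisible(length(starts))
}

# Read chunk number i back as a numeric matrix.
read_chunk <- function(path, i) {
  meta <- readRDS(file.path(path, ".meta_data"))
  n_in_chunk <- min(meta$chunkSize, meta$nrow - (i - 1) * meta$chunkSize)
  vals <- readBin(file.path(path, sprintf("chunk_%d.bin", i)), "double",
                  n = n_in_chunk * length(meta$names))
  matrix(vals, nrow = n_in_chunk, dimnames = list(NULL, meta$names))
}

path <- file.path(tempdir(), "iris_chunks")
n_chunks <- write_chunks(iris, chunkSize = 40, path = path)
n_chunks                     # 150 rows at 40 rows per chunk gives 4 chunks
last <- read_chunk(path, 4)  # the final chunk holds the remaining 30 rows
dim(last)
```

Only one chunk has to be in memory at a time, which is the reason for choosing a chunkSize that fits comfortably in RAM.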