ReadCausataCsv(causataR, include=c(), exclude=c(), maxMb=1000, colFilterFunc=NA, rowIndex=NA, nrows=NA, metadata=FALSE, debug=FALSE, ...)
ReadCausataR
function.
nrows
is specified.
See 'Details' for more information.
keep
. If the value is TRUE then the variable is kept, if FALSE the
variable is discarded.
doMC
, which is easier to debug.colFilterFunc
.The include
and exclude
arguments are used to select which columns to load from the csv file.
If these arguments are left at their default values then all columns are loaded.
If include
and exclude
are set then exclude
is applied first, followed by include
.
The maxMb
parameter can be used to load and filter data in several passes, which would reduce the
total memory required if row / column filters
are specified in colFilterFunc
or rowIndex
.
If the estimated required memory exceeds maxMb
, then the load will be broken into multiple passes,
each no larger than maxMb
.
The default estimate is 12 bytes per cell of a data frame, so when MaxMb=1000
(about a
gigabyte) that corresponds to a data frame with 100k rows and 833 columns.