- path
Character vector giving the path to the data. It can also be a pattern
when several files sharing the same name pattern are to be imported
(in that case it must be a fixed pattern, NOT a regular expression).
- dirDest
The destination directory, where the new HDD data should be saved.
- chunkMB
The chunk size in MB, defaults to 500MB. Instead of using this
argument, you can use the argument rowsPerChunk, which sets
the size of the chunks as a number of lines.
- rowsPerChunk
Number of rows per chunk. By default it is missing: its value
is deduced from the argument chunkMB and the size of the file. If provided,
it overrides any value given in chunkMB.
- col_names
The column names. By default it uses those of the data set.
If the data set lacks column names, you must provide them.
- col_types
The column types, in the readr fashion. You can use guess_col_types
to find them.
- nb_skip
Number of lines to skip.
- delim
The delimiter. By default the function tries to find the delimiter, but sometimes it fails.
- preprocessfun
A function that is applied to the data before saving. Default
is missing. Note that if a function is provided, it MUST return a data.frame;
any return value other than a data.frame is ignored.
- replace
If the destination directory already exists, you need to set the
argument replace=TRUE to overwrite all the HDD files in it.
- encoding
Character scalar containing the encoding of the file to be read.
By default it is "UTF-8". It is passed to the readr function locale, which is used
in read_delim_chunked (the reading function). A common encoding in Western Europe is
"ISO-8859-1" (simply use "file filename" in a non-Windows console to get the encoding).
Note that this argument is ignored if the argument locale is not NULL.
- verbose
Logical scalar or NULL (default). If TRUE, then the progress of
the importing process as well as the time to import are reported.
If NULL, it becomes TRUE when the data to import is greater than 5GB or there is
more than one chunk.
- locale
Either NULL (default), or an object created with locale.
This object is passed to the reading function read_delim_chunked and controls
how the data is imported.
- ...
Other arguments to be passed to read_delim_chunked.
Setting quote = "" can sometimes be useful.
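
The arguments above can be combined as in the following minimal sketch. It assumes that they belong to the function txt2hdd of the hdd package (the argument list itself does not name the function) and that hdd is installed. The helper keep_long_sepals, the toy input built from the iris data set and the temporary paths are hypothetical and only serve the illustration.

```r
# Hypothetical preprocessing function: keeps rows with Sepal.Length > 5.
# It must return a data.frame (a data.table is also a data.frame).
keep_long_sepals <- function(x) {
  x[x$Sepal.Length > 5, ]
}

# Toy input file written from the built-in iris data set
txt_path <- tempfile(fileext = ".csv")
write.csv(iris, txt_path, row.names = FALSE)

# Destination directory: tempfile() only returns a fresh, unused path
hdd_path <- tempfile()

# Assumption: the hdd package is installed; the import is skipped otherwise
if (requireNamespace("hdd", quietly = TRUE)) {
  hdd::txt2hdd(
    txt_path,
    dirDest = hdd_path,
    rowsPerChunk = 50,                 # replaces any value given in chunkMB
    delim = ",",                       # set explicitly instead of relying on detection
    preprocessfun = keep_long_sepals,  # applied to the data before saving
    encoding = "UTF-8",                # ignored if locale is not NULL
    verbose = TRUE                     # report progress and import time
  )
}
```

Running the same call a second time with the same dirDest would require replace = TRUE, since the destination directory would then already exist.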