import_fst_chunked

get_fst_chunk_size

For `import_fst_chunked`, if a large fst file which could not be imported into the memory all at once,
 this function could read the fst file by chunks and preprocessed the chunk to ensure the
 results yielded by the chunks are small enough to be summarised in the end.
 For `get_fst_chunk_size`, this function can measure the memory used by a specified row number.

A toolkit of tidy data manipulation verbs with 'data.table' as the backend.
Combining the merits of syntax elegance from 'dplyr' and computing performance from 'data.table',
'tidyfst' intends to provide users with state-of-the-art data manipulation tools with least pain.
This package is an extension of 'data.table'. While enjoying a tidy syntax,
it also wraps combinations of efficient functions to facilitate frequently-used data operations.

Tian-Yuan Huang

tidyfst

Tidy Verbs for Fast Data Manipulation

import_fst_chunked function

<dl><dt>path</dt>
<dd>Path to fst file</dd>
<dt>chunk_size</dt>
<dd>Integer. The number of rows to include in each chunk</dd>
<dt>chunk_f</dt>
<dd>A function implemented on every chunk.</dd>
<dt>combine_f</dt>
<dd>A function to aggregate all the elements from the list of results from chunks.</dd>
<dt>nrows</dt>
<dd>Number of rows to test.</dd></dl>

Arguments

Read a fst file by chunks — import_fst_chunked

<dl>

<dt>path</dt>
<dd>Path to fst file</dd>


<dt>chunk_size</dt>
<dd>Integer. The number of rows to include in each chunk</dd>


<dt>chunk_f</dt>
<dd>A function implemented on every chunk.</dd>


<dt>combine_f</dt>
<dd>A function to aggregate all the elements from the list of results from chunks.</dd>


<dt>nrows</dt>
<dd>Number of rows to test.</dd>

</dl>

import_fst_chunked: Read a fst file by chunks

Description

Usage

Value

Arguments

See Also

Examples