pc_batch

Splits identifier vectors into chunks, applies a worker function per chunk,
and records per-chunk success and error metadata.

Provides an interface to the 'PubChem' database via the PUG REST <https://pubchem.ncbi.nlm.nih.gov/docs/pug-rest> and
PUG View <https://pubchem.ncbi.nlm.nih.gov/docs/pug-view> services. This package allows users to automatically
access chemical and biological data from 'PubChem', including compounds, substances, assays, and various other data types.
Functions are available to retrieve data in different formats, perform searches, and access detailed annotations.

Selcuk Korkmaz

PubChemR

Interface to the 'PubChem' Database for Chemical Data Retrieval

Bilge Eren Yamasan

Dincer Goksuluk

pc_batch function

<dl><dt>ids</dt>
<dd>Identifier vector.</dd>
<dt>fn</dt>
<dd>Function to run on each chunk of `ids`.</dd>
<dt>chunk_size</dt>
<dd>Chunk size.</dd>
<dt>parallel</dt>
<dd>Logical; use parallel execution.</dd>
<dt>workers</dt>
<dd>Number of workers.</dd>
<dt>checkpoint_dir</dt>
<dd>Optional directory to persist per-chunk checkpoint files.</dd>
<dt>checkpoint_id</dt>
<dd>Optional checkpoint run id. If `NULL`, a deterministic id is generated.</dd>
<dt>resume</dt>
<dd>Logical; resume from an existing checkpoint manifest.</dd>
<dt>rerun_failed</dt>
<dd>Logical; when resuming, rerun chunks previously marked as failed.</dd>
<dt>...</dt>
<dd>Additional arguments passed into `fn`.</dd></dl>

Arguments

Batch-Orchestrate PubChem Workflows — pc_batch

<dl>

<dt>ids</dt>
<dd>Identifier vector.</dd>


<dt>fn</dt>
<dd>Function to run on each chunk of `ids`.</dd>


<dt>chunk_size</dt>
<dd>Chunk size.</dd>


<dt>parallel</dt>
<dd>Logical; use parallel execution.</dd>


<dt>workers</dt>
<dd>Number of workers.</dd>


<dt>checkpoint_dir</dt>
<dd>Optional directory to persist per-chunk checkpoint files.</dd>


<dt>checkpoint_id</dt>
<dd>Optional checkpoint run id. If `NULL`, a deterministic id is generated.</dd>


<dt>resume</dt>
<dd>Logical; resume from an existing checkpoint manifest.</dd>


<dt>rerun_failed</dt>
<dd>Logical; when resuming, rerun chunks previously marked as failed.</dd>


<dt>...</dt>
<dd>Additional arguments passed into `fn`.</dd>

</dl>

pc_batch: Batch-Orchestrate PubChem Workflows

Description

Usage

Value

Arguments

Details

Examples