Learn R Programming

startR (version 2.4.0)

Automatically Retrieve Multidimensional Distributed Data Sets

Description

Tool to automatically fetch, transform and arrange subsets of multi- dimensional data sets (collections of files) stored in local and/or remote file systems or servers, using multicore capabilities where possible. The tool provides an interface to perceive a collection of data sets as a single large multidimensional data array, and enables the user to request for automatic retrieval, processing and arrangement of subsets of the large array. Wrapper functions to add support for custom file formats can be plugged in/out, making the tool suitable for any research field where large multidimensional data sets are involved.

Copy Link

Version

Install

install.packages('startR')

Monthly Downloads

303

Version

2.4.0

License

GPL-3

Maintainer

Victoria Agudetse

Last Published

September 19th, 2024

Functions in startR (2.4.0)

Sort

Sort the coordinate variable values in a Start() call
Start

Declare, discover, subset and retrieve multidimensional distributed data sets
Step

Define the operation applied on declared data.
indices

Specify dimension selectors with indices
values

Specify dimension selectors with actual values
CDORemapper

CDO Remap Data Transformation for 'startR'
NcCloser

NetCDF file closer for 'startR'
SelectorChecker

Translate a set of selectors into a set of numeric indices
NcDataReader

NetCDF file data reader for 'startR'
NcDimReader

NetCDF dimension reader for 'startR'
AddStep

Create the workflow with the previous defined operation and data.
Compute

Specify the execution parameters and trigger the execution
NcOpener

NetCDF file opener for 'startR'
NcVarReader

NetCDF variable reader for 'startR'
Collect

Collect and merge the computation results