bigReg (version 0.1.5)

Generalized Linear Models (GLM) for Large Data Sets

Description

Fits generalized linear models (GLMs) to very large data sets. The data_frame and data_matrix objects store large data on disk: create one with the data_frame() function and append to it with object$append(data). The data is stored as doubles in binary format; character columns are converted to factors and stored as numeric (binary) data, with a look-up table kept in a separate .meta_data file in the same folder. Because the data is stored in blocks, the GLM fitting algorithm is modified to use a MapReduce-like approach. The functions bglm(), summary(), and bglm_predict() are available for fitting and post-processing models. The library requires Armadillo to be installed on your system. It may not work on Windows, because multi-core processing uses mclapply(), which forks R on Unix/Linux-type operating systems.
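The block-wise fitting idea in the description can be illustrated in base R. The sketch below is not bigReg's implementation and does not use its API: map_block() and fit_blockwise() are hypothetical names, the blocks are held in memory rather than read from disk, and a Poisson model with a log link is assumed purely for illustration. Each block contributes its own X'WX and X'Wz terms (the map step), the contributions are summed (the reduce step), and the coefficients are updated by iteratively reweighted least squares.

```r
# Illustrative only: block-wise IRLS for a Poisson GLM with log link.
# map_block() and fit_blockwise() are hypothetical names, not bigReg functions.

set.seed(1)
n <- 1000
x <- cbind(1, rnorm(n))                      # design matrix with intercept
y <- rpois(n, exp(x %*% c(0.5, 0.3)))        # simulated counts

# Split the rows into 4 blocks, standing in for blocks stored on disk
block_id <- rep(1:4, length.out = n)
blocks <- lapply(split(seq_len(n), block_id), function(i) {
  list(x = x[i, , drop = FALSE], y = y[i])
})

# Map step: one block's contribution to X'WX and X'Wz at the current beta
map_block <- function(block, beta) {
  eta <- drop(block$x %*% beta)              # linear predictor
  mu  <- exp(eta)                            # inverse log link
  w   <- mu                                  # IRLS weights (Poisson, log link)
  z   <- eta + (block$y - mu) / mu           # working response
  list(xwx = crossprod(block$x, w * block$x),
       xwz = crossprod(block$x, w * z))
}

fit_blockwise <- function(blocks, p, tol = 1e-10, max_iter = 50) {
  beta <- numeric(p)
  for (iter in seq_len(max_iter)) {
    # lapply() keeps the sketch portable; parallel::mclapply() could be
    # substituted on Unix-like systems
    parts <- lapply(blocks, map_block, beta = beta)
    # Reduce step: sum the per-block contributions
    xwx <- Reduce(`+`, lapply(parts, `[[`, "xwx"))
    xwz <- Reduce(`+`, lapply(parts, `[[`, "xwz"))
    beta_new <- drop(solve(xwx, xwz))
    converged <- max(abs(beta_new - beta)) < tol
    beta <- beta_new
    if (converged) break
  }
  beta
}

beta_block <- fit_blockwise(blocks, p = ncol(x))

# Reference fit on the full data with stats::glm()
beta_glm <- unname(coef(glm(y ~ x[, 2], family = poisson())))
```

Because the summed block contributions equal the full-data X'WX and X'Wz, the block-wise estimates should agree with the ordinary glm() fit up to numerical tolerance; only one block needs to be in memory at a time during the map step.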

Install

install.packages('bigReg')

Monthly Downloads

124

Version

0.1.5

License

GPL (>= 2)

Maintainer

Chibisi Chima-Okereke

Last Published

December 11th, 2023

Functions in bigReg (0.1.5)

process_bglm_block

Function to print the summary object from the blm object
quasi_

quasi family function
plasma

plasma data from the HSAUR package
sum_bglm_block

The reduction function for the algorithm
summary.bglm

summary function for the bglm object
read_matrix_blocks

read matrix blocks from file
write_numeric_vector

writes numeric vector to file
read_matrix_block

read matrix block from file
print.bglm

print function for the bglm object
read_df_blocks

read multiple blocks of data frames from file
print.blm

print function for the blm object
writeNumericVector

writes numeric vector to file
poisson_

poisson family function
r_bind

row binding for benchmarking ...
summary.blm

summary function for the blm object
readNumericVector

reads numeric vector from file
bglm

Function to carry out generalized linear regression on a data_frame data object
XWXMatrixW

Calculation of iterative regression components
SVD

Singular value decomposition of the aggregated list from XWXMatrix(W) functions
XWXMatrix

Calculation of iterative regression components
blm

Function to carry out linear regression on a data_frame data object
Gamma_

Gamma family function
asInteger

converts numeric vector to integer
bglm_predict

predict function for bglm object
CreateFactor

creates factor from numeric vector and character vector as levels
binomial_

binomial family function
load_data_frame

function to load data_frame object
load_data_matrix

function to load data_matrix object
data_frame

function to create a data_frame object
data_matrix

function to create a data_matrix object
gaussian_

gaussian family function
.control

Function for creating control parameters for the GLM fit
family_

family function
print.summary.bglm

Function to print the summary object from the bglm object
print.summary.blm

Function to print the summary object from the blm object
inverse.gaussian_

inverse.gaussian family function
quasibinomial_

quasibinomial family function
print.data_frame

print function for a data_frame
quasipoisson_

quasipoisson family function
print.data_matrix

print function for a data_matrix
myIn

finds whether x is in y
mySeq

mySeq function to sequence integers
read_df_block

read data frame block from file