Learn R Programming

biglasso (version 1.0-1)

setupX: Set up design matrix X by reading data from big data file

Description

Set up the design matrix X as a big.matrix object based on external massive data file stored on disk that cannot be fullly loaded into memory. The data file must be a well-formated ASCII-file, and contains only one single type. Current version only supports double type. Other restrictions about the data file are described in biglasso-package. This function reads the massive data, and creates a big.matrix object. By default, the resulting big.matrix is file-backed, and can be shared across processors or nodes of a cluster.

Usage

setupX(filename, dir = getwd(), sep = ",", backingfile = paste0(unlist(strsplit(filename, split = "\\."))[1], ".bin"), descriptorfile = paste0(unlist(strsplit(filename, split = "\\."))[1], ".desc"), ...)

Arguments

filename
The name of the data file. For example, "dat.txt".
dir
The directory used to store the binary and descriptor files associated with the big.matrix. The default is current working directory.
sep
The field separator character. For example, "," for comma-delimited files (the default); "\t" for tab-delimited files.
backingfile
The binary file associated with the file-backed big.matrix. By default, its name is the same as filename with the extension replaced by ".bin".
descriptorfile
The descriptor file used for the description of the file-backed big.matrix. By default, its name is the same as filename with the extension replaced by ".desc".
...
Additional arguments that can be passed into function read.big.matrix.

Value

A big.matrix object corresponding to a file-backed big.matrix. It's ready to be used as the design matrix X in biglasso and cv.biglasso.

Details

This function is a simple wrapper of read.big.matrix. See read.big.matrix and the package bigmemory for more details.

See Also

biglasso, cv.ncvreg

Examples

Run this code
## see the example in "biglasso-package"

Run the code above in your browser using DataLab