Learn R Programming

DSFM (version 1.0.1)

DPC: Distributed Principal Component Analysis

Description

Performs distributed principal component analysis on a numeric dataset split across multiple nodes. Estimates loading matrices, residual variances, and covariance matrices for each node.

Usage

DPC(data, m, n1, K)

Value

A list with the following components:

Ahat

List of estimated loading matrices for each node.

Dhat

List of diagonal residual variance matrices for each node.

Sigmahat

List of covariance matrices for each node.

Arguments

data

A numeric matrix containing the total dataset.

m

An integer specifying the number of principal components.

n1

An integer specifying the length of each data subset.

K

An integer specifying the number of nodes.

Examples

Run this code
set.seed(123)
data <- matrix(rnorm(500), nrow = 100, ncol = 5)
DPC(data = data, m = 3, n1 = 20, K = 5)

Run the code above in your browser using DataLab