

PivotalR

PivotalR is a package that enables users of R, the most popular open-source statistical programming language and environment, to interact with Greenplum Database, Apache HAWQ (incubating), and the open-source database PostgreSQL for big-data analytics. It does so by providing an interface to operations on tables and views in the database. These operations are almost the same as those on a data.frame, so R users do not need to learn SQL to work with objects in the database. Only a minimal amount of data is transferred between R and the database system. PivotalR also lets the user run the functions of the open-source big-data machine learning package Apache MADlib (incubating) directly from R.

  1. An Introduction to PivotalR

     vignette("pivotalr") # execute in R console to view the PDF file
  2. To install PivotalR:

    • Get the latest stable version from CRAN by running install.packages("PivotalR")

    • Or try out the latest development version from GitHub by running the following code (requires R >= 3.0.2):

      ## install.packages("devtools") # 'devtools' package is only available for R >= 3.0.2
      devtools::install_github("pivotalsoftware/PivotalR")
    • Or download the source tarball directly from here, and then install the tarball

      install.packages("pivotalsoftware-PivotalR-xxxx.tar.gz", repos = NULL, type = "source")

    where "pivotalsoftware-PivotalR-xxxx.tar.gz" is the name of the package that you have downloaded.

  3. To get started:
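
A minimal first session might look like the sketch below. It only uses functions that appear in the index on this page (db.connect, as.db.data.frame, preview, madlib.lm, delete, db.disconnect) and the bundled abalone data set. The wrapper name abalone_sketch, the table name "abalone_sketch", and the connection defaults (host, port, database name, user) are assumptions made for illustration; madlib.lm additionally assumes that MADlib is installed in the target database.

```r
# Sketch only: assumes PivotalR is installed and that a PostgreSQL or
# Greenplum server is reachable with the (hypothetical) settings below.
abalone_sketch <- function(host = "localhost", port = 5432,
                           dbname = "test", user = Sys.getenv("USER"),
                           password = "") {
    library(PivotalR)

    # Open a connection; db.connect() returns the connection ID
    cid <- db.connect(host = host, port = port, dbname = dbname,
                      user = user, password = password)
    on.exit(db.disconnect(cid), add = TRUE)

    # Remove any leftover table from a previous run, then copy the bundled
    # 'abalone' data set into the database. The result 'x' is a
    # db.data.frame that points to the new table; the data stays in the database.
    delete("abalone_sketch", conn.id = cid)
    x <- as.db.data.frame(abalone, "abalone_sketch", conn.id = cid)

    # data.frame-like operations, translated to SQL behind the scenes
    print(dim(x))
    print(names(x))
    print(preview(x, 5))   # pull only the first 5 rows into R

    # Linear regression run by MADlib inside the database, grouped by 'sex'
    fit <- madlib.lm(rings ~ . - id | sex, data = x)

    # Drop the example table before returning the fitted model(s)
    delete(x)
    fit
}

# Usage (needs a running database):
# fit <- abalone_sketch(dbname = "test", port = 5432)
```

Wrapping the steps in a function keeps the connection parameters in one place and ensures the connection is closed even if one of the steps fails.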

Monthly Downloads: 239
Version: 0.1.18.3.1
License: GPL (>= 2)
Maintainer: Rahul Iyer
Last Published: January 5th, 2019

Functions in PivotalR (0.1.18.3.1)

conn.eql

Check whether two connections are the same
conn.id

Find out the connection ID of a db.obj object
AIC

AIC methods for Madlib regression objects
as.factor-methods

Convert one column of a db.obj object into a categorical variable
Arith-methods

as.db.data.frame

Convert other objects into a db.data.frame object
db.data.frame-class

Class "db.data.frame"
by

Apply a Function to a db.data.frame Split by column(s)
abalone

Abalone data set
as.environment

Evaluate expressions within the context of a database table or view
Aggregate functions

Functions to perform a calculation on multiple values and return a single value
array.len

Get the length of the array in an array column
db.data.frame

Create a db.data.frame object pointing to a table/view in the database
cbind2-methods

Combine two db.obj Objects by Columns
clean.madlib.temp

Delete all the result tables created during calculations of MADlib
coef

Extract model coefficients for Madlib regression objects
generic.bagging

Run bootstrap aggregating (bagging) for a given training function
Extract database connection info

Utilities for extracting related information about a database connection
arraydb.to.arrayr

Convert strings extracted from database into arrays
generic.cv

Generic cross-validation for supervised learning algorithms
Compare-methods

content

Print the content of a db.obj object
db.q

Execute a SQL query
db.objects

List all the existing tables/views in a database with their schema names
db.Rquery-class

Class "db.Rquery" and its sub-class db.Rview-class
ifelse

Conditional Element Selection
is.db.data.frame

Check if an object is of type db.data.frame
dim-methods

Dimension of a table
na.action

Functions for filtering NA values in data
db.list

List all the currently active connections with their information
db.connect

Create a connection to a database
eql-methods

Test if two objects point to the same table
getTree.rf.madlib

MADlib wrapper function for Random Forest
crossprod

Compute the matrix product of X^T and Y.
db.obj-class

Abstract Class "db.obj"
Extract-Replace-methods

Func-methods

Mathematical functions that take db.obj objects as the argument
db.Rcrossprod-class

Class "db.Rcrossprod"
names-methods

The Names of an object
db.view-class

Class "db.view"
groups

Summary information for Logistic Regression output
predict.lda

Prediction function for MADlib's LDA models
delete

Safely delete a db.obj object or a table/view in the database
predict.rf.madlib

Compute the predictions of the model produced by madlib.randomForest
db.disconnect

Disconnect a connection to a database
summary.arima.madlib

Summary information for MADlib's ARIMA model
madlib.glm

Generalized Linear Regression by MADlib in databases
key

Get or set the primary key for a table
is.factor-methods

Detect whether a db.obj object is a categorical object
madlib.kmeans

Wrapper for MADlib's Kmeans clustering function
db.existsObject

Test whether an object exists in the database
plot.dt.madlib

Plot the result of madlib.rpart
is.na-method

Query if the entries in a table are NULL
predict

Generate the db.Rquery object that can calculate the predictions
preview

Read the actual data stored in a database table
Logical-methods

Logical operations for db.obj objects
summary.elnet.madlib

Summary information for Elastic net regularization output
db.search.path

Display or set the search path (i.e. default schemas) for a connected session to a database. The user can easily switch to a schema in which they have write privileges.
db.table-class

Class "db.table"
madlib.arima

Wrapper for MADlib's ARIMA model fitting function
madlib.summary

Data summary function
print-methods

Display the connection information associated with a db object
madlib.svm

Support Vector Machine with regression and novelty detection
predict.dt.madlib

Compute the predictions of the model produced by madlib.rpart
predict.elnet.madlib

Predict using the regression result of elastic net regularization
null.data

A Data Set with lots of NA values
margins

Compute the marginal effects of regression models
merge-method

Computing a join on two tables
residuals

Residuals methods for Madlib regression objects
madlib.elnet

MADlib's elastic net regularization for generalized linear models
perplexity.lda

Perplexity of LDA predictions
madlib.randomForest

MADlib wrapper function for Random Forest
print.arima.madlib

Display results of ARIMA fitting of madlib.arima
print.dt.madlib

Print the result of madlib.rpart
print.rf.madlib

Print the result of madlib.randomForest
sort

Sort a table or view by a set of columns
madlib.lda

Wrapper for MADlib's Latent Dirichlet Allocation
predict.arima

Forecast from MADlib's ARIMA fits
madlib.lm

Linear regression with grouping support, heteroskedasticity
print.summary.madlib

Display the results from summary function in a pretty format
predict.bagging.model

madlib.rpart

MADlib wrapper function for Decision Tree
subset-methods

Extract a subset of a table or view
text.dt.madlib

Add labels onto the figure generated by plot.dt.madlib
Type Cast functions

Cast columns of db.obj objects to other types
summary.lm.madlib

Summary information for Linear Regression output
GUI

Graphical interface for PivotalR based upon shiny
print

Display results of logistic regression
Row_actions

Compute the sum or mean of all columns in one row of a table
print.none.obj

Function used in GUI to print absolutely nothing
PivotalR-package

An R front-end to PostgreSQL and Greenplum databases, and a wrapper for MADlib, the open-source library for in-database parallel and distributed machine learning
unique-methods

The Unique of an object
print.lm.madlib

Display results of linear regression
print.elnet.madlib

Display the results from madlib.elnet function in a pretty format
summary

Summary information for Logistic Regression output
sample-methods

Methods for sampling rows of data from a table/view randomly
scale

Scaling and centering of tables
vcov

vcov methods for Madlib regression objects