Learn R Programming

⚠️There's a newer version (2.0-3) of this package.Take me there.

stream - Infrastructure for Data Stream Mining - R package

A framework for data stream modeling and associated data mining tasks such as clustering and classification. The development of this package was supported in part by NSF IIS-0948893 and NIH R21HG005912.

Additional packages in the stream family are: streamMOA.

Installation

  • Stable CRAN version: install from within R.
  • Current development version: Download package from AppVeyor or install via install_git("mhahsler/stream") (needs devtools)

Example

R> library("stream")
R> stream <- DSD_Gaussians(k=3, noise=0)

# create micro-clusters via sampling
R> sample <- DSC_Sample(k=20)
R> update(sample, stream, 500)
R> sample
Reservoir sampling
Class: DSC_Sample, DSC_Micro, DSC_R, DSC 
Number of micro-clusters: 20 

# recluster micro-clusters
R> kmeans <- DSC_Kmeans(k=3)
R> recluster(kmeans, sample)
R> plot(kmeans, stream, type="both")

Further Information

Maintainer: Michael Hahsler

Copy Link

Version

Install

install.packages('stream')

Monthly Downloads

712

Version

1.2-3

License

GPL-3

Maintainer

Michael Hahsler

Last Published

August 7th, 2016

Functions in stream (1.2-3)

DSC_Mirco

Abstract Class for Micro Clusterers
DSD_Benchmark

Data Stream Generator for Benchmark Data
DSC

Data Stream Clusterer Base Classes
DSD_BarsAndGaussians

Data Stream Generator for Bars and Gaussians
DSC_Static

Create as Static Copy of a Clustering
DSD_Cubes

Static Cubes Data Stream Generator
DSD_Memory

A Data Stream Interface for Data Stored in Memory
DSC_Window

A sliding window from a Data Stream
DSClassify

Abstract Class for Data Stream Classifiers
DSD_Gaussians

Mixture of Gaussians Data Stream Generator
DSC_TwoStage

TwoStage Clustering Process
DSD_ReadCSV

Read a Data Stream from File
DSD_mlbenchData

Stream Interface for Data Sets From mlbench
DSD_ReadDB

Read a Data Stream from an open DB Query
DSD_mlbenchGenerator

mlbench Data Stream Generator
DSD_ScaleStream

Scale a Stream from a DSD
DSD

Data Stream Data Generator Base Classes
DSD_Target

Target Data Stream Generator
DSFP

Abstract Class for Frequent Pattern Mining Algorithms for Data Streams
DSD_MG

DSD Moving Generator
DSD_UniformNoise

Uniform Noise Data Stream Generator
DST

Abstract Base Class for All Data Stream Mining Tasks
get_assignment

Assignment Data Points to Clusters
get_points

Get Points from a Data Stream Generator
get_weights

Get Cluster Weights
get_centers

Get Cluster Centers from a DSC
DSO_Window

Sliding Window (Data Stream Operator)
DSO

Data Stream Operator Base Classes
DSO_Sample

Sampling from a Data Stream (Data Stream Operator)
get_copy

Create a Deep Copy of a DSC Object
evaluate

Evaluate Clusterings
reset_stream

Reset a Data Stream to its Beginning
nclusters

nclusters
write_stream

Write a Data Stream to a File
update

Update a Data Stream Clustering Model
recluster

Re-clustering micro-clusters
save

Save and Read DSC Objects
microToMacro

Translate Micro-cluster IDs to Macro-cluster IDs
MGC

Moving Generator Cluster
plot

Plotting Data Stream Data and Clusterings
prune_clusters

Prune Clusters from a Clustering
DSC_Kmeans

Kmeans Macro-clusterer
DSC_Marco

Abstract Class for Macro Clusterers
DSC_Hierarchical

Hierarchical Micro-Cluster Reclusterer
animation

Animates the plotting of a DSD and the clustering process
DSC_DBSTREAM

DBSTREAM clustering algorithm
DSC_DBSCAN

DBSCAN Macro-clusterer
DSC_Reachability

Reachability Micro-Cluster Reclusterer
DSC_Sample

Extract a Fixed-size Sample from a Data Stream
DSC_DStream

D-Stream Data Stream Clustering Algorithm