stream - Infrastructure for Data Stream Mining - R package
A framework for data stream modeling and associated data mining tasks such as clustering and classification. The development of this package was supported in part by NSF IIS-0948893 and NIH R21HG005912.
Additional packages in the stream family are:
Installation
Stable CRAN version: install from within R with
install.packages("stream")Current development version: Download package from AppVeyor or install from GitHub (needs devtools).
install_git("mhahsler/stream")Usage
Load the package and create micro-clusters via sampling.
library("stream")
stream <- DSD_Gaussians(k=3, noise=0)
sample <- DSC_Sample(k=20)
update(sample, stream, 500)
sampleReservoir sampling
Class: DSC_Sample, DSC_Micro, DSC_R, DSC
Number of micro-clusters: 20 Recluster micro-clusters using k-means and plot results
kmeans <- DSC_Kmeans(k=3)
recluster(kmeans, sample)
plot(kmeans, stream, type="both")References
- stream package vignette with complete examples.
- stream reference manual