A newer version (0.9.9) of this package is available.

fdm2id (version 0.9.1)

Data Mining and R Programming for Beginners

Description

Provides functions that simplify the use of data mining methods (classification, regression, clustering, etc.) for students and beginners in R programming. The package builds wrappers around the main functions of various R packages to standardize the inputs and outputs of these methods, trading some flexibility for simplicity. The package name comes from the French "Fouille de Données en Master 2 Informatique Décisionnelle".
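
The idea of standardizing inputs and outputs through wrappers can be sketched in base R. This is only an illustration of the general approach, not the package's own code: the names `fit_linear`, `fit_baseline` and `predict_model` are hypothetical and do not exist in fdm2id.

```r
# Illustrative sketch only: these helper names are hypothetical and are
# NOT functions from fdm2id. Each "fit_*" wrapper takes the same inputs
# (predictors x, numeric target y) and returns a list holding the fitted
# object together with a function that knows how to predict from it.

fit_linear <- function(x, y) {
  fit <- stats::lm(y ~ ., data = data.frame(x, y = y))
  list(fit = fit,
       predictor = function(fit, newdata) {
         unname(stats::predict(fit, newdata = data.frame(newdata)))
       })
}

fit_baseline <- function(x, y) {
  # Trivial baseline: always predict the mean of the training target.
  list(fit = mean(y),
       predictor = function(fit, newdata) rep(fit, nrow(newdata)))
}

# One prediction function works for every wrapped model.
predict_model <- function(model, newdata) {
  model$predictor(model$fit, newdata)
}

x <- mtcars[, c("wt", "hp")]
y <- mtcars$mpg
models <- list(linear = fit_linear(x, y), baseline = fit_baseline(x, y))
preds <- lapply(models, predict_model, newdata = x)
str(preds)
```

Because every wrapper shares one calling convention, methods can be swapped or compared in a loop without method-specific code. The cost is that options of the underlying function are hidden unless the wrapper exposes them.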

Install

install.packages('fdm2id')

Monthly Downloads

377

Version

0.9.1

License

GPL-3

Maintainer

Alexandre Blansché

Last Published

January 10th, 2020
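
Note that install.packages() fetches whatever release the configured repository currently serves, which may be newer than the 0.9.1 release documented here. A minimal sketch for checking what is installed follows. The commented-out line assumes that the separate 'remotes' package is installed and that release 0.9.1 is still available from the CRAN archive.

```r
# Report whether fdm2id is installed and, if so, which version.
installed <- requireNamespace("fdm2id", quietly = TRUE)
if (installed) {
  message("fdm2id version: ", as.character(utils::packageVersion("fdm2id")))
} else {
  message("fdm2id is not installed; run install.packages('fdm2id').")
}

# To pin this exact release instead of the latest one (assumes the
# 'remotes' package is installed and 0.9.1 is still in the CRAN archive):
# remotes::install_version("fdm2id", version = "0.9.1")
```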

Functions in fdm2id (0.9.1)

ADABOOST

Classification using AdaBoost
APRIORI

Classification using APRIORI
DBSCAN

DBSCAN clustering method
BAGGING

Classification using Bagging
FEATURESELECTION

Classification with Feature selection
HCA

Hierarchical Cluster Analysis method
CDA

Classification using Canonical Discriminant Analysis
CART

Classification using CART
GRADIENTBOOSTING

Classification using Gradient Boosting
EM

Expectation-Maximization clustering method
KNN

Classification using k-NN
KERREG

Kernel Regression
LR

Classification using Logistic Regression
LINREG

Linear Regression
SVR

Regression using Support Vector Machine
SVD

Singular Value Decomposition
STUMP

Classification using one-level decision tree
NB

Classification using Naive Bayes
MLPREG

Multi-Layer Perceptron Regression
SVMr

Classification using Support Vector Machine with a radial kernel
MEANSHIFT

MeanShift method
SVM

Classification using Support Vector Machine
MLP

Classification using Multilayer Perceptron
SVMl

Classification using Support Vector Machine with a linear kernel
apriori-class

APRIORI classification model
QDA

Classification using Quadratic Discriminant Analysis
autompg

Auto MPG dataset
RANDOMFOREST

Classification using Random Forest
bootstrap

Bootstrap evaluation
boosting-class

Boosting methods model
LDA

Classification using Linear Discriminant Analysis
compare

Comparison of two sets of clusters
NMF

Non-negative Matrix Factorization
compare.accuracy

Comparison of two sets of clusters, using accuracy
SVRl

Regression using Support Vector Machine with a linear kernel
KMEANS

K-means method
SOM

Self-Organizing Maps clustering method
SPECTRAL

Spectral clustering method
cartleafs

Number of Leaves
SVRr

Regression using Support Vector Machine with a radial kernel
cartinfo

CART information
bootstrap.curves

Plot evaluation curves with bootstrap sampling
TSNE

t-distributed Stochastic Neighbor Embedding
TEXTMINING

Text mining
beetles

Flea beetles dataset
cost.curves

Plot Cost Curves
boxclus

Clustering Box Plots
compare.jaccard

Comparison of two sets of clusters, using Jaccard index
distplot

Plot a k-distance graphic
em-class

Expectation-Maximization model
credit

Credit dataset
compare.kappa

Comparison of two sets of clusters, using kappa
birth

Birth dataset
evaluation.accuracy

Accuracy of classification predictions
closegraphics

Close a graphics device
cda-class

Canonical Discriminant Analysis model
data.target1

Target1 dataset
evaluation.recall

Recall of classification predictions
exportgraphics

Open a graphics device
evaluation

Evaluation of classification or regression predictions
data.parabol

Parabol dataset
filter.rules

Filtering a set of rules
meanshift-class

MeanShift model
evaluation.goodness

Goodness
evaluation.jaccard

Jaccard index
kmeans.getk

Estimation of the number of clusters for K-means
ionosphere

Ionosphere dataset
accident2014

Sample of car accident locations in the UK during 2014
POLYREG

Polynomial Regression
frequentwords

Frequent words
movies

Movies dataset
alcohol

Alcohol dataset
model-class

Generic classification or regression model
data.target2

Target2 dataset
cookplot

Plot the Cook's distance of a linear regression model
cookies

Cookies dataset
cartplot

CART Plot
cartdepth

Depth
data2

"data2" dataset
data1

"data1" dataset
britpop

Population and location of 18 major British cities
cartnodes

Number of Nodes
data.twomoons

Two moons dataset
dbs-class

DBSCAN model
decathlon

Decathlon dataset
eucalyptus

Eucalyptus dataset
resplot

Plot the studentized residuals of a linear regression model
plotdata

Advanced plot function
predict.model

Model predictions
plotclus

Generic Plot Method for Clustering
predict.meanshift

Predict function for MeanShift
regplot

Plot function for a regression model
data.gauss

Gaussian mixture dataset
data3

"data3" dataset
data.diag

Square dataset
dataset-class

Training set and test set
evaluate

Evaluate several classification (or regression) methods
predict.em

Predict function for EM
ozone

Ozone dataset
predict.dbs

Predict function for DBSCAN
evaluation.kappa

Kappa evaluation of classification predictions
evaluation.msep

MSEP evaluation of regression predictions
intern.dunn

Clustering evaluation through Dunn's index
intern

Clustering evaluation through internal criteria
evaluation.fmeasure

F-measure
titanic

Titanic dataset
textmining-class

Text mining object
evaluation.precision

Precision of classification predictions
params-class

Learning Parameters
evaluation.r2

R2 evaluation of regression predictions
intern.intraclass

Clustering evaluation through intraclass inertia
plot.cda

Plot function for cda-class
intern.interclass

Clustering evaluation through interclass inertia
vectorize.words

Word vectorization
general.rules

Remove redundancy in a set of rules
predict.boosting

Model predictions
evaluation.fowlkesmallows

Fowlkes–Mallows index
predict.kmeans

Predict function for K-means
vectorizer-class

Document vectorization object
query.docs

Document query
predict.cda

Model predictions
query.words

Word query
leverageplot

Plot the leverage points of a linear regression model
knn-class

K Nearest Neighbours model
runningtime

Running time
plotcloud

Plot word cloud
plot.som

Plot function for som-class
print.apriori

Print a classification model obtained by APRIORI
predict.knn

Model predictions
selectfeatures

Feature selection for classification
spectral-class

Spectral clustering model
selection-class

Feature selection
scatterplot

Clustering Scatter Plots
summary.apriori

Print summary of a classification model obtained by APRIORI
wheat

Wheat dataset
splitdata

Splits a dataset into training set and test set
roc.curves

Plot ROC Curves
rotation

Rotation
pseudoF

Pseudo-F
stability

Clustering evaluation through stability
spine

Spine dataset
temperature

Temperature dataset
wine

Wine dataset
vectorize.docs

Document vectorization
universite

University dataset
zoo

Zoo dataset
linsep

Linsep dataset
plotzipf

Plot rank versus frequency
loadtext

Load a text file
getvocab

Extract words and phrases from a corpus
predict.apriori

Model predictions
predict.selection

Model predictions
predict.textmining

Model predictions
reg1

reg1 dataset
reg2

reg2 dataset
snore

Snore dataset
exportgraphics.off

Toggle graphic exports
treeplot

Dendrogram Plots
som-class

Self-Organizing Maps model