Learn R Programming

⚠️There's a newer version (0.9.9) of this package.Take me there.

fdm2id (version 0.9.4)

Data Mining and R Programming for Beginners

Description

Contains functions to simplify the use of data mining methods (classification, regression, clustering, etc.), for students and beginners in R programming. Various R packages are used and wrappers are built around the main functions, to standardize the use of data mining methods (input/output): it brings a certain loss of flexibility, but also a gain of simplicity. The package name came from the French "Fouille de Donnes en Master 2 Informatique Dcisionnelle".

Copy Link

Version

Install

install.packages('fdm2id')

Monthly Downloads

330

Version

0.9.4

License

GPL-3

Maintainer

Alexandre Blansch<c3><a9>

Last Published

October 30th, 2020

Functions in fdm2id (0.9.4)

CA

Correspondence Analysis (CA)
FEATURESELECTION

Classification with Feature selection
APRIORI

Classification using APRIORI
CART

Classification using CART
BAGGING

Classification using Bagging
GRADIENTBOOSTING

Classification using Gradient Boosting
CDA

Classification using Canonical Discriminant Analysis
ADABOOST

Classification using AdaBoost
DBSCAN

DBSCAN clustering method
EM

Expectation-Maximization clustering method
KMEANS

K-means method
SVD

Singular Value Decomposition
NMF

Non-negative Matrix Factorization
PCA

Principal Component Analysis (PCA)
SVM

Classification using Support Vector Machine
KNN

Classification using k-NN
boxclus

Clustering Box Plots
cartleafs

Number of Leafs
britpop

Population and location of 18 major british cities.
cartnodes

Number of Nodes
autompg

Auto MPG dataset
KERREG

Kernel Regression
HCA

Hierarchical Cluster Analysis method
MEANSHIFT

MeanShift method
MLPREG

Multi-Layer Perceptron Regression
LR

Classification using Logistic Regression
SVR

Regression using Support Vector Machine
NB

Classification using Naive Bayes
birth

Birth dataset
SVRl

Regression using Support Vector Machine with a linear kernel
MCA

Multiple Correspondence Analysis (MCA)
LINREG

Linear Regression
LDA

Classification using Linear Discriminant Analysis
cost.curves

Plot Cost Curves
correlated

Correlated variables
MLP

Classification using Multilayer Perceptron
boosting-class

Boosting methods model
STUMP

Classification using one-level decision tree
SPECTRAL

Spectral clustering method
SVRr

Regression using Support Vector Machine with a radial kernel
SVMr

Classification using Support Vector Machine with a radial kernel
SVMl

Classification using Support Vector Machine with a linear kernel
RANDOMFOREST

Classification using Random Forest
SOM

Self-Organizing Maps clustering method
POLYREG

Polynomial Regression
QDA

Classification using Quadratic Discriminant Analysis
alcohol

Alcohol dataset
apriori-class

APRIORI classification model
TEXTMINING

Text mining
beetles

Flea beetles dataset
compare.kappa

Comparison of two sets of clusters, using kappa
compare

Comparison of two sets of clusters
cookies

Cookies dataset
closegraphics

Close a graphics device
cartdepth

Depth
credit

Credit dataset
data.gauss

Gaussian mixture dataset
data.parabol

Parabol dataset
confusion

Confuion matrix
data.diag

Square dataset
accident2014

Sample of car accident location in the UK during year 2014.
TSNE

t-distributed Stochastic Neighbor Embedding
cartplot

CART Plot
data3

"data3" dataset
cookplot

Plot the Cook's distance of a linear regression model
dataset-class

Training set and test set
dbs-class

DBSCAN model
decathlon

Decathlon dataset
factorial-class

Factorial analysis results
exportgraphics

Open a graphics device
cda-class

Canonical Disciminant Analysis model
data1

"data1" dataset
evaluation.accuracy

Accuracy of classification predictions
getvocab

Extract words and phrases from a corpus
evaluation.fmeasure

F-measure
leverageplot

Plot the leverage points of a linear regression model
linsep

Linsep dataset
performance

Performance estimation
general.rules

Remove redundancy in a set of rules
distplot

Plot a k-distance graphic
intern.interclass

Clustering evaluation through interclass inertia
em-class

Expectation-Maximization model
cartinfo

CART information
data2

"data2" dataset
knn-class

K Nearest Neighbours model
evaluation

Evaluation of classification or regression predictions
plotclus

Generic Plot Method for Clustering
eucalyptus

Eucalyptus dataset
kmeans.getk

Estimation of the number of clusters for K-means
plotcloud

Plot word cloud
compare.accuracy

Comparison of two sets of clusters, using accuracy
evaluation.fowlkesmallows

Fowlkes<U+2013>Mallows index
compare.jaccard

Comparison of two sets of clusters, using Jaccard index
predict.kmeans

Predict function for K-means
predict.em

Predict function for EM
plotdata

Advanced plot function
ozone

Ozone dataset
params-class

Learning Parameters
intern.intraclass

Clustering evaluation through intraclass inertia
resplot

Plot the studentized residuals of a linear regression model
regplot

Plot function for a regression model
summary.apriori

Print summary of a classification model obtained by APRIORI
plot.cda

Plot function for cda-class
print.factorial

Plot function for factorial-class
temperature

Temperature dataset
pseudoF

Pseudo-F
selection-class

Feature selection
selectfeatures

Feature selection for classification
spectral-class

Spectral clustering model
data.xor

XOR dataset
data.twomoons

Two moons dataset
evaluation.jaccard

Jaccard index
spine

Spine dataset
evaluation.kappa

Kappa evaluation of classification predictions
wheat

Wheat dataset
vowels

Vowels dataset
evaluation.goodness

Goodness
filter.rules

Filtering a set of rules
predict.apriori

Model predictions
frequentwords

Frequent words
evaluation.r2

R2 evaluation of regression predictions
ionosphere

Ionosphere dataset
evaluation.recall

Recall of classification predictions
keiser

Keiser rule
data.target1

Target1 dataset
predict.boosting

Model predictions
query.docs

Document query
data.target2

Target2 dataset
plotzipf

Plot rank versus frequency
print.apriori

Print a classification model obtained by APRIORI
som-class

Self-Organizing Maps model
predict.textmining

Model predictions
snore

Snore dataset
universite

University dataset
vectorize.docs

Document vectorization
query.words

Word query
exportgraphics.off

Toggle graphic exports
treeplot

Dendrogram Plots
movies

Movies dataset
model-class

Generic classification or regression model
vectorize.words

Word vectorization
vectorizer-class

Document vectorization object
predict.selection

Model predictions
predict.dbs

Predict function for DBSCAN
predict.cda

Model predictions
predict.model

Model predictions
textmining-class

Text mining object
runningtime

Running time
scatterplot

Clustering Scatter Plots
evaluation.msep

MSEP evaluation of regression predictions
evaluation.precision

Precision of classification predictions
titanic

Titanic dataset
zoo

Zoo dataset
wine

Wine dataset
intern

Clustering evaluation through internal criteria
loadtext

load a text file
plot.factorial

Plot function for factorial-class
intern.dunn

Clustering evaluation through Dunn's index
meanshift-class

MeanShift model
predict.knn

Model predictions
plot.som

Plot function for som-class
reg2

reg2 dataset
predict.meanshift

Predict function for MeanShift
reg1

reg1 dataset
roc.curves

Plot ROC Curves
rotation

Rotation
stability

Clustering evaluation through stability
splitdata

Splits a dataset into training set and test set