Learn R Programming

DAL Toolbox

As research experiments grow in scale and complexity, data analytics demands tools that go beyond isolated functions. DAL Toolbox is a framework designed to meet these modern challenges by organizing a comprehensive set of data analytics capabilities into an integrated workflow environment. Inspired by the Experiment Line model doi:10.1007/978-3-642-02279-1_20, it supports essential tasks such as data preprocessing, classification, regression, clustering, and time series prediction. With a unified data model, consistent method API, and support for hyperparameter tuning, DAL Toolbox enables the seamless construction and execution of end-to-end analytics pipelines. It also offers easy integration with existing libraries and languages, promoting usability, extensibility, and reproducibility in data science.


Examples

Graphics: https://github.com/cefet-rj-dal/daltoolbox/tree/main/examples/graphics/

Transformation: https://github.com/cefet-rj-dal/daltoolbox/tree/main/examples/transf/

Classification: https://github.com/cefet-rj-dal/daltoolbox/tree/main/examples/classification/

Clustering: https://github.com/cefet-rj-dal/daltoolbox/tree/main/examples/clustering/

Regression: https://github.com/cefet-rj-dal/daltoolbox/tree/main/examples/regression/

The examples are organized according to general (data preprocessing), clustering, classification, regression, and time series functions.


Installation

The latest version of DAL Toolbox at CRAN is available at: https://CRAN.R-project.org/package=daltoolbox

You can install the stable version of DAL Toolbox from CRAN with:

install.packages("daltoolbox")

You can install the development version of DAL Toolbox from GitHub https://github.com/cefet-rj-dal/daltoolbox with:

library(devtools)
devtools::install_github("cefet-rj-dal/daltoolbox", force=TRUE, dependencies=FALSE, upgrade="never")

Bugs and new features request

https://github.com/cefet-rj-dal/daltoolbox/issues

Copy Link

Version

Install

install.packages('daltoolbox')

Monthly Downloads

890

Version

1.3.717

License

MIT + file LICENSE

Issues

Pull Requests

Stars

Forks

Maintainer

Eduardo Ogasawara

Last Published

February 10th, 2026

Functions in daltoolbox (1.3.717)

dal_graphics

Graphics utilities
dal_learner

DAL Learner (base class)
discover

Discover
cluster_kmeans

k-means
cluster_hclust

Hierarchical clustering
data_sample

Data sampling abstractions
dal_base

Class dal_base
clusterer

Clusterer
cla_svm

SVM for classification
fit

Fit
dal_tune

DAL Tune (base for hyperparameter search)
k_fold

K-fold sampling
evaluate

Evaluate
dt_pca

PCA
minmax

Min-max normalization
cluster_pam

PAM (Partitioning Around Medoids)
cluster_louvain_graph

Louvain community detection
pat_cspade

cSPADE sequences
pattern_miner

Pattern miner
outliers_boxplot

Outlier removal by boxplot (IQR rule)
pat_eclat

ECLAT itemsets
na_removal

Missing value removal
plot_boxplot

Plot boxplot
cluster_dbscan

DBSCAN
feature_generation

Feature generation
cluster_gmm

Gaussian mixture model clustering (GMM)
plot_boxplot_class

Boxplot per class
fit_curvature_max

Maximum curvature analysis (elbow detection)
dal_transform

DAL Transform
fit.cluster_dbscan

fit dbscan model
hierarchy_cut

Hierarchy mapping by cut
fit.cla_tune

tune hyperparameters of ml model
fit_curvature_min

Minimum curvature analysis (elbow detection)
inverse_transform

Inverse Transform
outliers_gaussian

Outlier removal by Gaussian 3-sigma rule
imputation_simple

Simple imputation
plot_bar

Plot bar graph
plot_dendrogram

Plot dendrogram
plot_correlation

Plot correlation
plot_groupedbar

Plot grouped bar
plot_pair_adv

Plot advanced scatter matrix
pat_apriori

Apriori rules
feature_selection_corr

Feature selection by correlation
plot_pixel

Plot pixel visualization
plot_pair

Plot scatter matrix
plot_lollipop

Plot lollipop
plot_pieplot

Plot pie
plot_hist

Plot histogram
plot_stackedbar

Plot stacked bar
plot_ts_pred

Plot time series with predictions
plot_parallel

Plot parallel coordinates
plot_points

Plot points
plot_radar

Plot radar
plot_scatter

Scatter graph
plot_density_class

Plot density per class
plot_density

Plot density
reg_dtree

Decision Tree for regression
plot_ts

Plot time series chart
plot_series

Plot series
predictor

Predictor (base for classification/regression)
sample_simple

Simple sampling
sample_random

Random sampling
reg_rf

Random Forest for regression
reg_lm

Linear regression (lm)
reg_knn

K-Nearest Neighbors (KNN) Regression
reg_svm

SVM for regression
sample_cluster

Cluster sampling
sample_balance

Class balancing (up/down sampling)
smoothing_inter

Smoothing by equal interval
reg_tune

Regression tuning (k-fold CV)
select_hyper

Selection of hyperparameters
sample_stratified

Stratified sampling
regression

Regression base class
reg_mlp

MLP for regression
select_hyper.cla_tune

selection of hyperparameters
set_params

Assign parameters
set_params.default

Default Assign parameters
smoothing

Smoothing (binning/quantization)
train_test_from_folds

k-fold training and test partition object
train_test

Train-Test Partition
transform

Transform
zscore

Z-score normalization
smoothing_freq

Smoothing by equal frequency
smoothing_cluster

Smoothing by clustering (k-means)
action.dal_transform

Action implementation for transform
adjust_matrix

Adjust to matrix
aggregation

Aggregation by groups
adjust_data.frame

Adjust to data frame
autoenc_base_e

Autoencoder base (encoder)
adjust_class_label

Adjust categorical mapping
autoenc_base_ed

Autoencoder base (encoder + decoder)
Boston

Boston Housing Data (Regression)
action

Action
adjust_factor

Adjust factors
cla_glm

Logistic regression (GLM)
cla_glmnet

LASSO logistic regression (glmnet)
cla_majority

Majority baseline classifier
cla_knn

K-Nearest Neighbors (KNN) Classification
cla_nb

Naive Bayes Classifier
categ_mapping

Categorical mapping (one‑hot encoding)
cla_bagging

Bagging (ipred)
cluster

Cluster
cluster_cmeans

Fuzzy c-means
cla_tune

Classification tuning (k-fold CV)
cla_xgboost

XGBoost
cla_rf

Random Forest for classification
cla_multinom

Multinomial logistic regression
cla_mlp

MLP for classification
classification

Classification base class
clu_tune

Clustering tuning (intrinsic metric)
cla_dtree

Decision Tree for classification
cla_rpart

CART (rpart)
cla_boosting

Boosting (adabag)