
Complete benchmark experiment to compare different learning algorithms across one or more tasks with respect to a given resampling strategy. Experiments are paired, meaning that the same training/test sets are used for all learners. You can also pass "enhanced" learners via wrappers, e.g., a learner can be automatically tuned using makeTuneWrapper (see the sketch after the usage block).
benchmark(
learners,
tasks,
resamplings,
measures,
keep.pred = TRUE,
keep.extract = FALSE,
models = FALSE,
show.info = getMlrOption("show.info")
)
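As mentioned above, wrapped learners are benchmarked like any other learner. A minimal sketch of tuning rpart's cp inside each outer resampling iteration; the parameter range, random-search budget, and inner 3-fold CV below are illustrative assumptions, not recommendations:

library(mlr)
# Hypothetical tuning setup: search cp over an assumed range with a small random budget
ps = makeParamSet(makeNumericParam("cp", lower = 0.001, upper = 0.1))
ctrl = makeTuneControlRandom(maxit = 5L)
inner = makeResampleDesc("CV", iters = 3L)
tuned.rpart = makeTuneWrapper("classif.rpart", resampling = inner,
  par.set = ps, control = ctrl)
# The wrapper is passed to benchmark() just like a plain learner
bmr = benchmark(list(makeLearner("classif.lda"), tuned.rpart),
  list(iris.task), makeResampleDesc("CV", iters = 2L))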
learners
(list of Learner | character)
Learning algorithms which should be compared; can also be a single learner. If you pass strings, the learners are created via makeLearner.
tasks
(list of Task)
Tasks that the learners should be run on.
resamplings
(list of ResampleDesc | ResampleInstance)
Resampling strategy for each task. If only one is provided, it is replicated to match the number of tasks. If missing, a 10-fold cross-validation is used.
measures
(list of Measure)
Performance measures for all tasks. If missing, the default measure of the first task is used.
keep.pred
(logical(1))
Keep the prediction data in the pred slot of the result object. If you run many experiments on larger data sets, these objects can unnecessarily increase object size / memory usage if you do not actually need them. Default is TRUE.
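For large benchmark studies where the raw predictions are not needed, a sketch of turning this off to save memory, assuming lrns, tasks, and rdesc as defined in the examples at the bottom of this page; the aggregated performance values should remain available:

bmr = benchmark(lrns, tasks, rdesc, keep.pred = FALSE)
getBMRAggrPerformances(bmr, as.df = TRUE)  # aggregated scores, no stored predictions needed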
keep.extract
(logical(1))
Keep the extract slot of the result object. When creating many benchmark results with extensive tuning, the resulting R objects can become very large. That is why the tuning results stored in the extract slot are removed by default (keep.extract = FALSE). Note that with keep.extract = FALSE you will not be able to analyze the tuning results afterwards.
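Conversely, to analyze tuning results afterwards, keep the extract slot. A minimal sketch, reusing the hypothetical tuned.rpart wrapper from the sketch above:

bmr = benchmark(list(tuned.rpart), list(iris.task),
  makeResampleDesc("CV", iters = 2L), keep.extract = TRUE)
getBMRTuneResults(bmr)  # accessible only because keep.extract = TRUE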
models
(logical(1))
Should all fitted models be stored in the ResampleResult? Default is FALSE.
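If you need the fitted models themselves, e.g., to inspect coefficients, a sketch, again assuming lrns, tasks, and rdesc as in the examples below:

bmr = benchmark(lrns, tasks, rdesc, models = TRUE)
mods = getBMRModels(bmr)  # nested list: task id -> learner id -> one WrappedModel per iteration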
show.info
(logical(1))
Print verbose output on the console? Default is set via configureMlr.
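To silence the console output for a whole session instead of per call, a sketch using configureMlr:

configureMlr(show.info = FALSE)  # subsequent benchmark() calls run quietly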
Other benchmark: BenchmarkResult, batchmark(), convertBMRToRankMatrix(), friedmanPostHocTestBMR(), friedmanTestBMR(), generateCritDifferencesData(), getBMRAggrPerformances(), getBMRFeatSelResults(), getBMRFilteredFeatures(), getBMRLearnerIds(), getBMRLearnerShortNames(), getBMRLearners(), getBMRMeasureIds(), getBMRMeasures(), getBMRModels(), getBMRPerformances(), getBMRPredictions(), getBMRTaskDescs(), getBMRTaskIds(), getBMRTuneResults(), plotBMRBoxplots(), plotBMRRanksAsBarChart(), plotBMRSummary(), plotCritDifferences(), reduceBatchmarkResults()
# Compare LDA and rpart on two classification tasks with 2-fold CV
library(mlr)
lrns = list(makeLearner("classif.lda"), makeLearner("classif.rpart"))
tasks = list(iris.task, sonar.task)
rdesc = makeResampleDesc("CV", iters = 2L)
meas = list(acc, ber)
bmr = benchmark(lrns, tasks, rdesc, measures = meas)

# Rank the learners per task and visualize the results
rmat = convertBMRToRankMatrix(bmr)
print(rmat)
plotBMRSummary(bmr)
plotBMRBoxplots(bmr, ber, style = "violin")
plotBMRRanksAsBarChart(bmr, pos = "stack")

# Statistical comparison of the learners across tasks
friedmanTestBMR(bmr)
friedmanPostHocTestBMR(bmr, p.value = 0.05)