mlr3
A clean, object-oriented rewrite of mlr.
Installation
remotes::install_github("mlr-org/mlr3")Why a rewrite?
mlr was first released to CRAN in 2013. Its core design and architecture date back even further. The addition of many features has led to a feature creep which makes mlr hard to maintain and hard to extend. We also think that while mlr was nicely extensible in some parts (learners, measures, etc.), other parts were less easy to extend from the outside. Also, many helpful R libraries did not exist at the time mlr was created, and their inclusion would result in non-trivial API changes.
Design principles
- Only the basic building blocks for machine learning are implemented in this package.
- Focus on computation here. No visualization or other stuff. That can go in extra packages.
- Overcome the limitations of R's S3 classes with the help of R6.
- Embrace R6, clean OO-design, object state-changes and reference semantics. This might be less "traditional R", but seems to fit mlrnicely.
- Embrace data.tablefor fast and convenient data frame computations.
- Combine data.tableandR6, for this we will make heavy use of list columns in data.tables.
- Be light on dependencies. mlr3requires the following packages:- backports: Ensures backward compatibility with older R releases. Developed by members of the- mlrteam. No recursive dependencies.
- checkmate: Fast argument checks. Developed by members of the- mlrteam. No extra recursive dependencies.
- mlr3miscMiscellaneous functions used in multiple mlr3 extension packages. Developed by the- mlrteam. No extra recursive dependencies.
- paradox: Descriptions for parameters and parameter sets. Developed by the- mlrteam. No extra recursive dependencies.
- R6: Reference class objects. No recursive dependencies.
- data.table: Extension of R's- data.frame. No recursive dependencies.
- digest: Hash digests. No recursive dependencies.
- lgr: Logging facility. No extra recursive dependencies.
- Metrics: Package which implements performance measures. No recursive dependencies.
- mlbench: A collection of machine learning data sets. No dependencies.
 
- Reflections: Objects are queryable for properties and capabilities, allowing you to programm on them.
- Additional functionality that comes with extra dependencies:- For parallelization, mlr3utilizes thefutureandfuture.applypackages.
- To capture output, warnings and exceptions, evaluateandcallrcan be used.
 
- For parallelization, 
Resources
- Reference Manual
- Extension packages.
- We started to write a book, but it is still very unfinished.
- useR2019 talks