Learn R Programming

mlr3 (version 0.12.0)

TaskClassif: Classification Task

Description

This task specializes Task and TaskSupervised for classification problems. The target column is assumed to be a factor or ordered factor. The task_type is set to "classif".

Additional task properties include:

  • "twoclass": The task is a binary classification problem.

  • "multiclass": The task is a multiclass classification problem.

Predefined tasks are stored in the dictionary mlr_tasks.

Arguments

Super classes

mlr3::Task -> mlr3::TaskSupervised -> TaskClassif

Active bindings

class_names

(character()) Returns all class labels of the target column.

positive

(character(1)) Stores the positive class for binary classification tasks, and NA for multiclass tasks. To switch the positive class, assign a level to this field.

negative

(character(1)) Stores the negative class for binary classification tasks, and NA for multiclass tasks.

Methods

Public methods

Method new()

Creates a new instance of this R6 class. The function as_task_classif() provides an alternative way to construct classification tasks.

Usage

TaskClassif$new(id, backend, target, positive = NULL, extra_args = list())

Arguments

id

(character(1)) Identifier for the new instance.

backend

(DataBackend) Either a DataBackend, or any object which is convertible to a DataBackend with as_data_backend(). E.g., a data.frame() will be converted to a DataBackendDataTable.

target

(character(1)) Name of the target column.

positive

(character(1)) Only for binary classification: Name of the positive class. The levels of the target columns are reordered accordingly, so that the first element of $class_names is the positive class, and the second element is the negative class.

extra_args

(named list()) Named list of constructor arguments, required for converting task types via convert_task().

Method data()

Calls $data from parent class Task and ensures that levels of the target column are in the right order.

Usage

TaskClassif$data(
  rows = NULL,
  cols = NULL,
  data_format = "data.table",
  ordered = TRUE
)

Arguments

rows

integer() Row indices.

cols

character() Column names.

data_format

(character(1)) Desired data format, e.g. "data.table" or "Matrix".

ordered

(logical(1)) If TRUE (default), data is ordered according to the columns with column role "order".

Returns

Depending on the DataBackend, but usually a data.table::data.table().

Method truth()

True response for specified row_ids. Format depends on the task type. Defaults to all rows with role "use".

Usage

TaskClassif$truth(rows = NULL)

Arguments

rows

integer() Row indices.

Returns

factor().

Method droplevels()

Updates the cache of stored factor levels, removing all levels not present in the current set of active rows. cols defaults to all columns with storage type "factor" or "ordered". Also updates the task property "twoclass"/"multiclass".

Usage

TaskClassif$droplevels(cols = NULL)

Arguments

cols

character() Column names.

Returns

Modified self.

Method clone()

The objects of this class are cloneable with this method.

Usage

TaskClassif$clone(deep = FALSE)

Arguments

deep

Whether to make a deep clone.

See Also

  • Chapter in the mlr3book: https://mlr3book.mlr-org.com/tasks.html

  • Package mlr3data for more toy tasks.

  • Package mlr3oml for downloading tasks from https://openml.org.

  • Package mlr3viz for some generic visualizations.

  • Dictionary of Tasks: mlr_tasks

  • as.data.table(mlr_tasks) for a table of available Tasks in the running session (depending on the loaded packages).

  • Extension packages for additional task types:

    • mlr3proba for probabilistic supervised regression and survival analysis.

    • mlr3cluster for unsupervised clustering.

Other Task: TaskRegr, TaskSupervised, TaskUnsupervised, Task, mlr_tasks_boston_housing, mlr_tasks_breast_cancer, mlr_tasks_german_credit, mlr_tasks_iris, mlr_tasks_mtcars, mlr_tasks_penguins, mlr_tasks_pima, mlr_tasks_sonar, mlr_tasks_spam, mlr_tasks_wine, mlr_tasks_zoo, mlr_tasks

Examples

Run this code
# NOT RUN {
data("Sonar", package = "mlbench")
task = as_task_classif(Sonar, target = "Class", positive = "M")

task$task_type
task$formula()
task$truth()
task$class_names
task$positive
task$data(rows = 1:3, cols = task$feature_names[1:2])
# }

Run the code above in your browser using DataLab