Learn R Programming

mlr3data (version 0.7.0)

titanic: Titanic

Description

Classification data to predict the fate of passengers on the ocean liner "Titanic". Contains 10 features and 1309 observations. Target column is "Survived".

Arguments

Pre-processing

  • All column names have been changed to snake_case.

  • training and test set have been joined. Observations of the test set have a missing value in the target column "survived".

  • Column '"survived"' has been re-encoded to a factor with levels '"yes"' and '"no"'.

  • Id column has been removed.

  • Passenger class "pclass" has been converted to an ordered factor.

  • Features "sex" and "embarked" have been converted to factors.

  • Empty strings in "cabin" and "embarked" have been encoded as missing values.

Examples

Run this code
data("titanic", package = "mlr3data")
str(titanic)

Run the code above in your browser using DataLab