Physiological data of patients tested for breast cancer.
breastcancerA data frame containing 699 patients (rows) and 9 variables (columns).
Clump Thickness
Uniformity of Cell Size
Uniformity of Cell Shape
Marginal Adhesion
Single Epithelial Cell Size
Bare Nuclei
Bland Chromatin
Normal Nucleoli
Mitoses
Criterion: Absence/presence of breast cancer.
Values: FALSE vs. TRUE (65.0% vs.\ 35.0%).
We made the following enhancements to the original data for improved usability:
The ID number of the cases was excluded.
The numeric criterion with value 2 for benign and 4 for malignant was converted to logical (i.e., TRUE/FALSE).
16 cases were excluded because they contained NA values.
Other than that, the data remains consistent with the original dataset.
Other datasets:
blood,
car,
contraceptive,
creditapproval,
fertility,
forestfires,
heart.cost,
heart.test,
heart.train,
heartdisease,
iris.v,
mushrooms,
sonar,
titanic,
voting,
wine