check_data: Sanity check for input data before imputation
Description
The function `check_data()` serves the purpose of performing a preliminary check and fix some evident issues. However, the function cannot resolve all data quality-related problems.
An integer specifying the maximum number of levels allowed for a factor variable. This is used to detect potential ID columns that are often non-informative for imputation. Default: 50% of the number of rows, rounded to the nearest integer.
verbose
Verbose setting. If TRUE, will print out warnings when data issues are found. Default: TRUE.