Chooses the best duplicate, defined as the duplicate with the
fewest missing values. In case of ties, it picks the first
duplicate, as it is the one most likely to be valid and
authentic, given practice effects.
Usage
best_duplicate(data, id, keep.rows = FALSE)
Value
A data frame containing only the "best" duplicates.
Arguments
data
The data frame.
id
The ID variable to check for duplicates.
keep.rows
Logical, whether to add a column at the beginning
of the data frame with the original row indices.
Details
For the easystats equivalent, see:
datawizard::data_unique().
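
The selection rule described at the top of this page (fewest missing values, ties going to the first occurrence) can be illustrated with a minimal base R sketch. The helper `pick_best_rows()` and the example data are hypothetical and are not the package's implementation. The sketch also assumes that missing values are counted across all columns, which may differ from what `best_duplicate()` does internally.

```r
# Hypothetical helper illustrating the selection rule;
# not the actual implementation of best_duplicate().
pick_best_rows <- function(data, id) {
  # Number of missing values in each row (assumption: all columns count)
  n_missing <- rowSums(is.na(data))
  idx <- seq_len(nrow(data))
  # Sort by ID, then by fewest NAs, then by original position,
  # so that ties within an ID go to the earliest row
  ord <- order(data[[id]], n_missing, idx)
  # After sorting, the first row seen for each ID is its best row
  keep <- ord[!duplicated(data[[id]][ord])]
  # Return the kept rows in their original order
  data[sort(keep), , drop = FALSE]
}

# Example data: IDs 1 and 3 each appear twice
df <- data.frame(
  id    = c(1, 2, 3, 1, 3),
  item1 = c(NA, 1, 1, 2, 3),
  item2 = c(NA, 1, 1, 2, 3),
  item3 = c(NA, 1, 1, 2, 3)
)

result <- pick_best_rows(df, "id")
print(result)
```

In this example the sketch keeps original rows 2, 3, and 4. For ID 1, row 4 is kept because row 1 has three missing values. For ID 3, rows 3 and 5 are tied with no missing values, so the first one (row 3) is kept.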