remove_duplicates

When removing duplicates, users can specify a set columns to consider with
the <code>target_columns</code> argument.

Cleaning and standardizing tabular data package, tailored
specifically for curating epidemiological data. It streamlines various
data cleaning tasks that are typically expected when working with
datasets in epidemiology. It returns the processed data in the same
format, and generates a comprehensive report detailing the outcomes of
each cleaning task.

Karim Mané

cleanepi

Clean and Standardize Epidemiological Data

Thibaut Jombart

Abdoelnaser Degoot

Bankolé Ahadzie

Nuredin Mohammed

Bubacarr Bah

Hugo Gruson

Pratik R. Gupte

James M. Azam

Joshua W. Lambert

Chris Hartgerink

Andree Valle-Campos

London School of Hygiene and Tropical Medicine, LSHTM 

data.org 

remove_duplicates function

<dl><dt>data</dt>
<dd>The input <code>&lt;data.frame&gt;</code> or <code>&lt;linelist&gt;</code>.</dd>
<dt>target_columns</dt>
<dd>A <code>&lt;vector&gt;</code> of column names to use when looking
for duplicates. When the input data is a <code>linelist</code> object, this
parameter can be set to <code>linelist_tags</code> if you wish to look for
duplicates on tagged columns only. Default is <code>NULL</code>.</dd></dl>

Arguments

Remove duplicates — remove_duplicates

<dl>

<dt>data</dt>
<dd>The input <code>&lt;data.frame&gt;</code> or <code>&lt;linelist&gt;</code>.</dd>


<dt>target_columns</dt>
<dd>A <code>&lt;vector&gt;</code> of column names to use when looking
for duplicates. When the input data is a <code>linelist</code> object, this
parameter can be set to <code>linelist_tags</code> if you wish to look for
duplicates on tagged columns only. Default is <code>NULL</code>.</dd>

</dl>

Remove duplicates

remove_duplicates: Remove duplicates

Description

Usage

Value

Arguments

Details

Examples