ChocoLattes (version 0.1.0)

remove_duplicates: Function to remove duplicated entries

Description

Removes duplicated items from a production data frame, based on DOI, ISBN, and Title. The function also removes any rows where the value of "Year" is greater than the current year, since these are considered errors.

Usage

remove_duplicates(x)

Arguments

x
Data frame containing information on a certain type of production (journal papers, conference papers, etc.). Must contain columns "Title" and "Year". Columns "DOI" and "ISBN" are optional but very useful.

Value

Data frame with duplicated entries, and any rows whose "Year" is later than the current year, removed.
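
Examples

The following is a minimal base-R sketch of the behaviour described above, included only to illustrate what kind of rows are expected to be dropped. It is not the ChocoLattes implementation: the helper name `dedupe_sketch`, the case-insensitive and whitespace-trimmed title comparison, the choice to keep rows with a missing year, and the sample data are all assumptions made for illustration.

```r
# Hypothetical sketch of the documented behaviour; NOT the package's code.
# The real remove_duplicates() may use different matching rules.
dedupe_sketch <- function(x) {
  stopifnot(is.data.frame(x), all(c("Title", "Year") %in% names(x)))

  # Drop rows whose Year lies in the future (treated as errors).
  # Assumption: Year is numeric; rows with a missing Year are kept.
  current_year <- as.integer(format(Sys.Date(), "%Y"))
  x <- x[is.na(x$Year) | x$Year <= current_year, , drop = FALSE]

  # Drop repeated identifiers in the optional columns, if present.
  # Missing or empty identifiers are never treated as duplicates.
  for (col in intersect(c("DOI", "ISBN"), names(x))) {
    key <- as.character(x[[col]])
    has_key <- !is.na(key) & nzchar(key)
    x <- x[!(has_key & duplicated(key)), , drop = FALSE]
  }

  # Drop repeated titles. Assumption: compare ignoring case and
  # leading/trailing whitespace.
  title_key <- tolower(trimws(as.character(x$Title)))
  x[!duplicated(title_key), , drop = FALSE]
}

# Made-up sample data for illustration only.
papers <- data.frame(
  Title = c("A study of X", "a study of x ", "Another paper",
            "Future work", "Third paper"),
  Year  = c(2015, 2015, 2016, 9999, 2017),
  DOI   = c("10.1000/a", NA, "10.1000/b", "10.1000/c", "10.1000/b"),
  stringsAsFactors = FALSE
)

cleaned <- dedupe_sketch(papers)
print(cleaned$Title)
```

In this sketch, "Future work" is dropped for its year, "Third paper" for its repeated DOI, and the second "a study of x" for its repeated title. With the package loaded, the equivalent call would presumably be `remove_duplicates(papers)`, following the Usage line above; its exact output may differ from this sketch.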