h2o.drop_duplicates

An H2OFrame object to drop duplicates on.

frame

Columns to compare during the duplicate detection process.

columns

Which rows to keep. The "first" value (default) keeps the first row and delets the rest. 
The "last" keeps the last row.

keep

Drops duplicated rows across specified columns.

R interface for 'H2O', the scalable open source machine learning
platform that offers parallelized implementations of many supervised and
unsupervised machine learning algorithms such as Generalized Linear
Models (GLM), Gradient Boosting Machines (including XGBoost), Random Forests,
Deep Neural Networks (Deep Learning), Stacked Ensembles, Naive Bayes,
Generalized Additive Models (GAM), Cox Proportional Hazards, K-Means, PCA,
Word2Vec, as well as a fully automatic machine learning algorithm (H2O AutoML).

Erin LeDell

R Interface for the 'H2O' Scalable Machine Learning Platform

Navdeep Gill

Spencer Aiello

Anqi Fu

Arno Candel

Cliff Click

Tom Kraljevic

Tomas Nykodym

Patrick Aboyoun

Michal Kurka

Michal Malohlava

Ludi Rehak

Eric Eckstrand

Brandon Hill

Sebastian Vidrio

Surekha Jadhawani

Amy Wang

Raymond Peck

Wendy Wong

Jan Gorecki

Matt Dowle

Yuan Tang

Lauren DiPerna

H2O.ai 

h2o.drop_duplicates function

Drops duplicated rows. — h2o.drop_duplicates

Drops duplicated rows.

h2o.drop_duplicates: Drops duplicated rows.

Description

Usage

Arguments

Examples