Extract the top N percent of values of a column and return them in an H2OFrame.
h2o.topN(x, column, nPercent)
An H2OFrame with two columns: the first contains the original row indices, and the second contains the top N percent of values.
x: an H2OFrame.
column: a column name or column index from which to take the top N percent of values.
nPercent: the percentage of top values to return (for example, 1 for the top 1 percent).
if (FALSE) {
library(h2o)
h2o.init()
# Import a public test dataset
f <- "https://s3.amazonaws.com/h2o-public-test-data/bigdata/laptop/jira/TopBottomNRep4.csv.zip"
dataset <- h2o.importFile(f)
frameNames <- names(dataset)
# Pick a random percentage (1 to 4) and a random column
nPercent <- c(1, 2, 3, 4)
nP <- sample(nPercent, 1)
colIndex <- sample(seq_along(frameNames), 1)
# Extract the top nP percent of values from the chosen column
h2o.topN(dataset, frameNames[colIndex], nP)
}
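To make the shape of the result easier to picture without a running H2O cluster, here is a minimal base-R sketch of a comparable query on a local numeric vector. The helper `top_n_percent` is hypothetical and not part of h2o. Its rounding rule (round up, keep at least one row) and its 1-based row indices are assumptions made for illustration, and `h2o.topN` may behave differently on both points.

```r
# Hypothetical helper (not part of h2o): a local analogue of a
# top-N-percent query, returning row indices alongside the values.
top_n_percent <- function(values, n_percent) {
  stopifnot(is.numeric(values), n_percent > 0, n_percent <= 100)
  # Assumed rounding rule: round up and keep at least one row.
  k <- max(1L, ceiling(length(values) * n_percent / 100))
  # Indices of the k largest values, in decreasing order of value.
  idx <- order(values, decreasing = TRUE)[seq_len(k)]
  data.frame(row_index = idx, value = values[idx])
}

vals <- c(5, 42, 17, 8, 99, 23, 1, 64, 30, 12)
res <- top_n_percent(vals, 20)  # top 20 percent of 10 values
print(res)
```

As with the value described above, the first column holds the original row positions and the second holds the selected values, so the result can be joined back to the source data if needed.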