Learn R Programming

rHpcc (version 1.0)

ECLDedUp: Creates an ECL "DEDUP" definition.

Description

The DEDUP function evaluates the recordset for duplicate records, as defined by the condition parameter, and returns a unique return set. This is similar to the DISTINCT statement in SQL. The recordset should be sorted, unless ALL is specified

Arguments

Examples

Run this code
  ## Not run: 
#     ecl1 <- ECL$new(hostName="127.0.0.1", port="8008")
#     recPerson <- ECLRecord$new(name="rec_person")
#     recPerson$addField("STRING", "code")
#     recPerson$addField("STRING", "firstName")
#     recPerson$addField("STRING", "lastName")
#     recPerson$addField("STRING", "address")
#     recPerson$addField("STRING", "stateCode")
#     recPerson$addField("STRING", "city")
#     recPerson$addField("STRING", "zip")
#     ecl1$add(recPerson)
#     
#     dsPerson <- ECLDataset$new(name="ds_person", datasetType = recPerson, 
#                   logicalFileName ="~ds::person", fileType="CSV")
#     ecl1$add(dsPerson)
#     
#     recPersonTable <- ECLRecord$new(name="personNewTableFormat")
#     recPersonTable$addField(dsPerson$getName(), "code", seperator=".")
#     recPersonTable$addField(dsPerson$getName(), "firstName", seperator=".")
#     recPersonTable$addField(dsPerson$getName(), "lastName", seperator=".")
#     
#     ecl1$add(recPersonTable)
#     
#     tblPerson <- ECLTable$new(name="PersonNewTable", 
#                   inDataset = dsPerson, format= recPersonTable)
#     ecl1$add(tblPerson)
#     
#     PersonNewTableSorted <- ECLSort$new(name="PersonNewTableSorted", 
#                               inDataset = tblPerson)
#     PersonNewTableSorted$addField("lastName")
#     ecl1$add(PersonNewTableSorted)
#     
#     mySets <- ECLDedUp$new(name="mySets", 
#                     inDataset = PersonNewTableSorted)
#     mySets$addField("lastName")
#     ecl1$add(mySets)
#     ecl1$print()
#   ## End(Not run)

Run the code above in your browser using DataLab