
Please see http://www.win-vector.com/blog/2017/05/managing-spark-data-handles-in-r/ for details. Note: the keys column is initially populated with all of the table's columns, so one usually needs to edit it to name the actual key columns.
tableDescription(tableName, handle, ...,
keyInspector = key_inspector_all_cols)
Arguments:
tableName: name of table to add to join plan.
handle: table or table handle to add to join plan (can already be in the plan).
...: force later arguments to bind by name.
keyInspector: function that determines preferred primary key set for table.
Value: table describing the data.
Please see vignette('DependencySorting', package = 'replyr')
and vignette('joinController', package = 'replyr')
for more details.
See also: buildJoinPlan, keysAreUnique, makeJoinDiagramSpec, executeLeftJoinPlan
library(replyr)

# describe a small local data frame so it can be added to a join plan
d <- data.frame(x = 1:3, y = NA)
tableDescription('d', d)
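
The note about the keys column can be illustrated with a minimal sketch. It assumes that the returned description stores keys as a list column holding one named character vector per described table, and that `x` is the natural key of the toy table. Both are assumptions made for illustration and are not stated on this page.

```r
library(replyr)

# Toy table; 'x' is assumed (for illustration) to be its natural key.
d <- data.frame(x = 1:3, y = NA)

tDesc <- tableDescription('d', d)

# Inspect the default key declaration, which lists every column.
print(tDesc$keys[[1]])

# Declare only 'x' as the key. This assumes keys is a list column of
# named character vectors (key name = column name in the table).
tDesc$keys[[1]] <- c(x = 'x')

print(tDesc$keys[[1]])
```

Narrowing the keys this way is usually done before passing the description to the functions named on this page, such as keysAreUnique or buildJoinPlan.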