datadr (version 0.8.5)

hdfsConn: Connect to Data Source on HDFS

Description

Connect to a data source on HDFS

Usage

hdfsConn(loc, type = "sequence", autoYes = FALSE, reset = FALSE, verbose = TRUE)

Arguments

loc
location on HDFS for the data source
type
the type of data; one of "map", "sequence", or "text" (default "sequence")
autoYes
automatically answer "yes" to questions about creating a path on HDFS
reset
should existing metadata for this object be overwritten?
verbose
logical; should messages about what is being done be printed?

Value

a "kvConnection" object of class "hdfsConn"

Details

This simply creates a "connection" to a directory on HDFS (which need not have data in it). No data is read or processed at this point. To work with the data, pass the connection to ddo or ddf; to put data into the directory, use addData.
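
The same connect, add, wrap pattern can be tried without a Hadoop cluster by swapping in localDiskConn (listed under See Also). The following is a minimal sketch, not part of the original help page; it assumes datadr is installed, and the scratch location `loc` and the object name `irisDdf` are hypothetical names chosen for illustration.

```r
# Sketch only: uses a local-disk connection in place of HDFS so the
# workflow can be tried on a single machine.
loc <- file.path(tempdir(), "irisSplit")  # hypothetical scratch directory

if (requireNamespace("datadr", quietly = TRUE)) {
  library(datadr)

  # Create the connection; autoYes = TRUE creates the directory without
  # prompting. No data exists at this location yet.
  conn <- localDiskConn(loc, autoYes = TRUE)

  # Add data as a list of key-value pairs: list(key, value)
  addData(conn, list(list("1", iris[1:10, ])))
  addData(conn, list(list("2", iris[11:150, ])))

  # Wrap the connection as a distributed data frame
  irisDdf <- ddf(conn)
}
```

With a working Hadoop setup, replacing the localDiskConn call with hdfsConn and an HDFS path should give the equivalent HDFS-backed object.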

See Also

addData, ddo, ddf, localDiskConn

Examples

## Not run: 
# connect to empty HDFS directory
conn <- hdfsConn("/test/irisSplit")
# add some data
addData(conn, list(list("1", iris[1:10, ])))
addData(conn, list(list("2", iris[11:110, ])))
addData(conn, list(list("3", iris[111:150, ])))
# represent it as a distributed data frame
hdd <- ddf(conn)
## End(Not run)
