datadr (version 0.8.4)

hdfsConn: Connect to Data Source on HDFS

Description

Connect to a data source on HDFS

Usage

hdfsConn(loc, type = "sequence", autoYes = FALSE, reset = FALSE,
  verbose = TRUE)

Arguments

loc
location on HDFS for the data source
type
the type of data ("map", "sequence", "text")
autoYes
automatically answer "yes" to questions about creating a path on HDFS
reset
should existing metadata for this object be overwritten?
verbose
logical - print messages about what is being done

Value

  • a "kvConnection" object of class "hdfsConn"

Details

This simply creates a "connection" to a directory on HDFS (which need not have data in it). To actually do things with this data, see ddo, etc.

See Also

addData, ddo, ddf, localDiskConn

Examples

Run this code
# connect to empty HDFS directory
  conn <- hdfsConn("/test/irisSplit")
  # add some data
  addData(conn, list(list("1", iris[1:10,])))
  addData(conn, list(list("2", iris[11:110,])))
  addData(conn, list(list("3", iris[111:150,])))
  # represent it as a distributed data frame
  hdd <- ddf(conn)

Run the code above in your browser using DataLab