sparklyr (version 1.0.4)

stream_read_json: Read JSON Stream

Description

Reads a JSON stream as a Spark dataframe stream.

Usage

stream_read_json(sc, path, name = NULL, columns = NULL,
  options = list(), ...)

Arguments

sc

A spark_connection.

path

The path to the file. Needs to be accessible from the cluster. Supports the "hdfs://", "s3a://" and "file://" protocols.

name

The name to assign to the newly generated stream.

columns

A vector of column names or a named vector of column types (see the sketch following this argument list).

options

A named list of strings with additional options passed to the Spark JSON reader.

...

Optional arguments; currently unused.
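
The snippet below (not part of the original help page) is a minimal sketch of how columns and options can be supplied, assuming an existing spark_connection sc and a hypothetical input directory; the column types mirror the data written in the Examples section, and "multiLine" is a standard Spark JSON reader option shown purely for illustration.

stream_read_json(
  sc,
  path    = "file:///tmp/json-in",          # hypothetical input directory
  name    = "json_in_stream",
  columns = c(a = "double", b = "double"),  # named vector of column types
  options = list(multiLine = "false")       # reader options passed as strings
)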

See Also

Other Spark stream serialization: stream_read_csv, stream_read_kafka, stream_read_orc, stream_read_parquet, stream_read_socket, stream_read_text, stream_write_console, stream_write_csv, stream_write_json, stream_write_kafka, stream_write_memory, stream_write_orc, stream_write_parquet, stream_write_text

Examples

# NOT RUN {
library(sparklyr)

sc <- spark_connect(master = "local")

dir.create("json-in")
jsonlite::write_json(list(a = c(1,2), b = c(10,20)), "json-in/data.json")

json_path <- file.path("file://", getwd(), "json-in")

stream <- stream_read_json(sc, json_path) %>% stream_write_json("json-out")

stream_stop(stream)
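
# Illustrative follow-up (not part of the original example): once the stream
# has had time to write output, the sink directory can be read back into
# Spark as a static dataset; the table name "json_out" is arbitrary.
json_out <- spark_read_json(sc, name = "json_out", path = "json-out")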

# }
