stream_read_csv

path
The path to the file. Needs to be accessible from the cluster.
Supports the <samp>"hdfs://"</samp>, <samp>"s3a://"</samp> and <samp>"file://"</samp> protocols.

name
The name to assign to the newly generated stream.

header
Boolean; should the first row of data be used as a header?
Defaults to <code>TRUE</code>.

columns
A vector of column names or a named vector of column types.

delimiter
The character used to delimit each column. Defaults to <samp>','</samp>.

quote
The character used as a quote. Defaults to <samp>'"'</samp>.

escape
The character used to escape other characters. Defaults to <samp>'\'</samp>.

charset
The character set. Defaults to <samp>"UTF-8"</samp>.

null_value
The character to use for null, or missing, values. Defaults to <code>NULL</code>.

options
A list of strings with additional options.

Optional arguments; currently unused.
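As a rough sketch of how several of these arguments combine, the following reads a semicolon-delimited file with an explicit schema. It assumes a local Spark installation and that the function takes a Spark connection (here `sc`, from `spark_connect()`) as its first argument, which the argument list above does not show. The directory names, the sample data, and the type names `"integer"` and `"character"` are illustrative choices, not values taken from this page.

```r
library(sparklyr)

# Assumption: Spark is installed locally and reachable via master = "local".
sc <- spark_connect(master = "local")

# Hypothetical input and output directories under the R session's temp dir.
in_dir <- file.path(tempdir(), "csv-semicolon-in")
out_dir <- file.path(tempdir(), "csv-semicolon-out")
dir.create(in_dir, showWarnings = FALSE)

# Write one small semicolon-delimited file with a header row.
write.table(
  data.frame(id = 1:3, label = c("a", "b", "c")),
  file = file.path(in_dir, "data.csv"),
  sep = ";", row.names = FALSE, quote = FALSE
)

# Read the directory as a stream, naming the stream and giving column types
# as a named vector instead of relying on schema inference.
stream <- stream_read_csv(
  sc,
  path = paste0("file://", in_dir),
  name = "labels",
  header = TRUE,
  columns = c(id = "integer", label = "character"),
  delimiter = ";"
) %>%
  stream_write_csv(paste0("file://", out_dir))

# Stop the stream and close the connection when done.
stream_stop(stream)
spark_disconnect(sc)
```

Passing `columns` as a named vector fixes both the column names and their types up front, which can be preferable when the first files to arrive are not representative of the rest.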

Reads a CSV stream as a Spark dataframe stream.
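A minimal end-to-end sketch, assuming a local Spark installation: write a CSV file into a folder, read that folder as a stream, and write the stream back out as CSV. The connection object `sc`, the folder names `csv-in` and `csv-out`, and the use of `stream_write_csv()` as the sink are assumptions made for illustration.

```r
library(sparklyr)

# Assumption: Spark is installed locally.
sc <- spark_connect(master = "local")

# Create an input folder and drop a single CSV file into it.
dir.create("csv-in", showWarnings = FALSE)
write.csv(iris, "csv-in/data.csv", row.names = FALSE)

# Build a "file://" path that the (local) cluster can resolve.
csv_path <- paste0("file://", normalizePath("csv-in"))

# Read the folder as a stream and write each batch to "csv-out".
stream <- stream_read_csv(sc, csv_path) %>%
  stream_write_csv("csv-out")

# Stop the stream and disconnect once finished.
stream_stop(stream)
spark_disconnect(sc)
```

The stream keeps watching the input folder until it is stopped, so files added to `csv-in` while it runs should also be picked up.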

R interface to Apache Spark, a fast and general engine for big data
processing, see <http://spark.apache.org>. This package supports connecting to
local and remote Apache Spark clusters, provides a 'dplyr' compatible back-end,
and provides an interface to Spark's built-in machine learning algorithms.

stream_read_csv: Read CSV Stream