spark_read_csv

name

The name to assign to the resulting table.

path

The path to the file. It must be accessible from the cluster. Supports the "hdfs://" and "s3n://" protocols.

header

Should the first row of data be used as a header? Defaults to <code>TRUE</code>.

delimiter

The character used to delimit each column. Defaults to <code>,</code>.

quote

The character used as a quote. Defaults to <code>"</code>.

escape

The character used to escape other characters. Defaults to <code>\</code>.

charset

The character set. Defaults to <code>"UTF-8"</code>.

null_value

The character to use for null, or missing, values. Defaults to <code>NULL</code>.

options

A list of strings with additional options.

repartition

The number of partitions used to distribute the table, or 0 (the default) to avoid partitioning.

memory

Should the table be loaded eagerly into memory (that is, cached)?

overwrite

Should an existing table with the given name be overwritten?
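
As an illustration, the following sketch reads a small CSV file using the arguments described above. It assumes a local Spark installation (for example, one set up with <code>spark_install()</code>) and a connection object called <code>sc</code>, neither of which is covered in this section. The table name <code>"cars"</code> and the temporary file are made up for the example. The Spark portion is skipped when sparklyr or a local Spark installation is not found.

```r
# Write a small CSV file so the example is self-contained.
csv_path <- tempfile(fileext = ".csv")
write.csv(head(mtcars), csv_path, row.names = FALSE)

# Assumption: sparklyr and a local Spark installation are available.
# The Spark part below is skipped otherwise.
if (requireNamespace("sparklyr", quietly = TRUE) &&
    nrow(sparklyr::spark_installed_versions()) > 0) {

  # Connect to a local Spark instance.
  sc <- sparklyr::spark_connect(master = "local")

  # Read the CSV file into a Spark DataFrame registered as "cars".
  cars_tbl <- sparklyr::spark_read_csv(
    sc,
    name        = "cars",    # hypothetical table name
    path        = csv_path,  # must be reachable from the cluster
    header      = TRUE,      # first row holds the column names
    delimiter   = ",",
    quote       = "\"",
    escape      = "\\",
    charset     = "UTF-8",
    null_value  = NULL,
    repartition = 0,         # 0 leaves the partitioning unchanged
    memory      = TRUE,      # cache the table
    overwrite   = TRUE       # replace an existing "cars" table
  )

  print(cars_tbl)

  sparklyr::spark_disconnect(sc)
}
```

On a real cluster, <code>path</code> would normally point to shared storage such as an "hdfs://" location rather than a local temporary file, because every worker needs access to it.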

Provision, connect and interface to Apache Spark from within R.
This package supports connecting to local and remote Apache Spark clusters,
provides a dplyr-compatible back-end, and provides an interface to Spark's
built-in machine learning algorithms.

spark_read_csv: Read a CSV file into a Spark DataFrame
