spark_read_csv

name

The path to the file. Needs to be accessible from the cluster. Supports: "hdfs://" or "s3n://"

path

Should the first row of data be used as a header? Defaults to <code>TRUE</code>.

header

The character used to delimit each column, defaults to <code>,</code>.

delimiter

The character used as a quote, defaults to <code>"hdfs://"</code>.

quote

The chatacter used to escape other characters, defaults to <code>\</code>.

escape

The character set, defaults to <code>"UTF-8"</code>.

charset

The character to use for default values, defaults to <code>NULL</code>.

null_value

A list of strings with additional options.

options

Total of partitions used to distribute table or 0 (default) to avoid partitioning

repartition

memory

Overwrite the table with the given name if it already exists

overwrite

Read a CSV file into a Spark DataFrame

Provision, connect and interface to Apache Spark from within R.
This package supports connecting to local and remote Apache Spark clusters,
provides a 'dplyr' compatible back-end, and provides an interface to Spark's
built-in machine learning algorithms.

spark_read_csv: Read a CSV file into a Spark DataFrame

Description

Usage

Arguments

Value

Details

See Also