sparklyr (version 0.3.5)

spark_read_parquet: Read a Parquet file into a Spark DataFrame

Description

Read a Parquet file into a Spark DataFrame

Usage

spark_read_parquet(sc, name, path, options = list(), repartition = 0, memory = TRUE, overwrite = TRUE)

Arguments

sc
The Spark connection
name
The name to assign to the newly generated table
path
The path to the file. Needs to be accessible from the cluster. Supports the "hdfs://", "s3n://", and "file://" protocols.
options
A list of strings with additional options
repartition
The number of partitions used to distribute the generated table, or 0 (the default) to avoid partitioning
memory
Boolean; load the data eagerly into memory (that is, cache the table)
overwrite
Overwrite the table with the given name if it already exists
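
For example, a minimal sketch of a typical call (the connection master, path, and table name are illustrative placeholders, not part of this package's documentation):

library(sparklyr)

# Connect to a local Spark instance (illustrative master; use your cluster's)
sc <- spark_connect(master = "local")

# Read into 8 partitions, cache the result, and replace any existing "events" table
events_tbl <- spark_read_parquet(sc, name = "events",
                                 path = "file:///tmp/events.parquet",
                                 repartition = 8, memory = TRUE, overwrite = TRUE)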

Details

You can read data from HDFS (hdfs://), S3 (s3n://), as well as the local file system (file://).
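
As a sketch, the URI scheme is the only part of the call that changes between backends (the paths and table names below are placeholders):

# The same call works across storage backends; only the URI scheme differs
spark_read_parquet(sc, "local_tbl", "file:///data/events.parquet")
spark_read_parquet(sc, "hdfs_tbl", "hdfs:///data/events.parquet")
spark_read_parquet(sc, "s3_tbl", "s3n://my-bucket/events.parquet")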

If you are reading from a secure S3 bucket, be sure that the AWS_ACCESS_KEY_ID and AWS_SECRET_ACCESS_KEY environment variables are both defined.
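
For example, you could define them from R before connecting (the key values and bucket path shown are placeholders):

# Placeholder credentials; substitute your own before reading from a secure bucket
Sys.setenv(AWS_ACCESS_KEY_ID = "your-access-key-id",
           AWS_SECRET_ACCESS_KEY = "your-secret-access-key")

tbl <- spark_read_parquet(sc, name = "s3_data",
                          path = "s3n://my-bucket/path/data.parquet")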

See Also

Other reading and writing data: spark_read_csv, spark_read_json, spark_write_csv, spark_write_json, spark_write_parquet