pysparklyr

Integrates sparklyr with PySpark and Databricks. The package exists because the newer Spark Connect and Databricks Connect connection methods do not work with the standard sparklyr integration.

Installing

To install the released version from CRAN, use:

install.packages("pysparklyr")

To get the development version from GitHub use:

remotes::install_github("mlverse/pysparklyr")

Using

To learn how to use the package, see the "Spark Connect" and "Databricks Connect v2" articles on the official sparklyr website.
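As a rough illustration of what a connection might look like once the package is installed, the sketch below wraps the two connection methods in small helper functions. The helper names connect_spark_connect and connect_databricks, the sc://localhost address, and the Spark version "3.5" are assumptions made for this example and are not taken from this page. The Databricks helper further assumes that credentials are available through the DATABRICKS_HOST and DATABRICKS_TOKEN environment variables, and both helpers assume a Python environment has already been prepared (for example with install_pyspark).

```r
# Hypothetical helper: connect to a Spark Connect server.
# Assumes a server is reachable at `master` and that `version`
# matches the Spark version of that server.
connect_spark_connect <- function(master = "sc://localhost", version = "3.5") {
  sparklyr::spark_connect(
    master = master,
    method = "spark_connect",
    version = version
  )
}

# Hypothetical helper: connect to a Databricks cluster through
# Databricks Connect. Assumes DATABRICKS_HOST and DATABRICKS_TOKEN
# are set in the environment; `cluster_id` is supplied by the caller.
connect_databricks <- function(cluster_id, version = NULL) {
  sparklyr::spark_connect(
    cluster_id = cluster_id,
    method = "databricks_connect",
    version = version
  )
}

# Example usage (not run here, since it needs a live server):
# sc <- connect_spark_connect()
# tbl_cars <- dplyr::copy_to(sc, mtcars)
# sparklyr::spark_disconnect(sc)
```

Keeping the connection calls inside functions means nothing is contacted until a helper is actually called, which makes the sketch safe to source on a machine without Spark.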

Monthly Downloads

1,099

Version

0.2.0

License

MIT + file LICENSE

Maintainer

Edgar Ruiz

Last Published

January 27th, 2026

Functions in pysparklyr (0.2.0)

install_pyspark
Installs PySpark and Python dependencies

requirements_write
Writes the 'requirements.txt' file, containing the needed Python libraries

deploy_databricks
Deploys Databricks backed content to publishing server

pysparklyr-package
pysparklyr: Provides a 'PySpark' Back-End for the 'sparklyr' Package

connection_databricks_shinyapp
A Shiny app that can be used to construct a spark_connect statement

ml_prepare_dataset
Creates the 'label' and 'features' columns

spark_connect_service_start
Starts and stops Spark Connect locally

pyspark_config
Read Spark configuration

installed_components
Lists installed Python libraries