Learn R Programming

sparkwarc (version 0.1.6)

spark_read_warc_sample: Loads the sample warc file in Spark

Description

Loads the sample warc file in Spark

Usage

spark_read_warc_sample(sc, filter = "", include = "")

Arguments

sc

An active spark_connection.

filter

A regular expression used to filter to each warc entry efficiently by running native code using Rcpp.

include

A regular expression used to keep only matching lines efficiently by running native code using Rcpp.