spark_read_warc_sample: Loads the sample warc file in Spark
Description
Loads the sample warc file in Spark
Usage
spark_read_warc_sample(sc, filter = "", include = "")
Arguments
sc
An active spark_connection
.
filter
A regular expression used to filter to each warc entry
efficiently by running native code using Rcpp
.
include
A regular expression used to keep only matching lines
efficiently by running native code using Rcpp
.